Memory Cache

An MCP service that reduces token consumption in language model interactions through efficient data caching

Developer tools Knowledge management and memory #Cache optimization #Token saving #MCP service Local .TypeScript

rating : 2.5 points

downloads : 4.9K

update time : 2025-04-28

Open Site

What is an in-memory cache server?

An in-memory cache server is a tool based on the MCP (Model Context Protocol) designed to reduce token consumption by storing reusable data. It can automatically cache data when you interact with the language model, thereby improving efficiency and saving costs.

How to use an in-memory cache server?

Simply install and run the in-memory cache server, and then configure it in your MCP client. No manual intervention is required, and the cache will automatically handle all data.

Applicable scenarios

The in-memory cache server is particularly suitable for application scenarios that require frequent access to the same data, such as file reading, data analysis, and project navigation.

Main features

Automatic caching

When you interact with the language model, the in-memory cache server will automatically save commonly used data and directly return the cached content on the next request.

Dynamic cleaning

Automatically remove expired or infrequently used cache entries according to the set rules to ensure that the memory does not grow indefinitely.

Multi-client compatibility

Supports any client that follows the MCP protocol and can be easily integrated into the existing workflow.

Advantages

Significantly reduce token consumption and lower costs

Improve response speed and enhance the user experience

Easy to use without additional configuration

Limitations

May not show obvious effects for one-time tasks

Requires reasonable setting of the cache size to balance performance and memory usage

How to use

Install the in-memory cache server

You can choose to install it automatically through Smithery or manually clone the code and deploy it locally.

Configure the MCP client

Add the in-memory cache server to your MCP client settings and specify its path and parameters.

Start the server

After the configuration is completed, the in-memory cache server will automatically run in the background.

Usage examples

File reading test

A certain amount of tokens will be consumed when reading a large file for the first time, and the second reading will directly obtain the data from the cache.

Data processing test

After performing complex calculations on a set of data, subsequent requests can directly reference the results in the cache.

Frequently Asked Questions

How to check if the cache is working properly?

Will the cache grow indefinitely?

What data will be cached?

Related resources

Official documentation

Learn more about the in-memory cache server.

GitHub repository

View the source code and contribution guidelines.

Video tutorial

Watch the demonstration video to learn how to get started quickly.

🚀 Memory Cache Server

A Model Context Protocol (MCP) server that reduces token consumption by efficiently caching data between language model interactions. It is suitable for any MCP client and any language model that uses tokens.

🚀 Quick Start

📦 Installation

Automatic Installation via Smithery

To automatically install the memory cache server for Claude Desktop via Smithery:

npx -y @smithery/cli install @tosin2013/mcp-memory-cache-server --client claude

Manual Installation

Clone the repository:

git clone https://github.com/tosin2013/mcp-memory-cache-server.git
cd mcp-memory-cache-server

Install dependencies:

npm install

Build the project:

npm run build

⚙️ Configuration

Using a Configuration File

Create a config.json file with the following content:

{
  "max_entries": 1000,
  "max_memory": 104857600, // 100MB
  "default_ttl": 3600,     // 1 hour
  "check_interval": 60000,  // 1 minute
  "stats_interval": 120000  // 2 minutes
}

Environment Variable Configuration

You can set environment variables in the MCP client section of package.json:

{
  "mcpServers": {
    "memory-cache": {
      "command": "node",
      "args": ["dist/index.js"],
      "env": {
        "MAX_ENTRIES": "5000",
        "MAX_MEMORY": "209715200", // 200MB
        "DEFAULT_TTL": "7200",     // 2 hours
        "CHECK_INTERVAL": "360000"  // 1 hour
      }
    }
  }
}