Knowledge Rag
K

Knowledge Rag

The Knowledge RAG System is a local retrieval - augmented generation system that integrates with Claude Code through the MCP protocol. It supports hybrid search (semantics + keywords) for multiple document formats such as PDF and Markdown, and provides keyword routing functions, suitable for personal knowledge base management.
2.5 points
4.9K

What is the Knowledge RAG System?

Knowledge RAG is a 100% locally - run intelligent document search system specifically designed for Claude Code. It can understand the content in your documents. When you ask Claude a question, the system will automatically search for relevant document fragments and provide them to Claude as a reference for answering. Imagine you have a folder containing various technical documents, notes, and code examples. When you ask Claude a technical question, the system will automatically find the most relevant information from your documents, making Claude's answers more accurate and personalized.

How to use the Knowledge RAG System?

It's very easy to use: 1. Put your documents into the documents folder by category. 2. Start Claude Code. 3. Ask Claude questions directly. The system will automatically search your documents in the background, find relevant information, and integrate it into Claude's answers. You don't need to manually search or copy - paste document content.

Applicable scenarios

• Technical document query: Quickly find API documentation and configuration instructions. • Security research: Search for vulnerability exploitation and penetration testing methods. • Study notes: Find previous notes and knowledge points. • Code reference: Search for code examples and best practices. • Personal knowledge base: Manage documents and materials in various formats.

Main features

Hybrid intelligent search
Combines semantic understanding (understanding the meaning of concepts) and keyword matching (precise term search) to provide the most accurate search results. You can adjust the search strategy through the hybrid_alpha parameter.
Fully local operation
All data processing is completed on your computer. Document content will not be uploaded to any cloud server, ensuring privacy and security.
Support for multiple document formats
Supports multiple file formats such as PDF, Markdown, plain text, Python code, and JSON, and automatically parses the content.
Intelligent classification routing
Automatically determines the document category (such as security, development, CTF, etc.) based on the keywords in the question, improving search accuracy.
Seamless integration with Claude
Integrated directly into Claude Code as an MCP server. It automatically searches when you ask questions, eliminating the need to switch tools.
Fast index update
After documents are added or modified, the index can be quickly re - built, supporting parallel processing to improve speed.
Advantages
🔒 Privacy protection: All data is processed locally and not uploaded to the cloud.
⚡ Flexible search: Supports multiple search modes from pure keywords to pure semantics.
📚 Format compatibility: Supports multiple document formats such as PDF, Markdown, and code.
🎯 Intelligent classification: Automatically identifies the question type and accurately searches for relevant documents.
🚀 Fast response: The keyword search mode returns results almost instantly.
🔄 Easy to update: Documents can be quickly re - indexed after changes.
Limitations
💻 Requires local resources: Requires running Ollama and Python environments.
📦 Initial setup: Python and Ollama need to be configured during the first installation.
🔧 Technical dependency: Basic command - line operation knowledge is required.
💾 Storage occupation: The vector database will occupy a certain amount of disk space.
🐢 Slow semantic search: Pure semantic search requires generating embedding vectors, which is slower.

How to use

Installation preparation
Make sure your computer has Python 3.11 or 3.12 installed, and download and install Ollama.
Download and install the system
Clone the project repository and run the installation script to automatically set up the virtual environment and dependencies.
Download the embedding model
Download the model for understanding document semantics in Ollama.
Configure Claude Code
Edit the Claude configuration file and add the MCP server configuration.
Add your documents
Put your documents into the documents folder by category.
Start using
Restart Claude Code. The system will automatically index the documents, and then you can directly ask questions.

Usage examples

Technical document query
When you need to find the specific usage or configuration parameters of an API
Security research reference
When conducting penetration testing or security research, you need to refer to specific vulnerability exploitation methods
Code example search
You need code examples or best practices for a programming task
Study note review
Review previously learned knowledge points or technical concepts

Frequently Asked Questions

Do I need to upload my documents to the cloud?
What types of documents are supported?
What is the search speed like?
How to add new documents?
Do I need to be connected to the Internet to use it?
Can I search Chinese documents?
Is there a limit to the number of documents?
How to optimize search results?

Related resources

GitHub repository
Project source code and latest version
Ollama official website
Local large - model running environment
Python download
Python programming language
Claude Code documentation
Claude Code usage guide
MCP protocol documentation
Model Context Protocol official documentation

Installation

Copy the following command to your Client for configuration
{
     "mcpServers": {
       "knowledge-rag": {
         "type": "stdio",
         "command": "cmd",
         "args": ["/c", "cd /d C:\\path\\to\\knowledge-rag && .\\venv\\Scripts\\python.exe -m mcp_server.server"],
         "env": {}
       }
     }
   }
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

A
Airweave
Airweave is an open - source context retrieval layer for AI agents and RAG systems. It connects and synchronizes data from various applications, tools, and databases, and provides relevant, real - time, multi - source contextual information to AI agents through a unified search interface.
Python
6.2K
5 points
V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
4.9K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
5.5K
4.5 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
6.1K
4.5 points
H
Haiku.rag
Haiku RAG is an intelligent retrieval - augmented generation system built on LanceDB, Pydantic AI, and Docling. It supports hybrid search, re - ranking, Q&A agents, multi - agent research processes, and provides local - first document processing and MCP server integration.
Python
10.4K
5 points
C
Claude Context
Claude Context is an MCP plugin that provides in - depth context of the entire codebase for AI programming assistants through semantic code search. It supports multiple embedding models and vector databases to achieve efficient code retrieval.
TypeScript
16.9K
5 points
A
Acemcp
Acemcp is an MCP server for codebase indexing and semantic search, supporting automatic incremental indexing, multi-encoding file processing, .gitignore integration, and a Web management interface, helping developers quickly search for and understand code context.
Python
17.1K
5 points
M
MCP
The Microsoft official MCP server provides search and access functions for the latest Microsoft technical documentation for AI assistants
15.0K
5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.7K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
34.6K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.6K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.5K
4.5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
63.8K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
32.5K
5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
21.1K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
49.5K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase