Documentation Crawler
D

Documentation Crawler

A development documentation server based on the MCP protocol, providing functions such as document crawling, local loading, precise search, and detail retrieval, to solve the document hallucination problem in AI development.
2.5 points
15.3K

What is McpDocServer?

McpDocServer is a development documentation server based on the MCP protocol, specifically designed for various development framework documents. It can provide functions such as multi-threaded document crawling, local document loading, keyword search, and document detail retrieval.

How to use McpDocServer?

First, you need to install and configure McpDocServer. Then, specify the document source through the configuration file and run the crawler script. Finally, start the server and integrate it into your development environment for use.

Applicable scenarios

It is suitable for developers who need to quickly find and use development framework documents, especially when collaborating across frameworks or learning new frameworks.

Main features

Multi-threaded document crawling
Supports simultaneous content crawling from multiple document sources to improve crawling efficiency.
Local document loading
Supports loading local document data in JSON format to reduce network dependency.
Keyword search
Supports quick search of document content by keywords.
Document detail retrieval
Provides an interface for querying detailed document information.
Document reloading
Supports triggering document reloading through special keywords.
Advantages
Efficiently crawl document content
Support multiple document formats
Provide accurate document search function
Limitations
Requires a certain technical background for configuration
Some complex documents may not be fully parsed

How to use

Installation and running
Ensure that the Chrome browser is installed and set the PUPPETEER_SKIP_DOWNLOAD environment variable. Run npm install to complete the installation.
Crawl document data
Configure the crawler and run npm run crawl to crawl documents.
Start the MCP server
Run npm start to start the server.
Run tests
Run npm test to verify the server's functionality.

Usage examples

Document crawling
By configuring the crawler, the documents of the Taro framework were successfully crawled and stored.
Document search
Detailed information about components in the Taro framework was obtained through keyword search.
Document detail retrieval
The detailed description of a specific component in the Taro framework was obtained through the document ID.

Frequently Asked Questions

How to solve the Cursor hallucination problem?
How to customize the crawler behavior?
Does it support multi-language documents?

Related resources

McpDocServer GitHub repository
The open-source code repository of McpDocServer.
MCP Protocol official website
The official website of the MCP protocol.

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "文档 MCP 服务器": {
      "command": "node",
      "args": ["/绝对路径/server.js"],
      "env": { "NODE_ENV": "development" }
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

A
Airweave
Airweave is an open - source context retrieval layer for AI agents and RAG systems. It connects and synchronizes data from various applications, tools, and databases, and provides relevant, real - time, multi - source contextual information to AI agents through a unified search interface.
Python
15.1K
5 points
V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
9.4K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
10.0K
4.5 points
B
Bm.md
A feature-rich Markdown typesetting tool that supports multiple style themes and platform adaptation, providing real-time editing preview, image export, and API integration capabilities
TypeScript
14.7K
5 points
S
Security Detections MCP
Security Detections MCP is a server based on the Model Context Protocol that allows LLMs to query a unified security detection rule database covering Sigma, Splunk ESCU, Elastic, and KQL formats. The latest version 3.0 is upgraded to an autonomous detection engineering platform that can automatically extract TTPs from threat intelligence, analyze coverage gaps, generate SIEM-native format detection rules, run tests, and verify. The project includes over 71 tools, 11 pre-built workflow prompts, and a knowledge graph system, supporting multiple SIEM platforms.
TypeScript
6.7K
4 points
P
Paperbanana
Python
8.9K
5 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
10.7K
4.5 points
A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
9.9K
5 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
39.0K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
81.2K
4.3 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
27.2K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
24.8K
4.5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
69.4K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
37.3K
5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
24.9K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
56.2K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase