Pdf MCP
A Python - based MCP server that provides functions for reading, searching, and extracting content from PDF documents. It supports paginated reading, full - text search, and image extraction, and uses an SQLite cache for persistent storage.
2.5 points
6.6K

What is PDF-MCP?

PDF-MCP is a Model Context Protocol (MCP) server specifically designed to handle PDF documents. It allows AI assistants (such as Claude, Copilot, etc.) to directly access and manipulate PDF files, including reading content, searching for keywords, extracting images, and obtaining document information. Through an intelligent caching mechanism, the cache of processed documents is retained even when the server is restarted, improving the efficiency of repeated access.

How to use PDF-MCP?

PDF-MCP runs as a background service and needs to be used in conjunction with an AI client that supports the MCP protocol. After installation, add the server configuration to the client configuration file and restart the client to use it. The AI assistant will automatically recognize the available PDF tools, and users can operate on PDF documents through natural language instructions.

Applicable scenarios

PDF-MCP is particularly suitable for scenarios such as long - document analysis, research report reading, contract review, academic paper summarization, and multi - document information extraction. When you need to quickly obtain specific information from a PDF without manually flipping through the pages, this tool can significantly improve efficiency.

Main features

Intelligent paginated reading
Supports reading PDF content by page range, avoiding context overflow caused by loading large documents at once. You can specify single pages, multiple pages, or continuous page ranges.
Full - text search
Search for keywords or phrases in PDF documents and quickly locate the pages where the relevant content is located, without the need to manually flip through the entire document.
Image extraction
Extract embedded images from PDFs and return them in base64 - encoded PNG format, facilitating AI assistants to analyze and describe the image content.
Document information retrieval
Retrieve the metadata of a PDF, including the number of pages, file size, creation date, author, title, and other information, as well as the estimated number of tokens.
Table of contents parsing
Automatically parse the table of contents structure of a PDF, display chapter titles and corresponding page numbers, and help quickly navigate to the desired section.
URL support
Not only supports local PDF files but also can directly load remote PDF documents from HTTP/HTTPS URLs without first downloading them to the local machine.
SQLite persistent cache
Use an SQLite database to cache the processed PDF content. The cached data is retained even after the server is restarted, significantly improving the speed of repeated access.
Multi - client support
Compatible with various AI clients that support the MCP protocol, such as Claude Desktop, VS Code Copilot, Codex CLI, and Kiro.
Advantages
More efficient in handling large documents: Paginated reading avoids context limitations, and intelligent search quickly locates information.
Performance optimization: SQLite cache reduces repeated parsing and improves response speed.
Easy to use: Can be operated through natural language instructions without learning complex commands.
Cross - session persistence: Cached data remains valid after the server is restarted.
Multifunctional integration: Eight dedicated tools cover common PDF processing needs.
Limitations
Requires client support: Must use an AI assistant that supports the MCP protocol.
Limited support for scanned PDFs: The text recognition ability for image - based PDFs depends on the quality of the original document.
Complex table processing: The extraction of complex - format tables may not be perfect.
Memory limitation: Extremely large files (hundreds of MB) may be limited by system memory.
Requires configuration: Simple configuration is required on the client for initial use.

How to use

Install PDF-MCP
Install the PDF-MCP server via the Python package manager pip.
Configure the AI client
Add the PDF-MCP server configuration to the configuration file according to the AI client you are using (Claude, VS Code, etc.).
Restart the client
Restart the AI client to load the PDF-MCP server.
Start using
Operate on PDF documents through natural language instructions in the AI assistant.

Usage examples

Annual report analysis
Analyze the company's annual report and extract key financial data and risk factors.
Academic paper research
Quickly browse multiple academic papers and extract research methods and conclusions.
Contract review
Review the key terms and potential risks in a contract document.
Image data organization
Extract all product images and descriptions from a product manual.

Frequently Asked Questions

Which AI clients does PDF-MCP support?
What is the maximum size of PDF files that can be processed?
Where is the cached data stored?
How to handle scanned PDFs?
How to clear the cache?
Does it support Chinese PDFs?

Related resources

GitHub repository
Source code, issue tracking, and the latest version of PDF-MCP.
PyPI project page
Project page on the Python Package Index, including version history and download statistics.
MCP protocol documentation
Official documentation and specifications of the Model Context Protocol.
How to build PDF-MCP
Developer blog post introducing the design concept and implementation details of PDF-MCP.
MCP server security guide
In - depth article on the best security practices for MCP servers.

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp",
      "args": [],
      "disabled": false
    }
  }
}

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "uvx",
      "args": ["pdf-mcp"]
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
6.4K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
5.0K
4.5 points
B
Bm.md
A feature-rich Markdown typesetting tool that supports multiple style themes and platform adaptation, providing real-time editing preview, image export, and API integration capabilities
TypeScript
4.3K
5 points
S
Security Detections MCP
Security Detections MCP is a server based on the Model Context Protocol that allows LLMs to query a unified security detection rule database covering Sigma, Splunk ESCU, Elastic, and KQL formats. The latest version 3.0 is upgraded to an autonomous detection engineering platform that can automatically extract TTPs from threat intelligence, analyze coverage gaps, generate SIEM-native format detection rules, run tests, and verify. The project includes over 71 tools, 11 pre-built workflow prompts, and a knowledge graph system, supporting multiple SIEM platforms.
TypeScript
6.5K
4 points
P
Paperbanana
Python
6.8K
5 points
F
Finlab Ai
FinLab AI is a quantitative financial analysis platform that helps users discover excess returns (alpha) in investment strategies through AI technology. It provides a rich dataset, backtesting framework, and strategy examples, supporting automated installation and integration into mainstream AI programming assistants.
7.3K
4 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
6.6K
4.5 points
A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
6.7K
5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
26.0K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
21.6K
4.5 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
34.9K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
73.6K
4.3 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
64.4K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
32.8K
5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.2K
4.5 points
C
Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
97.6K
4.7 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase