Pdf MCP

A Python - based MCP server that provides functions for reading, searching, and extracting content from PDF documents. It supports paginated reading, full - text search, and image extraction, and uses an SQLite cache for persistent storage.

Research and data Developer tools #PDF processing #Document parsing #Content extraction #Cache management .Python

rating : 2.5 points

downloads : 10.2K

update time : 2026-03-13

Open Site

What is PDF-MCP?

PDF-MCP is a Model Context Protocol (MCP) server specifically designed to handle PDF documents. It allows AI assistants (such as Claude, Copilot, etc.) to directly access and manipulate PDF files, including reading content, searching for keywords, extracting images, and obtaining document information. Through an intelligent caching mechanism, the cache of processed documents is retained even when the server is restarted, improving the efficiency of repeated access.

How to use PDF-MCP?

PDF-MCP runs as a background service and needs to be used in conjunction with an AI client that supports the MCP protocol. After installation, add the server configuration to the client configuration file and restart the client to use it. The AI assistant will automatically recognize the available PDF tools, and users can operate on PDF documents through natural language instructions.

Applicable scenarios

PDF-MCP is particularly suitable for scenarios such as long - document analysis, research report reading, contract review, academic paper summarization, and multi - document information extraction. When you need to quickly obtain specific information from a PDF without manually flipping through the pages, this tool can significantly improve efficiency.

Main features

Intelligent paginated reading

Supports reading PDF content by page range, avoiding context overflow caused by loading large documents at once. You can specify single pages, multiple pages, or continuous page ranges.

Full - text search

Search for keywords or phrases in PDF documents and quickly locate the pages where the relevant content is located, without the need to manually flip through the entire document.

Image extraction

Extract embedded images from PDFs and return them in base64 - encoded PNG format, facilitating AI assistants to analyze and describe the image content.

Document information retrieval

Retrieve the metadata of a PDF, including the number of pages, file size, creation date, author, title, and other information, as well as the estimated number of tokens.

Table of contents parsing

Automatically parse the table of contents structure of a PDF, display chapter titles and corresponding page numbers, and help quickly navigate to the desired section.

URL support

Not only supports local PDF files but also can directly load remote PDF documents from HTTP/HTTPS URLs without first downloading them to the local machine.

SQLite persistent cache

Use an SQLite database to cache the processed PDF content. The cached data is retained even after the server is restarted, significantly improving the speed of repeated access.

Multi - client support

Compatible with various AI clients that support the MCP protocol, such as Claude Desktop, VS Code Copilot, Codex CLI, and Kiro.

Advantages

More efficient in handling large documents: Paginated reading avoids context limitations, and intelligent search quickly locates information.

Performance optimization: SQLite cache reduces repeated parsing and improves response speed.

Easy to use: Can be operated through natural language instructions without learning complex commands.

Cross - session persistence: Cached data remains valid after the server is restarted.

Multifunctional integration: Eight dedicated tools cover common PDF processing needs.

Limitations

Requires client support: Must use an AI assistant that supports the MCP protocol.

Limited support for scanned PDFs: The text recognition ability for image - based PDFs depends on the quality of the original document.

Complex table processing: The extraction of complex - format tables may not be perfect.

Memory limitation: Extremely large files (hundreds of MB) may be limited by system memory.

Requires configuration: Simple configuration is required on the client for initial use.

How to use

Install PDF-MCP

Install the PDF-MCP server via the Python package manager pip.

Configure the AI client

Add the PDF-MCP server configuration to the configuration file according to the AI client you are using (Claude, VS Code, etc.).

Restart the client

Restart the AI client to load the PDF-MCP server.

Start using

Operate on PDF documents through natural language instructions in the AI assistant.

Usage examples

Annual report analysis

Analyze the company's annual report and extract key financial data and risk factors.

Academic paper research

Quickly browse multiple academic papers and extract research methods and conclusions.

Contract review

Review the key terms and potential risks in a contract document.

Image data organization

Extract all product images and descriptions from a product manual.

Frequently Asked Questions

Which AI clients does PDF-MCP support?

What is the maximum size of PDF files that can be processed?

Where is the cached data stored?

How to handle scanned PDFs?

How to clear the cache?

Does it support Chinese PDFs?

Related resources

GitHub repository

Source code, issue tracking, and the latest version of PDF-MCP.

PyPI project page

Project page on the Python Package Index, including version history and download statistics.

MCP protocol documentation

Official documentation and specifications of the Model Context Protocol.

How to build PDF-MCP

Developer blog post introducing the design concept and implementation details of PDF-MCP.

MCP server security guide

In - depth article on the best security practices for MCP servers.

🚀 pdf-mcp

A Model Context Protocol (MCP) server that enables AI agents to read, search, and extract content from PDF files. Built with Python and PyMuPDF, with SQLite-based caching for persistence across server restarts.

mcp-name: io.github.jztan/pdf-mcp

✨ Features

8 specialized tools for different PDF operations
SQLite caching — persistent cache survives server restarts (essential for STDIO transport)
Paginated reading — read large PDFs in manageable chunks
Full-text search — find content without loading the entire document
Image extraction — extract images as base64 PNG
URL support — read PDFs from HTTP/HTTPS URLs

📦 Installation

pip install pdf-mcp

🚀 Quick Start

Claude Code

claude mcp add pdf-mcp -- pdf-mcp

Or add to ~/.claude.json:

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

Config file location:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

Restart Claude Desktop after updating the config.

Visual Studio Code

Requires VS Code 1.102+ with GitHub Copilot.

CLI:

code --add-mcp '{"name":"pdf-mcp","command":"pdf-mcp"}'

Command Palette:

Open Command Palette (Cmd/Ctrl+Shift+P)
Run MCP: Open User Configuration (global) or MCP: Open Workspace Folder Configuration (project-specific)

Add the configuration:

{
  "servers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

Save. VS Code will automatically load the server.

Manual: Create .vscode/mcp.json in your workspace:

{
  "servers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

Codex CLI

codex mcp add pdf-mcp -- pdf-mcp

Or configure manually in ~/.codex/config.toml:

[mcp_servers.pdf-mcp]
command = "pdf-mcp"

Kiro

Create or edit .kiro/settings/mcp.json in your workspace:

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp",
      "args": [],
      "disabled": false
    }
  }
}

Save and restart Kiro.

Other MCP Clients

Most MCP clients use a standard configuration format:

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "pdf-mcp"
    }
  }
}

With uvx (for isolated environments):

{
  "mcpServers": {
    "pdf-mcp": {
      "command": "uvx",
      "args": ["pdf-mcp"]
    }
  }
}

Verify Installation

pdf-mcp --help

💻 Usage Examples

Tools

`pdf_info` — Get Document Information

Returns page count, metadata, table of contents, file size, and estimated token count. Call this first to understand a document before reading it.

"Read the PDF at /path/to/document.pdf"

`pdf_read_pages` — Read Specific Pages

Read selected pages to manage context size.

"Read pages 1-10 of the PDF"
"Read pages 15, 20, and 25-30"

`pdf_read_all` — Read Entire Document

Read a complete document in one call. Subject to a safety limit on page count.

"Read the entire PDF (it's only 10 pages)"

`pdf_search` — Search Within PDF

Find relevant pages before loading content.

"Search for 'quarterly revenue' in the PDF"

`pdf_get_toc` — Get Table of Contents

"Show me the table of contents"

`pdf_extract_images` — Extract Images

"Extract images from pages 1-5"

`pdf_cache_stats` — View Cache Statistics

"Show PDF cache statistics"

`pdf_cache_clear` — Clear Cache

"Clear expired PDF cache entries"

Example Workflow

For a large document (e.g., a 200-page annual report):

User: "Summarize the risk factors in this annual report"

Agent workflow:
1. pdf_info("report.pdf")
   → 200 pages, TOC shows "Risk Factors" on page 89

2. pdf_search("report.pdf", "risk factors")
   → Relevant pages: 89-110

3. pdf_read_pages("report.pdf", "89-100")
   → First batch

4. pdf_read_pages("report.pdf", "101-110")
   → Second batch

5. Synthesize answer from chunks

🔧 Technical Details

Caching

The server uses SQLite for persistent caching. This is necessary because MCP servers using STDIO transport are spawned as a new process for each conversation.

Cache location: ~/.cache/pdf-mcp/cache.db

Data	Benefit
Metadata	Avoid re-parsing document info
Page text	Skip re-extraction
Images	Skip re-encoding
TOC	Skip re-parsing

Cache invalidation:

Automatic when file modification time changes
Manual via the pdf_cache_clear tool
TTL: 24 hours (configurable)

Configuration

Environment variables:

# Cache directory (default: ~/.cache/pdf-mcp)
PDF_MCP_CACHE_DIR=/path/to/cache

# Cache TTL in hours (default: 24)
PDF_MCP_CACHE_TTL=48

Development

git clone https://github.com/jztan/pdf-mcp.git
cd pdf-mcp

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest tests/ -v

# Type checking
mypy src/

# Linting
flake8 src/

# Formatting
black src/

Why pdf-mcp?

	Without pdf-mcp	With pdf-mcp
Large PDFs	Context overflow	Chunked reading
Repeated access	Re-parse every time	SQLite cache
Finding content	Load everything	Search first
Tool design	Single monolithic tool	8 specialized tools

🤝 Contributing

Contributions are welcome. Please submit a pull request.

📄 License

MIT — see LICENSE.

🔗 Links

PyPI
GitHub
MCP Documentation
How I Built pdf-mcp — The story behind this project
MCP Server Security: 8 Vulnerabilities — Security lessons from building MCP servers

pdf_info

Get PDF document information, including metadata, number of pages, and table of contents. It is recommended to call this command first to understand the document structure.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

pdf_read_pages

Read the text content of specific pages in a PDF. Use the page range to control the amount of loaded content.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

Parameters

pages : str*

Description

Page specification: '1 - 10' (pages 1 to 10), '1,5,10' (pages 1, 5, 10), '1 - 5,10,15 - 20' (combined range)

Parameters

include_images : bool*

Description

If True, extract images in base64 format (will increase the response size)

pdf_read_all

Read the entire PDF document. Only suitable for small documents. For large documents, use pdf_read_pages.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

Parameters

max_pages : int*

Description

Maximum number of pages to read (safety limit, default 50, maximum 500)

pdf_search

Search for text within a PDF document. Used to find relevant pages before reading the full content.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

Parameters

query : str*

Description

Text to search for (case - insensitive)

Parameters

max_results : int*

Description

Maximum number of matches to return (default 10, maximum 100)

Parameters

context_chars : int*

Description

Number of context characters around each match (default 200, maximum 2000)

pdf_get_toc

Get the table of contents (bookmarks/outline) from a PDF. Used to understand the document structure and navigate to specific sections.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

pdf_extract_images

Extract images from PDF pages as base64 - encoded PNG format.

Parameters

path : str*

Description

PDF file path (absolute path, relative path, or URL)

Parameters

pages : str | None*

Description

Page specification (default: all pages). The format is the same as pdf_read_pages.

Parameters

max_images : int*

Description

Maximum number of images to extract (default 20, maximum 50)

pdf_cache_stats

Get PDF cache statistics.

pdf_cache_clear

Clear the PDF cache.

Parameters

expired_only : bool*

Description

If True, only clear expired entries. If False, clear all content.

Markdownify MCP

Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.

A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.

The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.

TypeScript

30.9K

4.3 points

Duckduckgo MCP Server

Certified

The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.

UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.

41.0K

5 points

Figma Context MCP

Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.

Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.

The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.

Python

64.1K

4.8 points

Zhiqi Future, Your AI Solution Think Tank

English 简体中文繁體中文にほんご

Pdf MCP

Overview

Installation

Tools List

Content Details

Alternatives

What is PDF-MCP?

How to use PDF-MCP?

Applicable scenarios

Main features

How to use

Usage examples

Frequently Asked Questions

Related resources

Installation

🚀 pdf-mcp

✨ Features

📦 Installation

🚀 Quick Start

Verify Installation

💻 Usage Examples

Tools

`pdf_info` — Get Document Information

`pdf_read_pages` — Read Specific Pages

`pdf_read_all` — Read Entire Document

`pdf_search` — Search Within PDF

`pdf_get_toc` — Get Table of Contents

`pdf_extract_images` — Extract Images

`pdf_cache_stats` — View Cache Statistics

`pdf_cache_clear` — Clear Cache

Example Workflow

🔧 Technical Details

Caching

Configuration

Development

Why pdf-mcp?

🤝 Contributing

📄 License

🔗 Links

Alternatives

Pdf MCP

Overview

Installation

Tools List

Content Details

Alternatives

What is PDF-MCP?

How to use PDF-MCP?

Applicable scenarios

Main features

How to use

Usage examples

Frequently Asked Questions

Related resources

Installation

🚀 pdf-mcp

✨ Features

📦 Installation

🚀 Quick Start

Verify Installation

💻 Usage Examples

Tools

pdf_info — Get Document Information

pdf_read_pages — Read Specific Pages

pdf_read_all — Read Entire Document

pdf_search — Search Within PDF

pdf_get_toc — Get Table of Contents

pdf_extract_images — Extract Images

pdf_cache_stats — View Cache Statistics

pdf_cache_clear — Clear Cache

Example Workflow

🔧 Technical Details

Caching

Configuration

Development

Why pdf-mcp?

🤝 Contributing

📄 License

🔗 Links

Alternatives

`pdf_info` — Get Document Information

`pdf_read_pages` — Read Specific Pages

`pdf_read_all` — Read Entire Document

`pdf_search` — Search Within PDF

`pdf_get_toc` — Get Table of Contents

`pdf_extract_images` — Extract Images

`pdf_cache_stats` — View Cache Statistics

`pdf_cache_clear` — Clear Cache