Zotero Chunk MCP
Z

Zotero Chunk MCP

DeepZotero is a tool that provides semantic search functionality for the Zotero literature library. It extracts text, tables, and images from PDFs, chunks, embeds, and stores them in ChromaDB. Finally, it provides 13 types of semantic search, Boolean search, table/image search, context expansion, citation graph query, index management, and cost - tracking tools to clients such as Claude Code through the MCP server.
2.5 points
6.2K

What is DeepZotero?

DeepZotero is an enhanced academic literature management tool based on semantic search. It can deeply analyze PDF documents in your Zotero literature library, extract text content, table data, and chart information, and perform intelligent indexing through artificial intelligence technology. You can search your literature library in natural language, just like having a conversation with an assistant, and quickly find relevant research content, data tables, or charts.

How to use DeepZotero?

Using DeepZotero mainly consists of three steps: First, install and configure the necessary API keys. Then, run the indexing program to analyze your literature library. Finally, use various search tools through Claude Code or other MCP clients. The entire process is highly automated, and even non - technical users can easily get started.

Applicable scenarios

DeepZotero is particularly suitable for researchers, students, and scholars. You may need it when: 1. Quickly find relevant research on a specific topic in a large number of literatures. 2. Find papers containing specific data tables. 3. Track the citation relationships of a certain concept. 4. Organize and review the research progress in a certain field. 5. Find relevant literatures and evidence for writing papers.

Main features

Intelligent semantic search
Search literature content using natural language, understand the semantic meaning of the query, rather than just keyword matching. You can search text paragraphs, table content, and chart descriptions.
Intelligent table extraction
Use AI vision technology to accurately extract table data from PDFs, convert it into a structured format, and support searching by table content.
Chart recognition and search
Automatically recognize charts and their titles in PDFs, extract them as searchable image files, and support searching by chart descriptions.
Intelligent re - ranking
Search results are not only based on similarity but also consider the importance of literature chapters (e.g., the results section has a higher weight) and journal quality, providing a more relevant ranking.
Citation relationship analysis
Find the citation relationships of literature through the OpenAlex database, understand which papers cite a certain literature, and which other studies are cited by this literature.
Context expansion
After finding relevant paragraphs, you can view their context content. For table results, you can also find relevant discussions in the text that refer to the table.
13 search tools
Provide diverse search tools, including semantic search, topic search, table search, chart search, Boolean search, etc., to meet different search needs.
Advantages
Intelligent search: Understand semantics and find truly relevant content, not just keyword matching.
Comprehensive coverage: Search text, tables, and charts simultaneously, providing a complete literature analysis.
Easy to use: Interact through natural language without learning complex query syntax.
Incremental indexing: Only process new literatures, saving time and computing resources.
Highly configurable: You can adjust parameters such as search weights and chunk sizes according to your needs.
Limitations
Requires API keys: Using the Gemini and Anthropic APIs requires registration and obtaining keys.
Initial indexing is time - consuming: Analyzing a large number of literatures for the first time takes a long time.
Depends on Zotero: You need to have Zotero installed and have a certain number of literatures.
Visual extraction cost: Using AI to extract tables incurs a small fee.
Technical requirements: Installation and configuration require basic command - line operation capabilities.

How to use

Installation preparation
Ensure that Python 3.10+ and Zotero are installed, and prepare the necessary API keys.
Configuration settings
Create a configuration file and fill in the path to your Zotero data directory and API keys.
Index the literature library
Run the indexing program to analyze your literature library. This process will automatically extract and index all PDF content.
Configure the MCP client
Add the DeepZotero server configuration in the Claude Code settings. You can use it after restarting.
Start searching
Use natural language to directly search your literature library in Claude Code, such as 'Find recent research on the impact of climate change on agriculture'.

Usage examples

Literature review preparation
When you need to write a literature review for a certain research topic, you can use DeepZotero to quickly find all relevant research and sort them by importance.
Data search
When you need to cite specific data in your writing, you can search for tables containing that data.
Citation tracking
When you find a key literature and want to understand its influence and subsequent research, you can view the citation relationships.
Cross - literature concept search
When you want to understand the expression and application of a certain concept in different studies, you can conduct a cross - literature search.

Frequently Asked Questions

Does DeepZotero need to be used online?
How long does it take to index my literature library?
How much does it cost to use DeepZotero?
Does DeepZotero support Chinese literatures?
Can I index literatures while Zotero is running?
How to update the indexed literatures?

Related resources

GitHub repository
The source code and latest version of DeepZotero
Zotero official website
The official website of the literature management tool Zotero
Gemini API application
The page for applying for a Gemini API key
Anthropic API console
The console for applying for an Anthropic API key and managing usage
Model Context Protocol documentation
The official technical documentation of the MCP protocol

Installation

Copy the following command to your Client for configuration
{
    "mcpServers": {
        "deep-zotero": {
            "command": "/path/to/.venv/bin/python",
            "args": ["-m", "deep_zotero.server"]
        }
    }
}

{
    "mcpServers": {
        "deep-zotero": {
            "command": "C:\\path\\to\\.venv\\Scripts\\python.exe",
            "args": ["-m", "deep_zotero.server"]
        }
    }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

A
Airweave
Airweave is an open - source context retrieval layer for AI agents and RAG systems. It connects and synchronizes data from various applications, tools, and databases, and provides relevant, real - time, multi - source contextual information to AI agents through a unified search interface.
Python
5.9K
5 points
F
Finlab Ai
FinLab AI is a quantitative financial analysis platform that helps users discover excess returns (alpha) in investment strategies through AI technology. It provides a rich dataset, backtesting framework, and strategy examples, supporting automated installation and integration into mainstream AI programming assistants.
5.4K
4 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
5.6K
4.5 points
A
Apify MCP Server
The Apify MCP Server is a tool based on the Model Context Protocol (MCP) that allows AI assistants to extract data from websites such as social media, search engines, and e-commerce through thousands of ready-to-use crawlers, scrapers, and automation tools (Apify Actors). It supports OAuth and Skyfire proxy payment and can be integrated into MCP clients such as Claude and VS Code through HTTPS endpoints or local stdio.
TypeScript
7.4K
5 points
P
Praisonai
PraisonAI is a production-ready multi-AI agent framework with self-reflection capabilities, designed to create AI agents to automate the solution of various problems from simple tasks to complex challenges. It simplifies the construction and management of multi-agent LLM systems by integrating PraisonAI agents, AG2, and CrewAI into a low-code solution, emphasizing simplicity, customization, and effective human-machine collaboration.
Python
10.4K
5 points
H
Haiku.rag
Haiku RAG is an intelligent retrieval - augmented generation system built on LanceDB, Pydantic AI, and Docling. It supports hybrid search, re - ranking, Q&A agents, multi - agent research processes, and provides local - first document processing and MCP server integration.
Python
9.1K
5 points
C
Claude Context
Claude Context is an MCP plugin that provides in - depth context of the entire codebase for AI programming assistants through semantic code search. It supports multiple embedding models and vector databases to achieve efficient code retrieval.
TypeScript
17.4K
5 points
M
Maverick MCP
MaverickMCP is a personal stock analysis server based on FastMCP 2.0, providing professional level financial data analysis, technical indicator calculation, and investment portfolio optimization tools for MCP clients such as Claude Desktop. It comes pre-set with 520 S&P 500 stock data, supports multiple technical analysis strategies and parallel processing, and can run locally without complex authentication.
Python
11.1K
4 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.2K
4.5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.2K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.2K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.3K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
31.0K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
64.2K
4.5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
21.0K
4.5 points
C
Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
97.8K
4.7 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase