Zotero Chunk MCP
DeepZotero is a tool that provides semantic search functionality for the Zotero literature library. It extracts text, tables, and images from PDFs, chunks, embeds, and stores them in ChromaDB. Finally, it provides 13 types of semantic search, Boolean search, table/image search, context expansion, citation graph query, index management, and cost - tracking tools to clients such as Claude Code through the MCP server.
rating : 2.5 points
downloads : 6.2K
What is DeepZotero?
DeepZotero is an enhanced academic literature management tool based on semantic search. It can deeply analyze PDF documents in your Zotero literature library, extract text content, table data, and chart information, and perform intelligent indexing through artificial intelligence technology. You can search your literature library in natural language, just like having a conversation with an assistant, and quickly find relevant research content, data tables, or charts.How to use DeepZotero?
Using DeepZotero mainly consists of three steps: First, install and configure the necessary API keys. Then, run the indexing program to analyze your literature library. Finally, use various search tools through Claude Code or other MCP clients. The entire process is highly automated, and even non - technical users can easily get started.Applicable scenarios
DeepZotero is particularly suitable for researchers, students, and scholars. You may need it when: 1. Quickly find relevant research on a specific topic in a large number of literatures. 2. Find papers containing specific data tables. 3. Track the citation relationships of a certain concept. 4. Organize and review the research progress in a certain field. 5. Find relevant literatures and evidence for writing papers.Main features
Intelligent semantic search
Search literature content using natural language, understand the semantic meaning of the query, rather than just keyword matching. You can search text paragraphs, table content, and chart descriptions.
Intelligent table extraction
Use AI vision technology to accurately extract table data from PDFs, convert it into a structured format, and support searching by table content.
Chart recognition and search
Automatically recognize charts and their titles in PDFs, extract them as searchable image files, and support searching by chart descriptions.
Intelligent re - ranking
Search results are not only based on similarity but also consider the importance of literature chapters (e.g., the results section has a higher weight) and journal quality, providing a more relevant ranking.
Citation relationship analysis
Find the citation relationships of literature through the OpenAlex database, understand which papers cite a certain literature, and which other studies are cited by this literature.
Context expansion
After finding relevant paragraphs, you can view their context content. For table results, you can also find relevant discussions in the text that refer to the table.
13 search tools
Provide diverse search tools, including semantic search, topic search, table search, chart search, Boolean search, etc., to meet different search needs.
Advantages
Intelligent search: Understand semantics and find truly relevant content, not just keyword matching.
Comprehensive coverage: Search text, tables, and charts simultaneously, providing a complete literature analysis.
Easy to use: Interact through natural language without learning complex query syntax.
Incremental indexing: Only process new literatures, saving time and computing resources.
Highly configurable: You can adjust parameters such as search weights and chunk sizes according to your needs.
Limitations
Requires API keys: Using the Gemini and Anthropic APIs requires registration and obtaining keys.
Initial indexing is time - consuming: Analyzing a large number of literatures for the first time takes a long time.
Depends on Zotero: You need to have Zotero installed and have a certain number of literatures.
Visual extraction cost: Using AI to extract tables incurs a small fee.
Technical requirements: Installation and configuration require basic command - line operation capabilities.
How to use
Installation preparation
Ensure that Python 3.10+ and Zotero are installed, and prepare the necessary API keys.
Configuration settings
Create a configuration file and fill in the path to your Zotero data directory and API keys.
Index the literature library
Run the indexing program to analyze your literature library. This process will automatically extract and index all PDF content.
Configure the MCP client
Add the DeepZotero server configuration in the Claude Code settings. You can use it after restarting.
Start searching
Use natural language to directly search your literature library in Claude Code, such as 'Find recent research on the impact of climate change on agriculture'.
Usage examples
Literature review preparation
When you need to write a literature review for a certain research topic, you can use DeepZotero to quickly find all relevant research and sort them by importance.
Data search
When you need to cite specific data in your writing, you can search for tables containing that data.
Citation tracking
When you find a key literature and want to understand its influence and subsequent research, you can view the citation relationships.
Cross - literature concept search
When you want to understand the expression and application of a certain concept in different studies, you can conduct a cross - literature search.
Frequently Asked Questions
Does DeepZotero need to be used online?
How long does it take to index my literature library?
How much does it cost to use DeepZotero?
Does DeepZotero support Chinese literatures?
Can I index literatures while Zotero is running?
How to update the indexed literatures?
Related resources
GitHub repository
The source code and latest version of DeepZotero
Zotero official website
The official website of the literature management tool Zotero
Gemini API application
The page for applying for a Gemini API key
Anthropic API console
The console for applying for an Anthropic API key and managing usage
Model Context Protocol documentation
The official technical documentation of the MCP protocol

Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.2K
4.5 points

Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.2K
4.3 points

Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.2K
5 points

Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.3K
4.3 points

Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
31.0K
5 points

Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
64.2K
4.5 points

Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
21.0K
4.5 points

Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
97.8K
4.7 points





