Decompose
Decompose is a deterministic text classification tool for AI agents. It breaks text into structured semantic units using only regular expressions and heuristics, so documents can be pre-processed quickly and offline without an LLM. This significantly reduces the number of tokens the LLM later has to process.
Rating: 2.5 points
Downloads: 5.1K
What is Decompose?
Decompose is a text pre-processing tool designed specifically for AI agents. It automatically breaks complex documents (technical specifications, contracts, reports, and so on) into structured semantic units and attaches classification labels, risk scores, and importance markers to each unit. An agent can then decide how to handle each part of the document, which saves computing resources and improves processing efficiency.
How to use Decompose?
Decompose can be used in three ways: integrated into an AI agent as an MCP server, run directly as a command-line tool, or embedded in your application as a Python library. The most common way is the MCP server, which lets AI agents (such as Claude or Cursor) call the text decomposition function directly.
Applicable scenarios
Decompose is particularly suited to highly structured texts such as technical documents, legal contracts, engineering specifications, and regulatory files. It helps AI agents quickly identify key requirements, safety regulations, and compliance clauses, while filtering out background information and duplicate content.
Main features
Deterministic text classification
Classifies text with rules and heuristic algorithms rather than large language models, so results are consistent and predictable.
Multi-dimensional semantic annotation
Labels each text unit along several dimensions, including authority, risk level, content type, and importance score, to help the AI judge the semantic value of the text.
Automatic entity extraction
Automatically identifies and extracts references to standards, specifications, and regulations in the text, such as ASTM, ASCE, and ISO standard numbers.
Intelligent content filtering
Automatically filters out low-value content based on importance scores and risk levels, which can reduce LLM processing overhead by 60-80%.
Multi-format support
Available as an MCP server, a command-line tool, and a Python library to meet different integration requirements.
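The features above can be pictured with a small sketch. This is not Decompose's implementation: the patterns, the Unit fields, and the classify function are hypothetical names chosen only to show how regular expressions alone can label sentences, extract standard references, and stay deterministic.

```python
import re
from dataclasses import dataclass, field

# Hypothetical rules for illustration; these are NOT Decompose's actual patterns.
MANDATORY = re.compile(r"\b(shall|must|required)\b", re.IGNORECASE)
SAFETY = re.compile(r"\b(safety|hazard|fire)\b", re.IGNORECASE)
STANDARD = re.compile(r"\b(?:ASTM|ASCE|ISO)\s?[A-Z]?\d+(?:[-/]\d+)*\b")


@dataclass
class Unit:
    """One classified sentence (field names are assumptions)."""
    text: str
    authority: str
    risk: str
    entities: list = field(default_factory=list)


def classify(text: str) -> list:
    """Split text into sentences and label each one using only regex rules."""
    units = []
    # Split after sentence-ending punctuation that is followed by whitespace.
    for sentence in re.split(r"(?<=[.!?])\s+", text.strip()):
        if not sentence:
            continue
        authority = "mandatory" if MANDATORY.search(sentence) else "informational"
        risk = "high" if SAFETY.search(sentence) else "low"
        units.append(Unit(sentence, authority, risk, STANDARD.findall(sentence)))
    return units


sample = (
    "Concrete shall conform to ASTM C150. "
    "The site was surveyed in 2019. "
    "Fire exits must follow ISO 7010."
)
for unit in classify(sample):
    print(unit.authority, unit.risk, unit.entities)
```

Because the rules are plain pattern matches with no randomness or model calls, running classify twice on the same input yields identical units, which is the property the "deterministic" label refers to.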
Advantages
Ultra-fast processing: Processes a 50-page document in under 500 milliseconds
Completely offline: No network connection or API key required
Deterministic results: The same input always produces the same output
Zero cost: No LLM inference fees
Easy to integrate: Available as an MCP server, a command-line tool, and a Python library
Limitations
Rule-based: Cannot handle complex semantics that fall outside the rules
Domain-specific: Optimized mainly for technical documents and legal texts
Requires structured input: Limited effectiveness on unstructured text
Cannot generate content: Only performs classification and extraction; it does not generate new text
How to use
Install Decompose
Install the decompose-mcp package via pip
Configure the MCP server
Add the Decompose MCP server configuration to the AI agent's configuration file
Use the decomposition tool
Call the decompose_text or decompose_url tool in the AI agent to process the document
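As a rough sketch of the configuration step, many MCP clients read a JSON file with an "mcpServers" object that maps a server name to a launch command and its arguments. The helper below, build_mcp_config, is a hypothetical name, and the command and argument values are placeholders: the source does not state the actual launch command for Decompose, so take it from the package's own documentation after running pip install decompose-mcp.

```python
import json


def build_mcp_config(name: str, command: str, args: list) -> dict:
    """Return a config fragment in the `mcpServers` shape used by several MCP clients."""
    return {"mcpServers": {name: {"command": command, "args": list(args)}}}


# Placeholder values (assumptions): replace them with the real launch command
# and arguments given in the Decompose documentation.
fragment = build_mcp_config("decompose", "<launch-command>", ["<argument>"])
print(json.dumps(fragment, indent=2))
```

The printed fragment would be merged into the agent's existing configuration file; the exact file location depends on the client.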
Usage examples
Technical specification analysis
Analyze the technical specifications of a construction project and extract all mandatory requirements and safety regulations
Contract review
Review legal contracts and identify financial clauses and compliance requirements
Document summarization
Produce a condensed version of a long document that contains only the key information, selected from the existing text rather than newly written
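The scenarios above share one pattern: keep the units that matter and pass only those to the LLM. The sketch below assumes each unit is a dictionary with "text", "importance", and "risk" keys; that shape, the threshold, and the function names are illustrative assumptions, not the documented output of decompose_text.

```python
def select_for_llm(units, min_importance=0.5):
    """Keep units that are important enough or flagged as high risk."""
    return [
        u for u in units
        if u["importance"] >= min_importance or u["risk"] == "high"
    ]


def reduction(units, kept):
    """Fraction of characters removed by filtering (0.0 means nothing removed)."""
    total = sum(len(u["text"]) for u in units)
    remaining = sum(len(u["text"]) for u in kept)
    return 1 - remaining / total if total else 0.0


# Made-up example units; real scores would come from the tool itself.
units = [
    {"text": "Anchor bolts shall meet ASTM F1554.", "importance": 0.9, "risk": "high"},
    {"text": "This section gives background on the project.", "importance": 0.1, "risk": "low"},
    {"text": "Payment is due within 30 days.", "importance": 0.7, "risk": "medium"},
    {"text": "See the previous revision for history.", "importance": 0.2, "risk": "low"},
]

kept = select_for_llm(units)
print(f"kept {len(kept)} of {len(units)} units, {reduction(units, kept):.0%} fewer characters")
```

Only the kept units would be forwarded to the model; the discarded background text never consumes tokens.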
Frequently Asked Questions
Does Decompose require an internet connection or an API key?
Can Decompose process Chinese documents?
How do I customize classification rules?
What is the difference between Decompose and ordinary text chunking?
What file formats are supported?
Related resources
GitHub repository
View the source code, submit issues, and participate in contributions
PyPI package page
View the latest version and installation instructions
Technical blog post
Understand why rule engines outperform LLMs in some scenarios
MCP protocol documentation
Understand the detailed specifications of the Model Context Protocol
