Decompose
Decompose is a deterministic text classification tool for AI agents. It decomposes text into structured semantic units through pure regular expressions and heuristic methods, enabling fast and offline document pre - processing without LLM and significantly reducing the number of tokens processed by LLM.
2.5 points
7.0K

What is Decompose?

Decompose is a text pre - processing tool designed specifically for AI agents. It can automatically decompose complex documents (such as technical specifications, contracts, reports, etc.) into structured semantic units, and add classification labels, risk scores, and importance markers to each unit. This enables AI agents to more intelligently decide how to process different parts of the document, thereby saving computing resources and improving processing efficiency.

How to use Decompose?

Decompose offers three ways of use: integrating it into an AI agent as an MCP server, using it directly as a command - line tool, or embedding it into your application as a Python library. The most common way is through the MCP server, allowing your AI agents (such as Claude, Cursor, etc.) to directly call the text decomposition function.

Applicable scenarios

Decompose is particularly suitable for processing highly structured texts such as technical documents, legal contracts, engineering specifications, and regulatory files. It can help AI agents quickly identify important content such as key requirements, safety regulations, and compliance clauses in the document, while filtering out background information and duplicate content.

Main features

Deterministic text classification
Perform text classification based on rules and heuristic algorithms without relying on large - language models, ensuring the consistency and predictability of the results.
Multi - dimensional semantic annotation
Provide multi - dimensional labels such as authority, risk level, content type, and importance score for each text unit to help AI understand the semantic value of the text.
Automatic entity extraction
Automatically identify and extract entities such as standards, specifications, and regulations referenced in the text, such as ASTM, ASCE, ISO standard numbers.
Intelligent content filtering
Automatically filter low - value content based on importance scores and risk levels, which can reduce 60 - 80% of LLM processing overhead.
Multi - format support
Support three ways of use through the MCP server, command - line tool, and Python library to meet different integration requirements.
Advantages
Ultra - fast processing: Process a 50 - page document within 500 milliseconds
Completely offline: No network connection or API key required
Deterministic results: The same input always produces the same output
Zero cost: No LLM inference fees
Easy to integrate: Provide multiple ways of use
Limitations
Rule - based: Unable to handle complex semantics outside the rules
Domain - specific: Optimized mainly for technical documents and legal texts
Requires structured input: Limited effectiveness for unstructured texts
Cannot generate content: Only perform classification and extraction, no new text generation

How to use

Install Decompose
Install the decompose - mcp package via pip
Configure the MCP server
Add the Decompose MCP server configuration to the AI agent's configuration file
Use the decomposition tool
Call the decompose_text or decompose_url tool in the AI agent to process the document

Usage examples

Technical specification analysis
Analyze the technical specifications of a construction project and extract all mandatory requirements and safety regulations
Contract review
Review legal contracts and identify financial clauses and compliance requirements
Document summarization
Generate a concise summary for a long document, containing only key information

Frequently Asked Questions

Does Decompose require an internet connection or an API key?
Can Decompose process Chinese documents?
How to customize classification rules?
What is the difference between Decompose and ordinary text chunking?
What file formats are supported?

Related resources

GitHub repository
View the source code, submit issues, and participate in contributions
PyPI package page
View the latest version and installation instructions
Technical blog post
Understand why rule engines are superior to LLM in some scenarios
MCP protocol documentation
Understand the detailed specifications of the Model Context Protocol

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "decompose": {
      "command": "uvx",
      "args": ["decompose-mcp", "--serve"]
    }
  }
}

{
  "mcpServers": {
    "decompose": {
      "command": "python3",
      "args": ["-m", "decompose", "--serve"]
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
8.7K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
10.4K
4.5 points
B
Bm.md
A feature-rich Markdown typesetting tool that supports multiple style themes and platform adaptation, providing real-time editing preview, image export, and API integration capabilities
TypeScript
15.2K
5 points
S
Security Detections MCP
Security Detections MCP is a server based on the Model Context Protocol that allows LLMs to query a unified security detection rule database covering Sigma, Splunk ESCU, Elastic, and KQL formats. The latest version 3.0 is upgraded to an autonomous detection engineering platform that can automatically extract TTPs from threat intelligence, analyze coverage gaps, generate SIEM-native format detection rules, run tests, and verify. The project includes over 71 tools, 11 pre-built workflow prompts, and a knowledge graph system, supporting multiple SIEM platforms.
TypeScript
9.0K
4 points
P
Paperbanana
Python
10.2K
5 points
F
Finlab Ai
FinLab AI is a quantitative financial analysis platform that helps users discover excess returns (alpha) in investment strategies through AI technology. It provides a rich dataset, backtesting framework, and strategy examples, supporting automated installation and integration into mainstream AI programming assistants.
10.2K
4 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
9.9K
4.5 points
A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
10.2K
5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
27.9K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
39.6K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
81.2K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
23.4K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
38.8K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
70.8K
4.5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
23.4K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
56.2K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase