Maamcp
An MCP server based on MaaFramework, providing Android device and Windows desktop automation capabilities for AI assistants, supporting operations such as OCR recognition, clicking, swiping, and text input, and capable of converting operation processes into reusable Pipelines.
3.5 points
10.5K

What is MaaMCP?

MaaMCP is a bridge connecting AI assistants and automation frameworks. It allows AI assistants (such as Claude, Cursor, etc.) to control Android phones/emulators and Windows computers through standardized interfaces, performing operations such as clicking, swiping, inputting text, and recognizing screen content, to achieve real automated task execution.

How to use MaaMCP?

You only need to configure the MaaMCP server in an AI client that supports the MCP protocol (such as Cursor, Cherry Studio, etc.), and then you can tell the AI assistant what operations you want to perform in natural language, such as 'Help me open Meituan and order a takeaway' or 'Adjust the computer screen brightness to 50%'. The AI will automatically call the MaaMCP tool to complete these tasks.

Applicable scenarios

MaaMCP is particularly suitable for scenarios that require repetitive operations, such as: mobile app testing, data entry, batch file processing, cross - device workflow automation, providing operation assistance for the visually impaired, teaching demonstrations, etc.

Main features

Multi - device control
Connect and control multiple Android devices (via ADB) and Windows application windows simultaneously to achieve cross - device collaborative work.
Intelligent screen recognition (OCR)
Automatically recognize the text content on the screen, allowing AI to 'understand' the interface and make correct operation decisions. This is the most core and efficient interaction method.
Precise operation
Supports a variety of operation types such as clicking, double - clicking, swiping, text input, key presses (including combination keys), and mouse wheel scrolling to meet various interaction needs.
Background automation (Windows)
When taking screenshots and controlling on Windows, it runs in the background by default without occupying your mouse and keyboard. You can use the computer to do other things at the same time.
Process recording and reuse (Pipeline)
AI can intelligently convert a series of successfully executed operations into an automated script (Pipeline JSON) that can be run repeatedly, achieving 'one operation, infinite reuse'.
Advantages
๐Ÿค– Natural language - driven: Tell the AI what to do in a conversational way without learning complex programming or scripts.
๐Ÿ”— Standardized integration: Based on the Model Context Protocol (MCP), it can be easily integrated into various AI clients that support MCP.
๐Ÿ‘๏ธ Intelligent decision - making: AI combines the OCR recognition results to understand the interface and make operation judgments like a human.
๐Ÿ’พ Reusable processes: Through the Pipeline function, successful operation sequences can be saved and run with one click later.
โšก Background operation: Windows automation does not interfere with foreground work, improving efficiency.
Limitations
โš ๏ธ Some application restrictions: The anti - cheating mechanisms of certain games or applications may prevent background automation operations.
๐Ÿ”ง Permission requirements: If the target Windows application runs with administrator privileges, MaaMCP also needs to be started with administrator privileges.
๐ŸชŸ Window state: Operations cannot be performed on minimized Windows windows. The target window needs to be in a non - minimized state.
๐Ÿ“ฑ Device connection: Android devices need to be connected via ADB. Make sure the device has USB debugging enabled.

How to use

Install MaaMCP
Choose any of the following methods to install the MaaMCP server program. The uv method is recommended.
Configure the AI client
In the settings of the AI client you are using (such as Cursor, Cherry Studio, etc.), add MaaMCP as the MCP server.
Start using
Restart the AI client, and then describe the tasks you want the AI to perform in natural language in the chat box. The resource files required for OCR recognition will be automatically downloaded for the first use.

Usage examples

Example 1: Ordering takeaway on a mobile phone
Let the AI assistant operate your mobile phone to open the takeaway app and complete the entire process of selecting food and placing an order.
Example 2: Beautifying a PPT on a computer
Let the AI assistant view your current PPT page and add animation effects according to your requirements.
Example 3: Generating an automated script (Pipeline)
Let the AI save the process as an automated script that can be run repeatedly after performing a series of operations.

Frequently Asked Questions

What should I do if it prompts 'Failed to load det or rec' or the OCR resources do not exist?
What should I do if my mouse and keyboard are occupied when the AI operates Windows?
How can I view the running logs to troubleshoot problems?
Does it support iOS devices?
What is the use of the Pipeline function?

Related resources

MaaMCP GitHub repository
Project source code, latest version, and issue feedback.
MaaFramework official website
The underlying automation framework on which MaaMCP is based. Learn more technical details.
Model Context Protocol (MCP)
Learn about the MCP protocol standard, which is the future trend of AI tool integration.
Bilibili video demonstration
Watch the actual operation demonstration video of MaaMCP to understand its capabilities more intuitively.
PyPI project page
The project page on the Python Package Index. View the released versions.

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "MaaMCP": {
      "command": "maa-mcp"
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
6.1K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
6.2K
4.5 points
B
Bm.md
A feature-rich Markdown typesetting tool that supports multiple style themes and platform adaptation, providing real-time editing preview, image export, and API integration capabilities
TypeScript
14.4K
5 points
S
Security Detections MCP
Security Detections MCP is a server based on the Model Context Protocol that allows LLMs to query a unified security detection rule database covering Sigma, Splunk ESCU, Elastic, and KQL formats. The latest version 3.0 is upgraded to an autonomous detection engineering platform that can automatically extract TTPs from threat intelligence, analyze coverage gaps, generate SIEM-native format detection rules, run tests, and verify. The project includes over 71 tools, 11 pre-built workflow prompts, and a knowledge graph system, supporting multiple SIEM platforms.
TypeScript
5.9K
4 points
P
Paperbanana
Python
8.5K
5 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
7.7K
4.5 points
A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
7.2K
5 points
A
Apify MCP Server
The Apify MCP Server is a tool based on the Model Context Protocol (MCP) that allows AI assistants to extract data from websites such as social media, search engines, and e-commerce through thousands of ready-to-use crawlers, scrapers, and automation tools (Apify Actors). It supports OAuth and Skyfire proxy payment and can be integrated into MCP clients such as Claude and VS Code through HTTPS endpoints or local stdio.
TypeScript
7.2K
5 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
21.7K
4.5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
25.9K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
37.3K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
74.8K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
36.5K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
67.3K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
51.2K
4.8 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.1K
4.5 points