Maamcp
An MCP server based on MaaFramework, providing Android device and Windows desktop automation capabilities for AI assistants, supporting operations such as OCR recognition, clicking, swiping, and text input, and capable of converting operation processes into reusable Pipelines.
3.5 points
5.8K

What is MaaMCP?

MaaMCP is a bridge connecting AI assistants and automation frameworks. It allows AI assistants (such as Claude, Cursor, etc.) to control Android phones/emulators and Windows computers through standardized interfaces, performing operations such as clicking, swiping, inputting text, and recognizing screen content, to achieve real automated task execution.

How to use MaaMCP?

You only need to configure the MaaMCP server in an AI client that supports the MCP protocol (such as Cursor, Cherry Studio, etc.), and then you can tell the AI assistant what operations you want to perform in natural language, such as 'Help me open Meituan and order a takeaway' or 'Adjust the computer screen brightness to 50%'. The AI will automatically call the MaaMCP tool to complete these tasks.

Applicable scenarios

MaaMCP is particularly suitable for scenarios that require repetitive operations, such as: mobile app testing, data entry, batch file processing, cross - device workflow automation, providing operation assistance for the visually impaired, teaching demonstrations, etc.

Main features

Multi - device control
Connect and control multiple Android devices (via ADB) and Windows application windows simultaneously to achieve cross - device collaborative work.
Intelligent screen recognition (OCR)
Automatically recognize the text content on the screen, allowing AI to 'understand' the interface and make correct operation decisions. This is the most core and efficient interaction method.
Precise operation
Supports a variety of operation types such as clicking, double - clicking, swiping, text input, key presses (including combination keys), and mouse wheel scrolling to meet various interaction needs.
Background automation (Windows)
When taking screenshots and controlling on Windows, it runs in the background by default without occupying your mouse and keyboard. You can use the computer to do other things at the same time.
Process recording and reuse (Pipeline)
AI can intelligently convert a series of successfully executed operations into an automated script (Pipeline JSON) that can be run repeatedly, achieving 'one operation, infinite reuse'.
Advantages
๐Ÿค– Natural language - driven: Tell the AI what to do in a conversational way without learning complex programming or scripts.
๐Ÿ”— Standardized integration: Based on the Model Context Protocol (MCP), it can be easily integrated into various AI clients that support MCP.
๐Ÿ‘๏ธ Intelligent decision - making: AI combines the OCR recognition results to understand the interface and make operation judgments like a human.
๐Ÿ’พ Reusable processes: Through the Pipeline function, successful operation sequences can be saved and run with one click later.
โšก Background operation: Windows automation does not interfere with foreground work, improving efficiency.
Limitations
โš ๏ธ Some application restrictions: The anti - cheating mechanisms of certain games or applications may prevent background automation operations.
๐Ÿ”ง Permission requirements: If the target Windows application runs with administrator privileges, MaaMCP also needs to be started with administrator privileges.
๐ŸชŸ Window state: Operations cannot be performed on minimized Windows windows. The target window needs to be in a non - minimized state.
๐Ÿ“ฑ Device connection: Android devices need to be connected via ADB. Make sure the device has USB debugging enabled.

How to use

Install MaaMCP
Choose any of the following methods to install the MaaMCP server program. The uv method is recommended.
Configure the AI client
In the settings of the AI client you are using (such as Cursor, Cherry Studio, etc.), add MaaMCP as the MCP server.
Start using
Restart the AI client, and then describe the tasks you want the AI to perform in natural language in the chat box. The resource files required for OCR recognition will be automatically downloaded for the first use.

Usage examples

Example 1: Ordering takeaway on a mobile phone
Let the AI assistant operate your mobile phone to open the takeaway app and complete the entire process of selecting food and placing an order.
Example 2: Beautifying a PPT on a computer
Let the AI assistant view your current PPT page and add animation effects according to your requirements.
Example 3: Generating an automated script (Pipeline)
Let the AI save the process as an automated script that can be run repeatedly after performing a series of operations.

Frequently Asked Questions

What should I do if it prompts 'Failed to load det or rec' or the OCR resources do not exist?
What should I do if my mouse and keyboard are occupied when the AI operates Windows?
How can I view the running logs to troubleshoot problems?
Does it support iOS devices?
What is the use of the Pipeline function?

Related resources

MaaMCP GitHub repository
Project source code, latest version, and issue feedback.
MaaFramework official website
The underlying automation framework on which MaaMCP is based. Learn more technical details.
Model Context Protocol (MCP)
Learn about the MCP protocol standard, which is the future trend of AI tool integration.
Bilibili video demonstration
Watch the actual operation demonstration video of MaaMCP to understand its capabilities more intuitively.
PyPI project page
The project page on the Python Package Index. View the released versions.

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "MaaMCP": {
      "command": "maa-mcp"
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

R
Runno
Runno is a collection of JavaScript toolkits for securely running code in multiple programming languages in environments such as browsers and Node.js. It achieves sandboxed execution through WebAssembly and WASI, supports languages such as Python, Ruby, JavaScript, SQLite, C/C++, and provides integration methods such as web components and MCP servers.
TypeScript
4.6K
5 points
N
Netdata
Netdata is an open-source real-time infrastructure monitoring platform that provides second-level metric collection, visualization, machine learning-driven anomaly detection, and automated alerts. It can achieve full-stack monitoring without complex configuration.
Go
6.2K
5 points
M
MCP Server
The Mapbox MCP Server is a model context protocol server implemented in Node.js, providing AI applications with access to Mapbox geospatial APIs, including functions such as geocoding, point - of - interest search, route planning, isochrone analysis, and static map generation.
TypeScript
6.2K
4 points
U
Uniprof
Uniprof is a tool that simplifies CPU performance analysis. It supports multiple programming languages and runtimes, does not require code modification or additional dependencies, and can perform one-click performance profiling and hotspot analysis through Docker containers or the host mode.
TypeScript
7.7K
4.5 points
G
Gk Cli
GitKraken CLI is a command - line tool that provides multi - repository workflow management, AI - generated commit messages and pull requests, and includes a local MCP server for integrating tools such as Git, GitHub, and Jira.
5.6K
4.5 points
M
MCP
A collection of official Microsoft MCP servers, providing AI assistant integration tools for various services such as Azure, GitHub, Microsoft 365, and Fabric. It supports local and remote deployment, helping developers connect AI models with various data sources and tools through a standardized protocol.
C#
6.3K
5 points
C
Claude Context
Claude Context is an MCP plugin that provides in - depth context of the entire codebase for AI programming assistants through semantic code search. It supports multiple embedding models and vector databases to achieve efficient code retrieval.
TypeScript
11.4K
5 points
A
Acemcp
Acemcp is an MCP server for codebase indexing and semantic search, supporting automatic incremental indexing, multi-encoding file processing, .gitignore integration, and a Web management interface, helping developers quickly search for and understand code context.
Python
12.2K
5 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
18.4K
4.5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
19.9K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
28.2K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
58.2K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
24.7K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
53.3K
4.5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
18.3K
4.5 points
C
Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
80.5K
4.7 points