Maamcp
An MCP server based on MaaFramework, providing Android device and Windows desktop automation capabilities for AI assistants, supporting operations such as OCR recognition, clicking, swiping, and text input, and capable of converting operation processes into reusable Pipelines.
3.5 points
9.4K

What is MaaMCP?

MaaMCP is a bridge connecting AI assistants and automation frameworks. It allows AI assistants (such as Claude, Cursor, etc.) to control Android phones/emulators and Windows computers through standardized interfaces, performing operations such as clicking, swiping, inputting text, and recognizing screen content, to achieve real automated task execution.

How to use MaaMCP?

You only need to configure the MaaMCP server in an AI client that supports the MCP protocol (such as Cursor, Cherry Studio, etc.), and then you can tell the AI assistant what operations you want to perform in natural language, such as 'Help me open Meituan and order a takeaway' or 'Adjust the computer screen brightness to 50%'. The AI will automatically call the MaaMCP tool to complete these tasks.

Applicable scenarios

MaaMCP is particularly suitable for scenarios that require repetitive operations, such as: mobile app testing, data entry, batch file processing, cross - device workflow automation, providing operation assistance for the visually impaired, teaching demonstrations, etc.

Main features

Multi - device control
Connect and control multiple Android devices (via ADB) and Windows application windows simultaneously to achieve cross - device collaborative work.
Intelligent screen recognition (OCR)
Automatically recognize the text content on the screen, allowing AI to 'understand' the interface and make correct operation decisions. This is the most core and efficient interaction method.
Precise operation
Supports a variety of operation types such as clicking, double - clicking, swiping, text input, key presses (including combination keys), and mouse wheel scrolling to meet various interaction needs.
Background automation (Windows)
When taking screenshots and controlling on Windows, it runs in the background by default without occupying your mouse and keyboard. You can use the computer to do other things at the same time.
Process recording and reuse (Pipeline)
AI can intelligently convert a series of successfully executed operations into an automated script (Pipeline JSON) that can be run repeatedly, achieving 'one operation, infinite reuse'.
Advantages
๐Ÿค– Natural language - driven: Tell the AI what to do in a conversational way without learning complex programming or scripts.
๐Ÿ”— Standardized integration: Based on the Model Context Protocol (MCP), it can be easily integrated into various AI clients that support MCP.
๐Ÿ‘๏ธ Intelligent decision - making: AI combines the OCR recognition results to understand the interface and make operation judgments like a human.
๐Ÿ’พ Reusable processes: Through the Pipeline function, successful operation sequences can be saved and run with one click later.
โšก Background operation: Windows automation does not interfere with foreground work, improving efficiency.
Limitations
โš ๏ธ Some application restrictions: The anti - cheating mechanisms of certain games or applications may prevent background automation operations.
๐Ÿ”ง Permission requirements: If the target Windows application runs with administrator privileges, MaaMCP also needs to be started with administrator privileges.
๐ŸชŸ Window state: Operations cannot be performed on minimized Windows windows. The target window needs to be in a non - minimized state.
๐Ÿ“ฑ Device connection: Android devices need to be connected via ADB. Make sure the device has USB debugging enabled.

How to use

Install MaaMCP
Choose any of the following methods to install the MaaMCP server program. The uv method is recommended.
Configure the AI client
In the settings of the AI client you are using (such as Cursor, Cherry Studio, etc.), add MaaMCP as the MCP server.
Start using
Restart the AI client, and then describe the tasks you want the AI to perform in natural language in the chat box. The resource files required for OCR recognition will be automatically downloaded for the first use.

Usage examples

Example 1: Ordering takeaway on a mobile phone
Let the AI assistant operate your mobile phone to open the takeaway app and complete the entire process of selecting food and placing an order.
Example 2: Beautifying a PPT on a computer
Let the AI assistant view your current PPT page and add animation effects according to your requirements.
Example 3: Generating an automated script (Pipeline)
Let the AI save the process as an automated script that can be run repeatedly after performing a series of operations.

Frequently Asked Questions

What should I do if it prompts 'Failed to load det or rec' or the OCR resources do not exist?
What should I do if my mouse and keyboard are occupied when the AI operates Windows?
How can I view the running logs to troubleshoot problems?
Does it support iOS devices?
What is the use of the Pipeline function?

Related resources

MaaMCP GitHub repository
Project source code, latest version, and issue feedback.
MaaFramework official website
The underlying automation framework on which MaaMCP is based. Learn more technical details.
Model Context Protocol (MCP)
Learn about the MCP protocol standard, which is the future trend of AI tool integration.
Bilibili video demonstration
Watch the actual operation demonstration video of MaaMCP to understand its capabilities more intuitively.
PyPI project page
The project page on the Python Package Index. View the released versions.

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "MaaMCP": {
      "command": "maa-mcp"
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

R
Rsdoctor
Rsdoctor is a build analysis tool specifically designed for the Rspack ecosystem, fully compatible with webpack. It provides visual build analysis, multi - dimensional performance diagnosis, and intelligent optimization suggestions to help developers improve build efficiency and engineering quality.
TypeScript
8.6K
5 points
N
Next Devtools MCP
The Next.js development tools MCP server provides Next.js development tools and utilities for AI programming assistants such as Claude and Cursor, including runtime diagnostics, development automation, and document access functions.
TypeScript
10.4K
5 points
T
Testkube
Testkube is a test orchestration and execution framework for cloud-native applications, providing a unified platform to define, run, and analyze tests. It supports existing testing tools and Kubernetes infrastructure.
Go
5.7K
5 points
M
MCP Windbg
An MCP server that integrates AI models with WinDbg/CDB for analyzing Windows crash dump files and remote debugging, supporting natural language interaction to execute debugging commands.
Python
9.6K
5 points
R
Runno
Runno is a collection of JavaScript toolkits for securely running code in multiple programming languages in environments such as browsers and Node.js. It achieves sandboxed execution through WebAssembly and WASI, supports languages such as Python, Ruby, JavaScript, SQLite, C/C++, and provides integration methods such as web components and MCP servers.
TypeScript
9.7K
5 points
N
Netdata
Netdata is an open-source real-time infrastructure monitoring platform that provides second-level metric collection, visualization, machine learning-driven anomaly detection, and automated alerts. It can achieve full-stack monitoring without complex configuration.
Go
10.3K
5 points
M
MCP Server
The Mapbox MCP Server is a model context protocol server implemented in Node.js, providing AI applications with access to Mapbox geospatial APIs, including functions such as geocoding, point - of - interest search, route planning, isochrone analysis, and static map generation.
TypeScript
8.2K
4 points
U
Uniprof
Uniprof is a tool that simplifies CPU performance analysis. It supports multiple programming languages and runtimes, does not require code modification or additional dependencies, and can perform one-click performance profiling and hotspot analysis through Docker containers or the host mode.
TypeScript
7.4K
4.5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
22.4K
4.3 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
32.8K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
65.7K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
19.4K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
30.2K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
60.1K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
45.4K
4.8 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
20.1K
4.5 points