Omniparser AutoGUI
This is an MCP server based on OmniParser that can analyze screen content and automatically operate the GUI interface, mainly running on the Windows system.
rating : 2.5 points
downloads : 37
What is OmniParser Automated GUI MCP?
This is an intelligent server that can 'see' and interact with your computer screen. It uses advanced AI (OmniParser) to understand the content displayed on the screen and can automatically perform GUI operations, such as clicking buttons or entering text.How does it work?
The server connects to applications that support MCP (such as ClaudeChat). It achieves automated tasks by analyzing the screen content and generating appropriate operation instructions based on the context.Why choose OmniParser Automated GUI MCP?
Compared with traditional scripts, OmniParser provides more powerful and flexible screen analysis capabilities. It can handle complex UI elements and provide intelligent operations through context understanding.Features
Multi - language SupportBy setting the OCR_LANG environment variable, it supports text recognition in multiple languages.
Window Target LocationUse the TARGET_WINDOW_NAME environment variable to specify the specific window to control.
Context UnderstandingGenerate intelligent operation instructions based on the screen content and context to improve the accuracy of automated tasks.
Frequently Asked Questions
Does it support Mac or Linux?
Can I use different languages for text recognition?
How do I specify the window to control?
Related Resources
OmniParser GitHub
The core AI technology for screen analysis.
Model Context Protocol Documentation
The official documentation of the MCP protocol.
LibreChat Example Integration
An example client code repository for use with this server.
Featured MCP Services

Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
140
4.5 points

Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
829
4.3 points

Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
1.7K
5 points

Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
86
4.3 points

Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
6.7K
4.5 points

Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
565
5 points

Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
282
4.5 points

Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
753
4.8 points