Mcpvil
MCPvil is a minimized Wayland compositor based on the smallvil branch of the Smithay project. It integrates an MCP server, allowing AI agents and other MCP clients to interact with the compositor through stdio, providing functions such as application launch, screen capture, and mouse and keyboard simulation.
2 points
4.1K

What is MCPvil?

MCPvil is an innovative desktop control tool that connects the Wayland desktop environment (a graphical interface system similar to Windows or macOS) with an AI assistant. Through the Model Context Protocol (MCP), the AI assistant can operate your computer desktop like a human: launch applications, take screenshots, move the mouse, click buttons, input text, etc.

How to use MCPvil?

After installing MCPvil, simply configure MCPvil as a tool in an AI assistant (such as Claude, Gemini, etc.), and the AI can control your desktop through simple dialogue instructions. For example, you can say 'Help me open the browser and visit GitHub', and the AI will automatically perform these operations.

Applicable scenarios

Suitable for scenarios that require automated desktop operations: automated testing, remote assistance, automation of repetitive tasks, assisting disabled people in operating computers, teaching demonstrations, etc.

Main features

Application control
The AI can launch and close any application on the desktop, just like you operate in the terminal or start menu.
Intelligent screenshot
The AI can take a screenshot of the desktop at any time, save it as an image file or directly obtain the image data for analysis or recording.
Mouse operation
The AI can precisely control the mouse movement, click, and scroll, simulating human mouse operations.
Keyboard input
The AI can simulate keyboard buttons to input text or shortcuts, realizing automated input operations.
Visualization window
You can open an independent GUI window to view and control the desktop status in real - time.
MCP protocol support
It uses the standard Model Context Protocol and is compatible with various AI assistant platforms such as Claude and Gemini.
Advantages
No programming knowledge required: You can control desktop operations through natural language.
Cross - platform compatibility: Supports all Linux systems that support Wayland.
AI assistant integration: Seamlessly integrates with mainstream AI assistants.
Open - source and free: Based on an open - source project, it can be freely used and modified.
Comprehensive functions: Covers the main functions of desktop operations.
Limitations
Linux only: Currently only supports Linux systems (Wayland desktop environment).
Requires technical configuration: Installation and configuration require some command - line operation experience.
Depends on system permissions: Requires installing system dependency libraries and permissions.
Real - time limitation: The operation response speed is affected by the AI assistant and the network.
Relatively basic functions: Fewer functions compared to professional automation tools.

How to use

Install system dependencies
Install necessary development libraries and dependency packages on Ubuntu/Debian systems.
Install MCPvil
Install MCPvil through the Cargo package manager.
Configure the AI assistant
Add MCPvil as a tool in Claude Code or Gemini CLI.
Start using
Start MCPvil, and then send instructions through the AI assistant to control the desktop.

Usage examples

Automated web browsing
Let the AI automatically open the browser, visit a specific website, and take a screenshot and save it.
Repetitive form filling
Automatically fill in repetitive form data.
Remote assistance demonstration
Demonstrate specific operation steps to others.
Automated testing
Automatically test the startup and basic functions of an application.

Frequently Asked Questions

Does MCPvil support Windows or macOS?
Do I need programming knowledge to use it?
Is MCPvil safe? Will it be maliciously exploited?
Which AI assistants are supported?
What should I do if an operation goes wrong?
Can it control all desktop applications?

Related resources

GitHub repository
The source code and latest version of MCPvil
Smithay project
The Wayland compositor library that MCPvil is based on
Model Context Protocol official website
The official documentation and specifications of the MCP protocol
Wayland protocol documentation
Documentation for the Wayland display server protocol
Rust programming language
The programming language used by MCPvil

Installation

Copy the following command to your Client for configuration
{
  "mcpServers": {
    "mcpvil": {
      "command": "mcpvil"
    }
  }
}
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
7.5K
5 points
N
Next Devtools MCP
The Next.js development tools MCP server provides Next.js development tools and utilities for AI programming assistants such as Claude and Cursor, including runtime diagnostics, development automation, and document access functions.
TypeScript
10.8K
5 points
M
MCP Windbg
An MCP server that integrates AI models with WinDbg/CDB for analyzing Windows crash dump files and remote debugging, supporting natural language interaction to execute debugging commands.
Python
11.6K
5 points
P
Praisonai
PraisonAI is a production-ready multi-AI agent framework with self-reflection capabilities, designed to create AI agents to automate the solution of various problems from simple tasks to complex challenges. It simplifies the construction and management of multi-agent LLM systems by integrating PraisonAI agents, AG2, and CrewAI into a low-code solution, emphasizing simplicity, customization, and effective human-machine collaboration.
Python
10.4K
5 points
B
Blueprint MCP
Blueprint MCP is a chart generation tool based on the Arcade ecosystem. It uses technologies such as Nano Banana Pro to automatically generate visual charts such as architecture diagrams and flowcharts by analyzing codebases and system architectures, helping developers understand complex systems.
Python
10.7K
4 points
K
Klavis
Klavis AI is an open-source project that provides a simple and easy-to-use MCP (Model Context Protocol) service on Slack, Discord, and Web platforms. It includes various functions such as report generation, YouTube tools, and document conversion, supporting non-technical users and developers to use AI workflows.
TypeScript
21.7K
5 points
D
Devtools Debugger MCP
The Node.js Debugger MCP server provides complete debugging capabilities based on the Chrome DevTools protocol, including breakpoint setting, stepping execution, variable inspection, and expression evaluation.
TypeScript
9.2K
4 points
M
Mcpjungle
MCPJungle is a self-hosted MCP gateway used to centrally manage and proxy multiple MCP servers, providing a unified tool access interface for AI agents.
Go
0
4.5 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.4K
4.5 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.4K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
72.2K
4.3 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.6K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
32.2K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
65.5K
4.5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.1K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
47.8K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase