Deskmate

Deskmate is a local execution agent that allows users to control personal computers through natural language. It supports multiple AI agent backends and messaging platforms, provides full access to local tools, and does not require sandbox restrictions.

Operating system automation Developer tools #Local proxy #Natural language control #Multi-platform support #Secure execution .TypeScript

rating : 2.5 points

downloads : 7.0K

update time : 2026-03-13

Open Site

What is Deskmate MCP Server?

Deskmate MCP Server is a server based on the Model Context Protocol (MCP). It exposes the functions of your local computer as tools that AI assistants can use. Through this server, you can directly control your computer in AI applications such as Claude Desktop, performing various system operations just like having a conversation with an intelligent assistant.

How to use Deskmate MCP Server?

You can use the MCP server in two ways: 1) Run it as an independent server, specifically providing services for MCP clients such as Claude Desktop; 2) Run it as part of the Deskmate gateway, supporting messaging platforms such as Telegram and MCP clients simultaneously. After configuration, you can directly ask questions about your computer or perform operations in the AI assistant.

Use cases

Deskmate MCP Server is particularly suitable for the following scenarios: Developers need to quickly execute system commands in the IDE, remote workers need to access local files, system administrators need to monitor the status of multiple machines, and any users who want to interact with the computer through natural language.

Main features

Full local system access

Provides full access to your computer, including the file system, process management, system command execution, etc., without artificial restrictions or sandbox constraints.

Seamless integration with Claude Desktop

Can be integrated with Claude Desktop through simple JSON configuration, allowing Claude to directly control your local machine.

Multi-tool support

Provides a rich set of tools, including file reading and writing, command execution, system monitoring, skill running, scheduled task management, etc.

Security approval mechanism

Sensitive operations require manual approval. Approval requests are sent through clients such as Telegram to ensure that operations are safe and controllable.

Skill system integration

Supports running predefined skills (multi-step workflows), and complex operation sequences can be directly called through MCP tools.

Health monitoring tool

Provides a system health check tool that can monitor CPU, memory, disk usage, and proxy availability status.

Advantages

Seamless integration: Deeply integrated with AI assistants such as Claude Desktop, providing a natural conversation experience.

Comprehensive functionality: Provides full access to the local system without functional limitations.

Safe and controllable: Sensitive operations require manual approval, balancing convenience and security.

Flexible deployment: Supports independent operation or combined operation with the gateway to adapt to different usage scenarios.

Cross-platform support: Supports macOS, Linux, and WSL2, covering mainstream operating systems.

Limitations

Technical requirements: Requires certain technical knowledge for initial configuration and setup.

Permission requirements: Multiple system permissions need to be manually granted on macOS.

Network dependency: Requires a stable network connection to communicate with the AI assistant.

Security responsibility: Users need to be responsible for the granted permissions to ensure that no dangerous operations are performed.

How to use

Install Deskmate

First, install the Deskmate core software package. You can install it globally via npm or build it from the source code.

Run the initialization wizard

Run the initialization command to configure the necessary API keys, Telegram credentials, and system permissions.

Configure Claude Desktop

Add the Deskmate MCP server configuration to the Claude Desktop configuration file.

Add MCP server configuration

Add the commands and parameters of the Deskmate MCP server to the configuration file.

Restart and start using

Restart Claude Desktop. Now you can directly control your local computer in Claude.

Usage examples

File management

Manage local files through natural language without leaving the AI assistant interface.

Development workflow

Quickly execute commands and check status during the development process.

System monitoring

Keep track of the computer's running status and resource usage at any time.

Automation tasks

Execute complex multi-step tasks through predefined skills.

Frequently Asked Questions

What is the difference between the MCP server mode and the gateway mode?

Do I need to configure permissions separately for the MCP server?

Is the MCP server safe? Will it allow the AI assistant to perform dangerous operations?

Can I use the same MCP server on multiple computers?

Which AI assistants does the MCP server support?

What should I do if the MCP server stops responding?

Related resources

Deskmate GitHub repository

The complete source code, issue tracking, and contribution guidelines for Deskmate.

Model Context Protocol documentation

The official specification and documentation for the MCP protocol.

Claude Desktop download

The download page for the Claude Desktop client.

Deskmate Discord community

Join the Deskmate user community to get help and share experiences.

Installation demonstration video

An animated demonstration of the Deskmate installation process.

Usage example screenshot

A screenshot example of the actual Deskmate usage interface.

🚀 Deskmate

Control your Local Machine from anywhere using natural language.

Deskmate is a local execution agent that enables you to control your personal machine using natural language and communicate with you on the channels you already use. It focuses on execution rather than autonomy or orchestration. You can send a Telegram message from your phone, and it will execute on your machine. It supports multiple agent backends, including Claude Code, Codex (OpenAI), Gemini CLI, and OpenCode, with full local tool access, no sandboxed command set, and no artificial limits.

This is a passion project developed with a simple goal: to maintain a creative and developer flow even when not sitting at the desk. It is inspired by gen-shell.

Getting Started · Gateway Mode · Agent Providers · Architecture · Discord

🚀 Quick Start

Option A: Install from npm (recommended)

npm install -g @sarkar-ai/deskmate
deskmate init

The wizard will guide you through everything, including API keys, Telegram credentials, platform permissions, and background service setup. The configuration is stored in ~/.config/deskmate/.env.

After setup, you can run it manually with deskmate or let the background service handle it.

Option B: Install from source (for contributors)

git clone https://github.com/sarkar-ai-taken/deskmate.git
cd deskmate
npm install --legacy-peer-deps
npm run build
./install.sh          # interactive: configures .env, service, permissions

Or use the TypeScript wizard instead of the shell installer:

cp .env.example .env  # edit with your credentials
npx deskmate init     # or: npm link && deskmate init

To reconfigure later, use deskmate init.

✨ Features

Full local access: The agent can run any command, read/write any file, and take screenshots. There is no artificial 6 - tool sandbox.
Multi - channel gateway: Currently supports Telegram, and will support Discord, Slack, WhatsApp in the future. One Gateway, multiple clients.
Conversation memory: Ensures session continuity across messages, allowing you to ask follow - up questions naturally.
Multi - agent backends: Comes with Claude Code (default), Codex (OpenAI), Gemini CLI (Google), and OpenCode. You can switch the agent provider by setting AGENT_PROVIDER=codex in .env.
Approve - by - default: Safe commands are auto - approved. Protected folders (such as Desktop, Documents, etc.) require confirmation via inline buttons.
MCP server: Exposes your machine as a tool server for Claude Desktop or any MCP client.
Skills system: Define reusable multi - step workflows in skills.json. Skills can run commands, agent prompts, or other skills, and are hot - reloaded on change.
Cron scheduler: Schedule recurring jobs (commands, agent queries, or skills) via crons.json. Results are delivered to your active chat channels.
Health monitoring: Built - in health checks for CPU, memory, disk, and agent availability. Accessible via /health in Telegram, deskmate health on CLI, or the get_health MCP tool.
Docker container mode: Run the core in Docker with a native sidecar for host commands. Set INSTALL_MODE=container and use docker - compose up.
Runs as service: Integrates with launchd (macOS) or systemd (Linux), starts on boot, and restarts on crash.
Extensible agent layer: You can bring your own agent via registerProvider().

📦 Installation

Requirements

macOS (tested on Ventura, Sonoma, Sequoia) or Linux (with systemd)
Windows via WSL2
Node.js 18+
One of the supported agent CLIs installed (see Agent Providers)
Telegram account (for Telegram mode)
API key for your chosen provider (Anthropic, OpenAI, or Google — OpenCode manages its own auth)

Linux Prerequisites

Screenshots: Install ImageMagick (sudo apt install imagemagick) for screenshot support
Service: systemd with user session support (systemctl --user)

macOS Permissions

The installer will guide you through these (macOS only). You can also configure them manually in System Settings > Privacy & Security.

Permission	Purpose
Screen Recording	Take screenshots when requested
Accessibility	Control system functions
Full Disk Access	Read/write files in protected locations
Automation	Control other applications via AppleScript
Background Items	Run as a background service at login
Folder Access	Access to Desktop, Documents, Downloads, etc.

💻 Usage Examples

Basic Usage

System management: "Show disk usage", "What processes are using the most CPU?", "List all running Docker containers" File operations: "Show me the contents of package.json", "Find all TypeScript files in src/", "Create a new file called notes.txt with today's date" Development: "Run the tests", "What's the git status?", "Show me recent commits" Troubleshooting: "What's using port 8080?", "Show me the last 50 lines of the error log", "Check if nginx is running" Visual: "Take a screenshot", "Show me what's on the screen"

Taking a Screenshot	Opening Google Meet

Advanced Usage

You can use the skills system and cron jobs for more complex operations. For example, define a skill in skills.json to build and deploy a project:

{
  "version": 1,
  "skills": [
    {
      "name": "deploy",
      "description": "Build and deploy the project",
      "parameters": [{ "name": "env", "required": true }],
      "steps": [
        { "type": "command", "command": "npm run build" },
        { "type": "command", "command": "./deploy.sh {{env}}" }
      ]
    }
  ]
}

Run it via /skill deploy env=staging in Telegram or the run_skill MCP tool.

📚 Documentation

Running Modes

Mode	Command	Description
Gateway	`deskmate`	Multi - client gateway (default)
MCP	`deskmate mcp`	MCP server for Claude Desktop
Both	`deskmate both`	Gateway + MCP simultaneously
Sidecar	`deskmate sidecar`	Host sidecar for container mode

Note: deskmate telegram still works but is a deprecated alias that starts the gateway.

Gateway Mode

The gateway is the default way to run Deskmate. It separates platform I/O from agent logic, so adding a new messaging client doesn't require touching auth, sessions, or the agent layer.

# Configure multi - client auth
ALLOWED_USERS=telegram:123456,discord:987654321

# Start
deskmate

The gateway auto - registers clients based on available env vars. If TELEGRAM_BOT_TOKEN is set, Telegram is active. Future clients (Discord, Slack) follow the same pattern.

Bot Commands

Command	Description
`/start`	Show welcome message
`/screenshot`	Take a screenshot and send it
`/status`	Show system info and session status
`/health`	Show system health and resource metrics
`/skill`	List or run a registered skill
`/cron`	Show cron job status
`/reset`	Clear conversation memory

MCP Server

The MCP server exposes your machine as a tool server for Claude Desktop or any MCP client.

Setup with Claude Desktop

Add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "deskmate": {
      "command": "node",
      "args": ["/path/to/deskmate/dist/index.js", "mcp"],
      "env": {
        "WORKING_DIR": "/Users/yourname"
      }
    }
  }
}

Restart Claude Desktop. You can now ask Claude to interact with your local machine.

Combined Mode (Gateway + MCP)

Run both with deskmate both. MCP handles Claude Desktop requests; the gateway handles Telegram (and future clients), sending approval notifications to your phone so you can approve sensitive operations from anywhere.

Observability

Deskmate focuses on executing actions safely. For monitoring agent behavior, resource usage, and failures across multiple local agents, see Riva (local - first agent observability).

Skills

Skills are reusable multi - step workflows defined in JSON. Place a skills.json in your project root or ~/.config/deskmate/skills.json for global skills. Skills are hot - reloaded when the file changes.

{
  "version": 1,
  "skills": [
    {
      "name": "deploy",
      "description": "Build and deploy the project",
      "parameters": [{ "name": "env", "required": true }],
      "steps": [
        { "type": "command", "command": "npm run build" },
        { "type": "command", "command": "./deploy.sh {{env}}" }
      ]
    }
  ]
}

Each step can be a command (shell), prompt (agent query), or skill (nested skill). Run via /skill deploy env=staging in Telegram or the run_skill MCP tool.

Cron Jobs

Schedule recurring jobs via crons.json (project - local or ~/.config/deskmate/crons.json).

{
  "version": 1,
  "jobs": [
    {
      "name": "daily - backup",
      "schedule": "0 2 * * *",
      "action": { "type": "command", "command": "tar czf ~/backup.tar.gz ~/Documents" },
      "notify": true
    }
  ]
}

Actions can be command, agent_query (natural language prompt), or skill. Set notify: true to receive results in your active chat channels. Check status with /cron in Telegram or list_cron_jobs MCP tool.

Security

⚠️ Important Note

The agent can execute arbitrary commands on your machine. This is by design — the strategy is approve - by - default for read - only operations, with approval gating for protected folders and write operations.

Built - in protections

Layer	What it does
User authentication	Allowlist - based access control via `SecurityManager`. Only users in `ALLOWED_USERS` can interact. Supports per - client auth (`telegram:123`, `discord:456`) and wildcards (`:`).
Action approval	`ApprovalManager` gates sensitive operations. Write commands, file writes, and folder access require explicit human approval with configurable timeouts (default 5 min).
Protected folders	OS - aware folder protection. Desktop, Documents, Downloads, Pictures, Movies/Videos, Music, and iCloud (macOS) require approval. Session - based caching avoids repeated prompts.
Safe command auto - approval	Read - only commands (`ls`, `cat`, `git status`, `docker ps`, `node - v`, etc.) auto - approve. Full list in `src/core/approval.ts`.
Command execution limits	2 - minute timeout and 10 MB output buffer per command. Prevents runaway processes and memory exhaustion.
Session isolation	Sessions keyed by `clientType:channelId`. 30 - minute idle timeout with automatic pruning. Optional disk persistence survives restarts.
Input validation	MCP tools use Zod schema validation. Telegram callbacks validated via regex patterns.
No open ports	The bot polls Telegram's servers — no inbound ports exposed.
No sudo by default	The agent won't use sudo unless you explicitly ask.
Structured logging	All actions logged with timestamps, context hierarchy, and configurable log levels for audit trails.
Stale message protection	Telegram client drops pending updates on startup (`drop_pending_updates: true`), preventing replay of messages received while offline.

Approval workflow

User sends a message that triggers a sensitive operation (e.g., writing to ~/Documents)
ApprovalManager checks if the action matches a safe auto - approve pattern
If not safe, a pending approval is created with a timeout countdown
Approval request is broadcast to all clients with recent activity (last 30 min)
User taps Approve/Reject via inline buttons (Telegram) or equivalent
Action executes on approval, or is cancelled on rejection/timeout

Set REQUIRE_APPROVAL_FOR_ALL=true to gate every operation, including reads.

Recommendations

Set WORKING_DIR to limit default command scope
Use ALLOWED_USERS for multi - client allowlisting
Use ALLOWED_FOLDERS to pre - approve specific directories
Review logs regularly (logs/stdout.log)
Keep .env secure and never commit it
Use REQUIRE_APPROVAL_FOR_ALL=true if you want to approve every operation

Execution Philosophy

Deskmate follows an approve - by - default, visible - by - design model.

Read - only operations are auto - approved
Sensitive operations require explicit confirmation
All actions are logged locally

The goal is speed without hidden behavior.

Non - goals

Deskmate is intentionally not:

A multi - agent orchestration framework
A cloud - hosted control plane
A long - running autonomous system

These constraints are deliberate.

Agent Providers

Deskmate supports multiple agent backends. Set AGENT_PROVIDER in your .env or select one during deskmate init.

Provider	Binary	Env Var	Install
Claude Code (default)	`claude`	`ANTHROPIC_API_KEY`	docs.anthropic.com
Codex (OpenAI)	`codex`	`OPENAI_API_KEY`	github.com/openai/codex
Gemini CLI (Google)	`gemini`	`GEMINI_API_KEY`	[github.com/google - gemini/gemini - cli](https://github.com/google - gemini/gemini - cli)
OpenCode	`opencode`	(manages own auth)	[github.com/opencode - ai/opencode](https://github.com/opencode - ai/opencode)

# Switch provider
AGENT_PROVIDER=codex
OPENAI_API_KEY=sk - ...

# Or use the wizard
deskmate init

Only the API key matching your selected provider is required. Keys for other providers are preserved in .env if you switch back.

🔧 Technical Details

Architecture

src/
├── core/
│   ├── agent/
│   │   ├── types.ts              # AgentProvider interface
│   │   ├── factory.ts            # Provider factory + registerProvider()
│   │   └── providers/
│   │       ├── claude - code.ts    # Claude Code SDK (default)
│   │       ├── base - cli.ts       # Base class for CLI - spawned providers
│   │       ├── codex.ts          # Codex (OpenAI)
│   │       ├── gemini.ts         # Gemini CLI (Google)
│   │       └── opencode.ts       # OpenCode
│   ├── approval.ts               # Approval manager (auto - approve + manual)
│   ├── executor.ts               # Command execution, file I/O, screenshots
│   ├── executor - factory.ts     # Creates local or remote executor
│   ├── executor - interface.ts   # IExecutor interface
│   ├── remote - executor.ts      # Executor that delegates to sidecar
│   ├── health.ts                 # Health monitoring (CPU, memory, disk, agent)
│   ├── skills/                   # Skills system
│   │   ├── types.ts              # Skill definition schema (Zod)
│   │   ├── registry.ts           # Loads skills.json, hot - reloads on change
│   │   └── executor.ts           # Runs multi - step skill workflows
│   ├── cron/                     # Cron scheduler
│   │   ├── types.ts              # Cron job definition schema (Zod)
│   │   └── scheduler.ts          # node - cron based job runner
│   └── logger.ts                 # Structured logging
├── gateway/
│   ├── types.ts                  # MessagingClient, MessageHandler interfaces
│   ├── gateway.ts                # Central coordinator
│   ├── security.ts               # Multi - client allowlist auth
│   └── session.ts                # Session manager (composite keys, idle pruning)
├── clients/
│   └── telegram.ts               # Telegram adapter (grammY)
├── sidecar/                      # Host sidecar for container mode
│   ├── server.ts                 # Express server exposing executor over HTTP
│   └── cli.ts                    # Sidecar CLI entry point
└── mcp/
    └── server.ts                 # MCP server

Agent layer: Ships with four providers: Claude Code (via @anthropic - ai/claude - agent - sdk), Codex, Gemini CLI, and OpenCode. The three non - Claude providers extend BaseCliProvider, which handles subprocess spawning and stdout streaming. Custom agent providers can be registered via registerProvider().

Gateway layer: Central coordinator handling auth (SecurityManager), sessions (SessionManager), agent orchestration, approval routing, and screenshot delivery. Platform adapters implement the MessagingClient interface and do only I/O.

Adding a new client

Create src/clients/discord.ts implementing MessagingClient (see src/gateway/types.ts)
Add DISCORD_BOT_TOKEN to .env
Add discord:userId to ALLOWED_USERS
Register in the gateway startup: gateway.registerClient(new DiscordClient(token))

No changes to Gateway, SecurityManager, SessionManager, or the agent layer.

Bringing your own agent

import { AgentProvider, registerProvider } from "./core/agent";

class MyAgent implements AgentProvider {
  readonly name = "my - agent";
  readonly version = "1.0.0";
  // implement query(), queryStream(), isAvailable()
}

registerProvider("my - agent", MyAgent);
// then set AGENT_PROVIDER=my - agent in .env

CLI Commands

Command	Description
`deskmate`	Start the gateway (default mode)
`deskmate init`	Interactive setup wizard
`deskmate status`	Show service status and config validation
`deskmate health`	Show system health and resource metrics
`deskmate logs`	Tail stdout.log (`--stderr` for error log)
`deskmate restart`	Restart the background service
`deskmate doctor`	Run diagnostic checks
`deskmate sidecar`	Start the host sidecar (container mode)

Docker / Container Mode

Run Deskmate in a Docker container with a native sidecar handling host - level commands (screenshots, file access).

# Set install mode
INSTALL_MODE=container

# Start the sidecar on the host
deskmate sidecar

# Start the container
docker - compose up - d

The Dockerfile and docker - compose.yml are included in the package. The sidecar exposes a local HTTP API that the containerized core uses to execute commands on the host.

Service Management

macOS (launchd)

# View logs
tail - f logs/stdout.log
tail - f logs/stderr.log

# Stop the service
launchctl unload ~/Library/LaunchAgents/com.deskmate.service.plist

# Start the service
launchctl load ~/Library/LaunchAgents/com.deskmate.service.plist

# Check status
launchctl list | grep deskmate

Linux (systemd)

# View logs
tail - f logs/stdout.log
journalctl --user - u deskmate.service - f

# Stop / start / restart
systemctl --user stop deskmate.service
systemctl --user start deskmate.service
systemctl --user restart deskmate.service

# Check status
systemctl --user status deskmate.service

Uninstall

./uninstall.sh

Troubleshooting

⚠️ Important Note

If you encounter any issues, refer to the following solutions.

Bot not responding?

Check logs: tail - f logs/stderr.log
Verify your ALLOWED_USERS includes your Telegram ID (e.g. telegram:123456)
Ensure your agent CLI is installed (e.g. which claude, which codex, which gemini, which opencode)
Run deskmate doctor to diagnose configuration issues

Commands timing out?

Default timeout is 2 minutes
Long - running commands may need adjustment

Machine going to sleep?

macOS: Run ./install.sh to configure sleep prevention, or manually: sudo pmset - c sleep 0
Linux: The systemd service uses idle inhibitor. Check your desktop environment's power settings.

Permission denied errors? (macOS)

Re - run ./install.sh and go through the permissions setup
Or manually grant permissions in System Settings > Privacy & Security

Screenshots not working?

macOS: Grant Screen Recording permission in System Settings > Privacy & Security > Screen Recording
Linux: Install ImageMagick (sudo apt install imagemagick)
Restart the service after making changes

Future Work / Help Wanted

💡 Usage Tip

We welcome contributions to the following areas:

Additional messaging clients: The gateway architecture is ready. We'd welcome:

discord — Discord bot via discord.js
slack — Slack app via Bolt SDK
whatsapp — WhatsApp via the Business API

Background job handling: The current launchd (macOS) + systemd (Linux) approach works but could be improved for different device types (always - on Mac Mini vs MacBook, headless Linux servers).

Open an issue to discuss your approach.