Whisper MCP

A local audio transcription MCP server based on whisper.cpp, supporting multiple models and audio formats. It can work with the Apple Voice Memo MCP to implement a complete voice workflow.

What is the Whisper MCP Server?

The Whisper MCP Server is a local audio transcription tool that allows you to directly transcribe audio files in Claude conversations. It uses OpenAI's Whisper technology but runs entirely on your computer, which means your audio data never leaves your device, ensuring privacy and security.

How do I use the Whisper MCP Server?

First, configure the server in Claude Desktop or Claude Code. Once it is configured, Claude can call the server's transcription tools during a conversation. You provide the path to the audio file and choose the transcription model and output format; Claude then calls the server, which processes the file and returns the text.
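
The request a client sends when it calls a transcription tool can be sketched as follows. MCP tool calls are JSON-RPC 2.0 messages with the method `tools/call`. The tool name `transcribe_audio`, the argument names `file_path`, `model`, and `output_format`, and the file path are hypothetical placeholders, since this page does not list the server's actual tool names or parameters.

```python
import json


def build_tool_call(request_id: int, tool_name: str, arguments: dict) -> dict:
    """Build an MCP `tools/call` request as a JSON-RPC 2.0 message."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical tool name and argument names, for illustration only.
request = build_tool_call(
    1,
    "transcribe_audio",
    {"file_path": "/path/to/meeting.m4a", "model": "base", "output_format": "text"},
)

if __name__ == "__main__":
    print(json.dumps(request, indent=2))
```

In practice Claude builds and sends this message itself; you only describe the file, model, and format in the conversation.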

Applicable Scenarios

This tool is well suited to converting meeting recordings, interviews, podcasts, voice memos, or other audio content into text. Journalists, students, researchers, content creators, and anyone else who works with audio material can benefit from it.

Main Features

Local Processing: All transcription runs on your device, and no audio is uploaded to any server.
Multiple Model Options: Several Whisper models are available, from fast and lightweight to high-accuracy, so you can trade speed against accuracy.
Wide Format Support: Common audio formats such as WAV, MP3, and M4A are supported, so audio from various sources can be processed.
Timestamp Output: The transcript can optionally include timestamps, which makes it easier to locate specific passages in the audio.
Voice Memo Workflow: Together with the Apple Voice Memo MCP Server, it covers the full workflow from recording to transcription.
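
The format support and timestamp options above can be illustrated with a minimal sketch of a whisper.cpp-based pipeline. This page does not show how the Whisper MCP Server invokes its dependencies, so the two-step flow here is an assumption: ffmpeg first converts the input to 16 kHz mono 16-bit WAV (the input format the whisper.cpp README asks for), then the whisper.cpp command-line tool transcribes it. The binary name `whisper-cli`, the model path, and the file names are also assumptions.

```python
import shlex
from pathlib import Path


def ffmpeg_to_wav_cmd(src: Path, dst: Path) -> list[str]:
    """Build an ffmpeg command that converts any input to 16 kHz mono 16-bit PCM WAV."""
    return ["ffmpeg", "-y", "-i", str(src),
            "-ar", "16000", "-ac", "1", "-c:a", "pcm_s16le", str(dst)]


def whisper_cmd(model: Path, wav: Path, timestamps: bool = True) -> list[str]:
    """Build a whisper.cpp CLI command; `-nt` suppresses timestamps in the output."""
    cmd = ["whisper-cli", "-m", str(model), "-f", str(wav)]  # binary name is an assumption
    if not timestamps:
        cmd.append("-nt")
    return cmd


if __name__ == "__main__":
    # Hypothetical file names; pass each list to subprocess.run(cmd, check=True) to execute.
    print(shlex.join(ffmpeg_to_wav_cmd(Path("memo.m4a"), Path("memo.wav"))))
    print(shlex.join(whisper_cmd(Path("ggml-base.bin"), Path("memo.wav"), timestamps=False)))
```

Building the commands as argument lists rather than shell strings avoids quoting problems with file names that contain spaces.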

Advantages

Privacy Protection: All processing is done locally, and the audio data does not leave your device.
Offline Availability: Transcription works without an internet connection.
Flexible Configuration: You can choose a model that matches your accuracy and speed needs.
Zero Cost: There are no API calls to pay for.
Easy Integration: It runs as an MCP server inside Claude Desktop or Claude Code and is easy to use.

Limitations

Hardware Requirements: Larger models need more computing power, which can slow down processing.
Storage Space: Model files take up local storage space.
macOS Only: Currently, it mainly supports macOS.
Dependency Installation Required: whisper-cpp and ffmpeg must be installed in advance.
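
Because the server depends on external tools, it can help to check that they are on your `PATH` before configuring it. The following is a minimal sketch, not part of the Whisper MCP Server itself. The executable name `whisper-cli` is an assumption about what the whisper-cpp package installs; adjust it to match your installation.

```python
import shutil
from typing import Callable, Iterable, Optional


def missing_tools(
    names: Iterable[str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the executables from `names` that cannot be found on PATH."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    # "whisper-cli" is an assumed binary name for the whisper-cpp package.
    missing = missing_tools(["ffmpeg", "whisper-cli"])
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies found.")
```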

How to Use

Install Dependencies: Use Homebrew to install the required dependencies, whisper-cpp and ffmpeg, on macOS.
Install Whisper MCP: Install the Whisper MCP Server globally via npm.
Configure Claude Desktop: Open the Claude Desktop configuration file, where the server configuration will be added.
Add Configuration Content: Add the JSON configuration from the Installation section to the configuration file.
Restart Claude: Restart Claude Desktop so the configuration takes effect; the server is then ready to use.
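
The configuration step can also be scripted. The sketch below merges a `whisper-mcp` entry into an existing configuration without disturbing other servers. It assumes the usual macOS location of the Claude Desktop configuration file, `~/Library/Application Support/Claude/claude_desktop_config.json`; the helper names `add_server` and `update_config_file` are illustrative and not part of any tool.

```python
import json
from pathlib import Path

# Assumed default location of the Claude Desktop config on macOS.
CONFIG_PATH = (Path.home() / "Library" / "Application Support"
               / "Claude" / "claude_desktop_config.json")


def add_server(config: dict, name: str, command: str, args: list[str]) -> dict:
    """Return a copy of `config` with one MCP server entry added or replaced."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers[name] = {"command": command, "args": list(args)}
    merged["mcpServers"] = servers
    return merged


def update_config_file(path: Path = CONFIG_PATH) -> None:
    """Read the config (if present), add the whisper-mcp entry, and write it back."""
    text = path.read_text() if path.exists() else ""
    config = json.loads(text) if text.strip() else {}
    updated = add_server(config, "whisper-mcp", "npx", ["-y", "whisper-mcp"])
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(updated, indent=2) + "\n")
```

Calling `update_config_file()` writes the change; back up the file first if it already contains other settings.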

Usage Examples

Transcribe Meeting Recordings: Convert team meeting recordings into text for easy organization of meeting minutes and action items.
Process Voice Memos: Convert iPhone voice memos into text with timestamps for easy searching of specific content.
Academic Research Interviews: Transcribe research interview recordings for qualitative analysis and citation.
Podcast Content Organization: Convert podcast recordings into text for creating subtitles or transcripts.

Frequently Asked Questions

Do I need to pay to use this service?
Which audio formats are supported?
How fast is the transcription?
Does it support Chinese transcription?
Where are the model files stored?
Can it be used on Windows or Linux?
How to choose a suitable model?

Related Resources

GitHub Repository: Source code and latest updates of the Whisper MCP Server
whisper.cpp Project: The underlying Whisper C++ implementation
Apple Voice Memo MCP: A companion voice memo management tool
Model Context Protocol: Official documentation of the MCP protocol
Claude Desktop Configuration Guide: Official usage documentation of Claude Desktop

Installation

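Copy one of the following configurations into your client's configuration file. The first runs the published package with npx, the second runs a local build with node, and the third also registers the Apple Voice Memo MCP server alongside Whisper MCP.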
{
  "mcpServers": {
    "whisper-mcp": {
      "command": "npx",
      "args": ["-y", "whisper-mcp"]
    }
  }
}

{
  "mcpServers": {
    "whisper-mcp": {
      "command": "node",
      "args": ["/path/to/whisper-mcp/dist/index.js"]
    }
  }
}

{
  "mcpServers": {
    "apple-voice-memo-mcp": {
      "command": "npx",
      "args": ["-y", "apple-voice-memo-mcp"]
    },
    "whisper-mcp": {
      "command": "npx",
      "args": ["-y", "whisper-mcp"]
    }
  }
}

Alternatives

Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS-6 spaced repetition, and memory dreaming, it provides long-term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
4.5K
4.5 points
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
5.7K
4.5 points
Assistant Ui
assistant-ui is an open-source TypeScript/React library for quickly building production-grade AI chat interfaces. It provides composable UI components, streaming responses, and accessibility features, and supports multiple AI backends and models.
TypeScript
7.3K
5 points
Apify MCP Server
The Apify MCP Server is a tool based on the Model Context Protocol (MCP) that allows AI assistants to extract data from websites such as social media, search engines, and e-commerce through thousands of ready-to-use crawlers, scrapers, and automation tools (Apify Actors). It supports OAuth and Skyfire proxy payment and can be integrated into MCP clients such as Claude and VS Code through HTTPS endpoints or local stdio.
TypeScript
7.5K
5 points
Rsdoctor
Rsdoctor is a build analysis tool specifically designed for the Rspack ecosystem, fully compatible with webpack. It provides visual build analysis, multi-dimensional performance diagnosis, and intelligent optimization suggestions to help developers improve build efficiency and engineering quality.
TypeScript
9.4K
5 points
Next Devtools MCP
The Next.js development tools MCP server gives AI programming assistants such as Claude and Cursor access to Next.js tools and utilities, including runtime diagnostics, development automation, and documentation access.
TypeScript
10.8K
5 points
Testkube
Testkube is a test orchestration and execution framework for cloud-native applications, providing a unified platform to define, run, and analyze tests. It supports existing testing tools and Kubernetes infrastructure.
Go
6.5K
5 points
MCP Windbg
An MCP server that integrates AI models with WinDbg/CDB for analyzing Windows crash dump files and remote debugging, supporting natural language interaction to execute debugging commands.
Python
11.5K
5 points
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
24.4K
4.3 points
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
20.4K
4.5 points
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
34.3K
5 points
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
71.9K
4.3 points
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real-time state monitoring, remote command execution, and logging.
C#
31.1K
5 points
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one-click conversion from design to code.
TypeScript
65.4K
4.5 points
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
21.0K
4.5 points
Minimax MCP Server
The official MiniMax Model Context Protocol (MCP) server supports interaction with text-to-speech and video/image generation APIs, and works with client tools such as Claude Desktop and Cursor.
Python
48.6K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026 AIBase