MCP Fish Audio Server

The Fish Audio MCP Server is middleware that provides text-to-speech services. It integrates with LLMs such as Claude through the Model Context Protocol (MCP) and supports multilingual, multi-voice, real-time streaming audio generation.
2 points
6.4K

What is the Fish Audio MCP Server?

The Fish Audio MCP Server is a bridge between the Fish Audio text-to-speech API and LLMs such as Claude. It lets users generate high-quality speech through natural language instructions.

How to use the Fish Audio MCP Server?

The server works in any client that supports MCP and needs only a short configuration. Provide a Fish Audio API key and the voice model information to start generating speech.

Applicable Scenarios

The server suits scenarios that require natural speech output, such as AI assistants, virtual customer service, audiobook production, and multilingual content generation.

Main Features

High-quality Speech Synthesis
Uses Fish Audio's speech synthesis technology to generate natural, fluent speech.
Multiple Voice Options
Supports custom voice models; voices can be selected by ID, name, or tag.
Real-time Audio Streaming
Supports streaming over HTTP and WebSocket for low-latency, real-time playback.
Multiple Audio Formats
Supports output in MP3, WAV, PCM, and Opus.
Intelligent Voice Selection
Automatically selects the matching voice model from a given ID, name, or tag.

Advantages

Produces high-quality, natural-sounding speech.
Supports multiple voice models and styles.
Integrates easily with existing MCP-compatible systems.
Supports real-time audio streaming and multiple audio formats.

Limitations

Requires a valid Fish Audio API key.
Speech generation may be constrained by API call limits.
Long texts must be processed in segments.
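
The need to process long texts in segments can be illustrated with a minimal Python sketch. It is not taken from the server's source: the function name split_text and the 500-character default are assumptions chosen for illustration, and the real limit depends on the Fish Audio API. The sketch packs whole sentences into segments and hard-wraps any single sentence that exceeds the limit.

```python
import re


def split_text(text: str, max_chars: int = 500) -> list[str]:
    """Split text into segments of at most max_chars characters.

    Hypothetical helper: the 500-character default is an assumption,
    not a documented Fish Audio limit.
    """
    # Break after sentence-ending punctuation that is followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    segments: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is cut into fixed-size pieces.
        while len(sentence) > max_chars:
            if current:
                segments.append(current)
                current = ""
            segments.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        if not sentence:
            continue
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars:
            # The sentence still fits in the segment being built.
            current = candidate
        else:
            # Close the current segment and start a new one.
            segments.append(current)
            current = sentence
    if current:
        segments.append(current)
    return segments
```

Each returned segment could then be sent as a separate speech generation request, with the resulting audio files played or concatenated in order.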

How to Use

Get an API Key
Visit the Fish Audio official website to get your API key.
Install the MCP Server
Install the Fish Audio MCP Server using npm.
Configure Environment Variables
Set the necessary environment variables, including the API key, voice model, and output path.
Start the MCP Server
Run the MCP Server to start accepting speech generation requests.
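
As a rough sketch of the configuration step, the Python snippet below builds the same client configuration that appears in the Installation section, but reads the API key from the environment instead of hardcoding it. The function name build_client_config is hypothetical, and the values simply mirror the single-voice example; which variables the server treats as optional is not stated here, so all of them are included.

```python
import json
import os


def build_client_config(api_key: str, reference_id: str) -> dict:
    """Return an MCP client configuration for the Fish Audio server.

    Hypothetical helper; keys and default values mirror the
    single-voice example in the Installation section.
    """
    return {
        "mcpServers": {
            "fish-audio": {
                "command": "npx",
                "args": ["-y", "@alanse/fish-audio-mcp-server"],
                "env": {
                    "FISH_API_KEY": api_key,
                    "FISH_MODEL_ID": "speech-1.6",
                    "FISH_REFERENCE_ID": reference_id,
                    "FISH_OUTPUT_FORMAT": "mp3",
                    "FISH_STREAMING": "false",
                    "FISH_LATENCY": "balanced",
                    "FISH_MP3_BITRATE": "128",
                    "FISH_AUTO_PLAY": "false",
                    "AUDIO_OUTPUT_DIR": "~/.fish-audio-mcp/audio_output",
                },
            }
        }
    }


if __name__ == "__main__":
    # Fall back to the placeholder when the variable is not set.
    key = os.environ.get("FISH_API_KEY", "your_fish_audio_api_key_here")
    config = build_client_config(key, "your_voice_reference_id_here")
    print(json.dumps(config, indent=2))
```

The printed JSON can be pasted into the client's configuration file, which keeps the key out of scripts and version control.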

Usage Examples

Generate a Greeting Speech
Generate speech for 'Hello, welcome to Fish Audio TTS' using the default voice.
Use a Specific Voice
Generate speech for 'The weather is really nice today' using the voice named 'Carol'.
List All Voices
View all available voice models and their information.
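
Selecting a voice by ID, name, or tag, as in the 'Carol' example above, might look like the following Python sketch. The function select_voice and its precedence order (ID first, then name, then tag, then the default) are assumptions for illustration; the server's actual matching rules may differ. The voice list reuses the sample entries from the multi-voice configuration.

```python
from typing import Optional

# Sample voices taken from the multi-voice configuration example.
VOICES = [
    {"reference_id": "id1", "name": "Alice", "tags": ["female", "english"]},
    {"reference_id": "id2", "name": "Bob", "tags": ["male", "japanese"]},
    {"reference_id": "id3", "name": "Carol", "tags": ["female", "japanese", "anime"]},
]


def select_voice(voices: list, query: Optional[str], default_id: str) -> dict:
    """Pick a voice by ID, then name, then tag; fall back to the default.

    Hypothetical helper: the precedence order is an assumption.
    """
    by_id = {v["reference_id"]: v for v in voices}
    if not query:
        return by_id[default_id]
    if query in by_id:
        return by_id[query]
    wanted = query.lower()
    # Names are compared case-insensitively.
    for voice in voices:
        if voice["name"].lower() == wanted:
            return voice
    # The first voice carrying the tag wins.
    for voice in voices:
        if wanted in (tag.lower() for tag in voice["tags"]):
            return voice
    return by_id[default_id]
```

With these sample voices, a request for 'Carol' resolves to id3, and a request with no voice falls back to whatever default ID is supplied.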

Frequently Asked Questions

What do I need to use the Fish Audio MCP Server?
How do I select different voices?
Where are the generated speech files stored?
What audio formats are supported?
What should I do if I encounter an API error?

Related Resources

Fish Audio Official Website
Get the API key and learn more about Fish Audio.
MCP Protocol Documentation
Learn about the Model Context Protocol (MCP).
GitHub Repository
View the source code and project documentation.

Installation

Copy one of the following configurations into your MCP client's configuration file.
{
  "mcpServers": {
    "fish-audio": {
      "command": "npx",
      "args": ["-y", "@alanse/fish-audio-mcp-server"],
      "env": {
        "FISH_API_KEY": "your_fish_audio_api_key_here",
        "FISH_MODEL_ID": "speech-1.6",
        "FISH_REFERENCE_ID": "your_voice_reference_id_here",
        "FISH_OUTPUT_FORMAT": "mp3",
        "FISH_STREAMING": "false",
        "FISH_LATENCY": "balanced",
        "FISH_MP3_BITRATE": "128",
        "FISH_AUTO_PLAY": "false",
        "AUDIO_OUTPUT_DIR": "~/.fish-audio-mcp/audio_output"
      }
    }
  }
}

To configure several voices, list them in FISH_REFERENCES and set FISH_DEFAULT_REFERENCE:

{
  "mcpServers": {
    "fish-audio": {
      "command": "npx",
      "args": ["-y", "@alanse/fish-audio-mcp-server"],
      "env": {
        "FISH_API_KEY": "your_fish_audio_api_key_here",
        "FISH_MODEL_ID": "speech-1.6",
        "FISH_REFERENCES": "[{'reference_id':'id1','name':'Alice','tags':['female','english']},{'reference_id':'id2','name':'Bob','tags':['male','japanese']},{'reference_id':'id3','name':'Carol','tags':['female','japanese','anime']}]",
        "FISH_DEFAULT_REFERENCE": "id1",
        "FISH_OUTPUT_FORMAT": "mp3",
        "FISH_STREAMING": "false",
        "FISH_LATENCY": "balanced",
        "FISH_MP3_BITRATE": "128",
        "FISH_AUTO_PLAY": "false",
        "AUDIO_OUTPUT_DIR": "~/.fish-audio-mcp/audio_output"
      }
    }
  }
}
Note: Your API key is sensitive information. Do not share it with anyone.
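
The FISH_REFERENCES value in the multi-voice configuration is a list encoded as a string with single-quoted keys, which a strict JSON parser rejects. How the server itself parses this value is not documented here, so the Python sketch below is only one plausible way to check such a value before using it: it tries JSON first and falls back to ast.literal_eval, which evaluates literals only. The function names parse_references and has_default are hypothetical.

```python
import ast
import json


def parse_references(raw: str) -> list:
    """Parse a FISH_REFERENCES string into a list of voice dictionaries.

    Hypothetical helper; the server's real parsing logic may differ.
    """
    try:
        voices = json.loads(raw)
    except json.JSONDecodeError:
        # Single-quoted form: literal_eval accepts literals only, never code.
        voices = ast.literal_eval(raw)
    if not isinstance(voices, list):
        raise ValueError("FISH_REFERENCES must be a list of voices")
    for voice in voices:
        if not isinstance(voice, dict) or "reference_id" not in voice:
            raise ValueError("each voice needs a reference_id")
    return voices


def has_default(voices: list, default_id: str) -> bool:
    """Check that FISH_DEFAULT_REFERENCE names one of the listed voices."""
    return any(v["reference_id"] == default_id for v in voices)
```

Running such a check before starting the client can catch a mistyped list or a default ID that does not match any configured voice.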

Alternatives

Klavis
Klavis AI is an open-source project that provides an easy-to-use MCP (Model Context Protocol) service on Slack, Discord, and the web. It includes functions such as report generation, YouTube tools, and document conversion, so that both non-technical users and developers can use AI workflows.
TypeScript
10.4K
5 points
Devtools Debugger MCP
The Node.js Debugger MCP server provides full debugging capabilities based on the Chrome DevTools protocol, including setting breakpoints, stepping through code, variable inspection, and expression evaluation.
TypeScript
6.9K
4 points
Mcpjungle
MCPJungle is a self-hosted MCP gateway used to centrally manage and proxy multiple MCP servers, providing a unified tool access interface for AI agents.
Go
0
4.5 points
Nexus
Nexus is an AI tool aggregation gateway that supports connecting multiple MCP servers and LLM providers, providing tool search, execution, and model routing functions through a unified endpoint, and supporting security authentication and rate limiting.
Rust
0
4 points
Zen MCP Server
Zen MCP is a multi-model AI collaborative development server that provides enhanced workflow tools and cross-model context management for AI coding assistants such as Claude and Gemini CLI. It lets multiple AI models collaborate on development tasks such as code review, debugging, and refactoring, and it preserves conversation context across workflows.
Python
14.9K
5 points
Opendia
OpenDia is an open-source browser extension that lets AI models directly control the user's browser and perform automated operations using existing login sessions, bookmarks, and other data. It supports multiple browsers and AI models and focuses on privacy protection.
JavaScript
14.4K
5 points
Notte Browser
Certified
Notte is an open-source, full-stack framework for web AI agents that provides browser sessions, automated LLM-driven agents, web page observation and operation, credential management, and more. It aims to make the Internet agent-friendly and to reduce the cognitive burden on LLMs by describing website structures in natural language.
16.6K
4.5 points
Bing Search MCP
An MCP server that integrates the Microsoft Bing Search API, supporting web page, news, and image search and providing web search capabilities for AI assistants.
Python
15.7K
4 points
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
16.6K
4.3 points
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
25.0K
5 points
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
48.9K
4.3 points
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
16.3K
4.5 points
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real-time state monitoring, remote command execution, and log functions.
C#
21.3K
5 points
Figma Context MCP
Framelink Figma MCP Server gives AI programming tools (such as Cursor) access to Figma design data. By simplifying the Figma API response, it helps AI convert designs to code more accurately in one click.
TypeScript
47.8K
4.5 points
Gmail MCP Server
A Gmail MCP server with automatic authentication, designed for Claude Desktop. It supports Gmail management through natural language interaction, including sending emails, label management, and batch operations.
TypeScript
15.6K
4.5 points
Minimax MCP Server
The official MiniMax Model Context Protocol (MCP) server supports interaction with text-to-speech and video/image generation APIs and works with client tools such as Claude Desktop and Cursor.
Python
31.8K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2025 AIBase