Kokoro Tts MCP
Kokoro Text to Speech (TTS) MCP Server, supporting the generation of MP3 files and optional uploading to S3 storage
rating : 2.5 points
downloads : 35
What is the Kokoro TTS MCP service?
The Kokoro TTS MCP service is a text-to-speech (TTS) solution that receives text input and generates corresponding voice MP3 files. The service is built on the Model Context Protocol (MCP), supports multiple voice styles and speed adjustments, and can automatically upload the generated audio files to AWS S3 cloud storage.How to use the Kokoro TTS service?
You can use this service through a simple command-line client or by directly calling the MCP protocol. The service supports instant text conversion or reading content from a file, and the generated audio files can be saved locally or in the cloud.Use cases
This service is suitable for various scenarios that require voice synthesis, such as: audiobook generation, voice assistant responses, educational content production, accessible access, etc. It is particularly suitable for workflows that require batch processing of text or automated voice generation.Main features
Multi-voice supportProvides a variety of preset voice styles (such as af_heart, en_female, etc.) to meet the needs of different scenarios
Speed adjustmentYou can adjust the voice playback speed (0.5 - 2.0 times the normal speed) to get the best auditory experience
S3 cloud storage integrationSupports automatically uploading the generated MP3 files to AWS S3 storage for easy sharing and management
Intelligent file managementAutomatic cleaning of old files. You can set the number of days to keep or delete the local copy immediately after uploading
Advantages and limitations
Advantages
A simple and easy-to-use command-line interface for easy integration into automated processes
Supports multiple language and voice style selections
Flexible cloud storage options to reduce local storage pressure
Open-source model support without additional licensing fees
Limitations
Requires installing dependency tools such as ffmpeg
Needs to download a large voice model file for the first use
Limited advanced voice customization functions
How to use
Environment preparation
Install the necessary dependencies, including the Python environment and the ffmpeg tool
Download the voice model
Get the Kokoro Onnx weight file from GitHub and put it in the project directory
Configure the service
Create a .env file or set environment variables to configure AWS credentials and voice parameters
Start the service
Run the MCP server using uvicorn
Use the client
Send text through the command-line client for voice synthesis
Usage examples
Generate a welcome voiceCreate multi-language welcome voices for a website
Batch process documentsConvert long documents into audiobooks
Automated voice remindersIntegrate into the notification system to generate voice reminders
Frequently asked questions
How to change the default voice?
Where are the generated audio files saved?
What languages does the service support?
How to disable the S3 upload function?
Related resources
Kokoro Onnx project
Source code and weight files of the voice model
HuggingFace demo space
Experience the Kokoro TTS effect online
FFmpeg installation guide
Get and install the FFmpeg tool
Featured MCP Services

Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
1.7K
5 points

Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
823
4.3 points

Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
79
4.3 points

Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
130
4.5 points

Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
554
5 points

Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
6.6K
4.5 points

Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
5.2K
4.7 points

Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
745
4.8 points