Kokoro Tts MCP
K

Kokoro Tts MCP

Kokoro Text to Speech (TTS) MCP Server, supporting the generation of MP3 files and optional uploading to S3 storage
2.5 points
9.9K

What is the Kokoro TTS MCP service?

The Kokoro TTS MCP service is a text-to-speech (TTS) solution that receives text input and generates corresponding voice MP3 files. The service is built on the Model Context Protocol (MCP), supports multiple voice styles and speed adjustments, and can automatically upload the generated audio files to AWS S3 cloud storage.

How to use the Kokoro TTS service?

You can use this service through a simple command-line client or by directly calling the MCP protocol. The service supports instant text conversion or reading content from a file, and the generated audio files can be saved locally or in the cloud.

Use cases

This service is suitable for various scenarios that require voice synthesis, such as: audiobook generation, voice assistant responses, educational content production, accessible access, etc. It is particularly suitable for workflows that require batch processing of text or automated voice generation.

Main features

Multi-voice support
Provides a variety of preset voice styles (such as af_heart, en_female, etc.) to meet the needs of different scenarios
Speed adjustment
You can adjust the voice playback speed (0.5 - 2.0 times the normal speed) to get the best auditory experience
S3 cloud storage integration
Supports automatically uploading the generated MP3 files to AWS S3 storage for easy sharing and management
Intelligent file management
Automatic cleaning of old files. You can set the number of days to keep or delete the local copy immediately after uploading
Advantages
A simple and easy-to-use command-line interface for easy integration into automated processes
Supports multiple language and voice style selections
Flexible cloud storage options to reduce local storage pressure
Open-source model support without additional licensing fees
Limitations
Requires installing dependency tools such as ffmpeg
Needs to download a large voice model file for the first use
Limited advanced voice customization functions

How to use

Environment preparation
Install the necessary dependencies, including the Python environment and the ffmpeg tool
Download the voice model
Get the Kokoro Onnx weight file from GitHub and put it in the project directory
Configure the service
Create a .env file or set environment variables to configure AWS credentials and voice parameters
Start the service
Run the MCP server using uvicorn
Use the client
Send text through the command-line client for voice synthesis

Usage examples

Generate a welcome voice
Create multi-language welcome voices for a website
Batch process documents
Convert long documents into audiobooks
Automated voice reminders
Integrate into the notification system to generate voice reminders

Frequently asked questions

How to change the default voice?
Where are the generated audio files saved?
What languages does the service support?
How to disable the S3 upload function?

Related resources

Kokoro Onnx project
Source code and weight files of the voice model
HuggingFace demo space
Experience the Kokoro TTS effect online
FFmpeg installation guide
Get and install the FFmpeg tool

Installation

Copy the following command to your Client for configuration
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

R
Rsdoctor
Rsdoctor is a build analysis tool specifically designed for the Rspack ecosystem, fully compatible with webpack. It provides visual build analysis, multi - dimensional performance diagnosis, and intelligent optimization suggestions to help developers improve build efficiency and engineering quality.
TypeScript
9.0K
5 points
N
Next Devtools MCP
The Next.js development tools MCP server provides Next.js development tools and utilities for AI programming assistants such as Claude and Cursor, including runtime diagnostics, development automation, and document access functions.
TypeScript
9.6K
5 points
T
Testkube
Testkube is a test orchestration and execution framework for cloud-native applications, providing a unified platform to define, run, and analyze tests. It supports existing testing tools and Kubernetes infrastructure.
Go
6.2K
5 points
M
MCP Windbg
An MCP server that integrates AI models with WinDbg/CDB for analyzing Windows crash dump files and remote debugging, supporting natural language interaction to execute debugging commands.
Python
9.8K
5 points
R
Runno
Runno is a collection of JavaScript toolkits for securely running code in multiple programming languages in environments such as browsers and Node.js. It achieves sandboxed execution through WebAssembly and WASI, supports languages such as Python, Ruby, JavaScript, SQLite, C/C++, and provides integration methods such as web components and MCP servers.
TypeScript
7.6K
5 points
N
Netdata
Netdata is an open-source real-time infrastructure monitoring platform that provides second-level metric collection, visualization, machine learning-driven anomaly detection, and automated alerts. It can achieve full-stack monitoring without complex configuration.
Go
9.7K
5 points
M
MCP Server
The Mapbox MCP Server is a model context protocol server implemented in Node.js, providing AI applications with access to Mapbox geospatial APIs, including functions such as geocoding, point - of - interest search, route planning, isochrone analysis, and static map generation.
TypeScript
7.8K
4 points
U
Uniprof
Uniprof is a tool that simplifies CPU performance analysis. It supports multiple programming languages and runtimes, does not require code modification or additional dependencies, and can perform one-click performance profiling and hotspot analysis through Docker containers or the host mode.
TypeScript
7.3K
4.5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
21.1K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
19.3K
4.5 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
31.5K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
64.3K
4.3 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
27.3K
5 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
58.5K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
42.9K
4.8 points
C
Context7
Context7 MCP is a service that provides real-time, version-specific documentation and code examples for AI programming assistants. It is directly integrated into prompts through the Model Context Protocol to solve the problem of LLMs using outdated information.
TypeScript
85.8K
4.7 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase