Kokoro Tts MCP
K

Kokoro Tts MCP

Kokoro Text to Speech (TTS) MCP Server, supporting the generation of MP3 files and optional uploading to S3 storage
2.5 points
10.1K

What is the Kokoro TTS MCP service?

The Kokoro TTS MCP service is a text-to-speech (TTS) solution that receives text input and generates corresponding voice MP3 files. The service is built on the Model Context Protocol (MCP), supports multiple voice styles and speed adjustments, and can automatically upload the generated audio files to AWS S3 cloud storage.

How to use the Kokoro TTS service?

You can use this service through a simple command-line client or by directly calling the MCP protocol. The service supports instant text conversion or reading content from a file, and the generated audio files can be saved locally or in the cloud.

Use cases

This service is suitable for various scenarios that require voice synthesis, such as: audiobook generation, voice assistant responses, educational content production, accessible access, etc. It is particularly suitable for workflows that require batch processing of text or automated voice generation.

Main features

Multi-voice support
Provides a variety of preset voice styles (such as af_heart, en_female, etc.) to meet the needs of different scenarios
Speed adjustment
You can adjust the voice playback speed (0.5 - 2.0 times the normal speed) to get the best auditory experience
S3 cloud storage integration
Supports automatically uploading the generated MP3 files to AWS S3 storage for easy sharing and management
Intelligent file management
Automatic cleaning of old files. You can set the number of days to keep or delete the local copy immediately after uploading
Advantages
A simple and easy-to-use command-line interface for easy integration into automated processes
Supports multiple language and voice style selections
Flexible cloud storage options to reduce local storage pressure
Open-source model support without additional licensing fees
Limitations
Requires installing dependency tools such as ffmpeg
Needs to download a large voice model file for the first use
Limited advanced voice customization functions

How to use

Environment preparation
Install the necessary dependencies, including the Python environment and the ffmpeg tool
Download the voice model
Get the Kokoro Onnx weight file from GitHub and put it in the project directory
Configure the service
Create a .env file or set environment variables to configure AWS credentials and voice parameters
Start the service
Run the MCP server using uvicorn
Use the client
Send text through the command-line client for voice synthesis

Usage examples

Generate a welcome voice
Create multi-language welcome voices for a website
Batch process documents
Convert long documents into audiobooks
Automated voice reminders
Integrate into the notification system to generate voice reminders

Frequently asked questions

How to change the default voice?
Where are the generated audio files saved?
What languages does the service support?
How to disable the S3 upload function?

Related resources

Kokoro Onnx project
Source code and weight files of the voice model
HuggingFace demo space
Experience the Kokoro TTS effect online
FFmpeg installation guide
Get and install the FFmpeg tool

Installation

Copy the following command to your Client for configuration
Note: Your key is sensitive information, do not share it with anyone.

Alternatives

V
Vestige
Vestige is an AI memory engine based on cognitive science. By implementing 29 neuroscience modules such as prediction error gating, FSRS - 6 spaced repetition, and memory dreaming, it provides long - term memory capabilities for AI. It includes a 3D visualization dashboard and 21 MCP tools, runs completely locally, and does not require the cloud.
Rust
5.4K
4.5 points
M
Moltbrain
MoltBrain is a long-term memory layer plugin designed for OpenClaw, MoltBook, and Claude Code, capable of automatically learning and recalling project context, providing intelligent search, observation recording, analysis statistics, and persistent storage functions.
TypeScript
6.4K
4.5 points
B
Bm.md
A feature-rich Markdown typesetting tool that supports multiple style themes and platform adaptation, providing real-time editing preview, image export, and API integration capabilities
TypeScript
4.5K
5 points
S
Security Detections MCP
Security Detections MCP is a server based on the Model Context Protocol that allows LLMs to query a unified security detection rule database covering Sigma, Splunk ESCU, Elastic, and KQL formats. The latest version 3.0 is upgraded to an autonomous detection engineering platform that can automatically extract TTPs from threat intelligence, analyze coverage gaps, generate SIEM-native format detection rules, run tests, and verify. The project includes over 71 tools, 11 pre-built workflow prompts, and a knowledge graph system, supporting multiple SIEM platforms.
TypeScript
6.7K
4 points
P
Paperbanana
Python
6.9K
5 points
B
Better Icons
An MCP server and CLI tool that provides search and retrieval of over 200,000 icons, supports more than 150 icon libraries, and helps AI assistants and developers quickly obtain and use icons.
TypeScript
7.7K
4.5 points
A
Assistant Ui
assistant - ui is an open - source TypeScript/React library for quickly building production - grade AI chat interfaces, providing composable UI components, streaming responses, accessibility, etc., and supporting multiple AI backends and models.
TypeScript
7.8K
5 points
A
Apify MCP Server
The Apify MCP Server is a tool based on the Model Context Protocol (MCP) that allows AI assistants to extract data from websites such as social media, search engines, and e-commerce through thousands of ready-to-use crawlers, scrapers, and automation tools (Apify Actors). It supports OAuth and Skyfire proxy payment and can be integrated into MCP clients such as Claude and VS Code through HTTPS endpoints or local stdio.
TypeScript
6.7K
5 points
D
Duckduckgo MCP Server
Certified
The DuckDuckGo Search MCP Server provides web search and content scraping services for LLMs such as Claude.
Python
73.9K
4.3 points
N
Notion Api MCP
Certified
A Python-based MCP Server that provides advanced to-do list management and content organization functions through the Notion API, enabling seamless integration between AI models and Notion.
Python
21.8K
4.5 points
M
Markdownify MCP
Markdownify is a multi-functional file conversion service that supports converting multiple formats such as PDFs, images, audio, and web page content into Markdown format.
TypeScript
35.1K
5 points
G
Gitlab MCP Server
Certified
The GitLab MCP server is a project based on the Model Context Protocol that provides a comprehensive toolset for interacting with GitLab accounts, including code review, merge request management, CI/CD configuration, and other functions.
TypeScript
26.1K
4.3 points
F
Figma Context MCP
Framelink Figma MCP Server is a server that provides access to Figma design data for AI programming tools (such as Cursor). By simplifying the Figma API response, it helps AI more accurately achieve one - click conversion from design to code.
TypeScript
65.8K
4.5 points
U
Unity
Certified
UnityMCP is a Unity editor plugin that implements the Model Context Protocol (MCP), providing seamless integration between Unity and AI assistants, including real - time state monitoring, remote command execution, and log functions.
C#
33.1K
5 points
G
Gmail MCP Server
A Gmail automatic authentication MCP server designed for Claude Desktop, supporting Gmail management through natural language interaction, including complete functions such as sending emails, label management, and batch operations.
TypeScript
22.3K
4.5 points
M
Minimax MCP Server
The MiniMax Model Context Protocol (MCP) is an official server that supports interaction with powerful text-to-speech, video/image generation APIs, and is suitable for various client tools such as Claude Desktop and Cursor.
Python
50.3K
4.8 points
AIBase
Zhiqi Future, Your AI Solution Think Tank
© 2026AIBase