MCP Hub
Back to servers

Kokoro TTS MCP Server

A high-quality text-to-speech MCP server using the Kokoro model, allowing AI agents to generate and play audio with adjustable voices and speeds.

Tools
1
Updated
Dec 10, 2025

mcp-kokoro

An MCP server for Kokoro TTS, enabling high-quality text-to-speech capabilities for MCP clients.

Overview

This server provides tools to generate spoken audio from text using the Kokoro model. It is designed to be used with the Model Context Protocol (MCP), allowing AI agents to "speak" to the user.

Tools

speak

Generates audio from text and plays it immediately.

  • Arguments:
    • text (str): The text to speak.
    • voice (str, optional): The voice to use (default: af_heart).
    • speed (float, optional): Speaking speed (default: 1.0).

Caching

Generated audio is cached in ~/.cache/mcp_kokoro to speed up repeated requests.

Installation

Using uv (Recommended)

uv tool install mcp-kokoro

Using pip

pip install mcp-kokoro

Configuration

To use with Claude Desktop or other MCP clients, add the following to your configuration file (e.g., ~/Library/Application Support/Claude/claude_desktop_config.json):

uv

{
  "mcpServers": {
    "kokoro": {
      "command": "uv",
      "args": [
        "tool",
        "run",
        "mcp-kokoro"
      ]
    }
  }
}

pip

{
  "mcpServers": {
    "kokoro": {
      "command": "mcp-kokoro",
      "args": []
    }
  }
}

Requirements

  • Python 3.10 or higher
  • Audio output device (for playback)
  • sounddevice system dependencies (e.g., PortAudio) may be required on some systems.
    • macOS: brew install portaudio
    • Linux: sudo apt-get install libportaudio2

License

MIT

Reviews

No reviews yet

Sign in to write a review