mcp-kokoro

An MCP server for Kokoro TTS, enabling high-quality text-to-speech capabilities for MCP clients.

Overview

This server provides tools to generate spoken audio from text using the Kokoro model. It is designed to be used with the Model Context Protocol (MCP), allowing AI agents to "speak" to the user.

Tools

`speak`

Generates audio from text and plays it immediately.

Arguments:
- text (str): The text to speak.
- voice (str, optional): The voice to use (default: af_heart).
- speed (float, optional): Speaking speed (default: 1.0).

Caching

Generated audio is cached in ~/.cache/mcp_kokoro to speed up repeated requests.

Installation

Using `uv` (Recommended)

uv tool install mcp-kokoro

Using `pip`

pip install mcp-kokoro

Configuration

To use with Claude Desktop or other MCP clients, add the following to your configuration file (e.g., ~/Library/Application Support/Claude/claude_desktop_config.json):

uv

{
  "mcpServers": {
    "kokoro": {
      "command": "uv",
      "args": [
        "tool",
        "run",
        "mcp-kokoro"
      ]
    }
  }
}

pip

{
  "mcpServers": {
    "kokoro": {
      "command": "mcp-kokoro",
      "args": []
    }
  }
}

Requirements

Python 3.10 or higher
Audio output device (for playback)
sounddevice system dependencies (e.g., PortAudio) may be required on some systems.
- macOS: brew install portaudio
- Linux: sudo apt-get install libportaudio2

License

MIT

Kokoro TTS MCP Server

mcp-kokoro

Overview

Tools

`speak`

Caching

Installation

Using `uv` (Recommended)

Using `pip`

Configuration

uv

pip

Requirements

License

Reviews

Kokoro TTS MCP Server

mcp-kokoro

Overview

Tools

speak

Caching

Installation

Using uv (Recommended)

Using pip

Configuration

uv

pip

Requirements

License

Reviews

`speak`

Using `uv` (Recommended)

Using `pip`