OpenRouter Audio Transcription MCP
An MCP server that transcribes audio files using OpenRouter's audio-capable language models.
Features
- Verbatim transcription — exact word-for-word output including filler words, false starts, and repetitions
- Cleaned transcription — lightly edited for readability: removes fillers, adds punctuation, sentence boundaries, and paragraph breaks; omits content not intended for transcription
- Custom prompt transcription — direct the transcription with your own prompt for specialized use cases
Supported Models
| Model | Provider |
|---|---|
| google/gemini-3-flash-preview (default standard) | Google |
| google/gemini-3.1-flash-lite-preview (default budget) | Google |
| xiaomi/mimo-v2-omni | Xiaomi |
| openai/gpt-audio | OpenAI |
| openai/gpt-audio-mini (budget) | OpenAI |
| mistralai/voxtral-small-24b-2507 | Mistral |
| openai/gpt-4o-audio-preview | OpenAI |
Supported Audio Formats
mp3, wav, ogg, flac, m4a, aac, webm, wma, opus
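As a rough illustration of the format list above, a client could check a file's extension before calling the server. This is a minimal sketch, not the server's own validation: the helper name `hasSupportedExtension` is hypothetical, and lowercasing the extension before comparing is an assumption about how matching should behave.

```typescript
// The nine extensions listed in "Supported Audio Formats".
const SUPPORTED_EXTENSIONS: readonly string[] = [
  "mp3", "wav", "ogg", "flac", "m4a", "aac", "webm", "wma", "opus",
];

// Hypothetical helper: true when the path ends in one of the listed extensions.
// Assumption: matching is case-insensitive, so "talk.MP3" is accepted.
function hasSupportedExtension(filePath: string): boolean {
  const dot = filePath.lastIndexOf(".");
  // No dot at all, or a trailing dot, means there is no extension to check.
  if (dot === -1 || dot === filePath.length - 1) return false;
  const ext = filePath.slice(dot + 1).toLowerCase();
  return SUPPORTED_EXTENSIONS.indexOf(ext) !== -1;
}

const accepted = hasSupportedExtension("/home/user/recordings/meeting.MP3"); // true
const rejected = hasSupportedExtension("/home/user/notes.txt");              // false
```

A check like this only looks at the file name; it says nothing about whether the file's contents are valid audio.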
Setup
1. Get an OpenRouter API key
Sign up at openrouter.ai and create an API key at openrouter.ai/keys.
2. Add to Claude Code
Run the following command to add the MCP server to Claude Code:
claude mcp add audio-transcription -e OPENROUTER_API_KEY=your-api-key-here -- npx -y or-audio-transcription-mcp
Or add it manually to your Claude Code MCP settings (~/.claude/settings.json or project .mcp.json):
{
  "mcpServers": {
    "audio-transcription": {
      "command": "npx",
      "args": ["-y", "or-audio-transcription-mcp"],
      "env": {
        "OPENROUTER_API_KEY": "your-api-key-here"
      }
    }
  }
}
Alternative: Install from source
git clone https://github.com/danielrosehill/OR-Audio-Transcription-MCP.git
cd OR-Audio-Transcription-MCP
npm install
npm run build
Then configure with a direct path:
claude mcp add audio-transcription -e OPENROUTER_API_KEY=your-api-key-here -- node /path/to/OR-Audio-Transcription-MCP/dist/index.js
Tools
transcribe_audio
Transcribe an audio file.
| Parameter | Type | Required | Description |
|---|---|---|---|
| file_path | string | Yes | Absolute path to the audio file |
| mode | "verbatim" \| "cleaned" \| "custom" | Yes | Transcription mode |
| custom_prompt | string | When mode=custom | Custom prompt to direct the transcription |
| model | string | No | OpenRouter model ID (defaults to google/gemini-3-flash-preview) |
| budget | boolean | No | Use budget model (google/gemini-3.1-flash-lite-preview). Ignored if model is set |
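The parameter rules above can be sketched as a small TypeScript model. This is an illustration of the documented behavior, not the server's source: the names `TranscribeAudioArgs`, `resolveModel`, and `validateArgs` are hypothetical, and the example file path and prompt are placeholders.

```typescript
type Mode = "verbatim" | "cleaned" | "custom";

// Argument shape taken from the parameter table.
interface TranscribeAudioArgs {
  file_path: string;
  mode: Mode;
  custom_prompt?: string;
  model?: string;
  budget?: boolean;
}

const DEFAULT_MODEL = "google/gemini-3-flash-preview";
const BUDGET_MODEL = "google/gemini-3.1-flash-lite-preview";

// Hypothetical helper: an explicit model wins, then budget, then the default.
function resolveModel(args: TranscribeAudioArgs): string {
  if (args.model) return args.model;
  return args.budget ? BUDGET_MODEL : DEFAULT_MODEL;
}

// Hypothetical helper: returns a list of problems; an empty list means the
// arguments satisfy the "custom_prompt when mode=custom" rule.
function validateArgs(args: TranscribeAudioArgs): string[] {
  const problems: string[] = [];
  const hasPrompt = Boolean(args.custom_prompt && args.custom_prompt.trim());
  if (args.mode === "custom" && !hasPrompt) {
    problems.push('custom_prompt is required when mode is "custom"');
  }
  return problems;
}

const budgetCall: TranscribeAudioArgs = {
  file_path: "/home/user/recordings/meeting.mp3", // placeholder path
  mode: "cleaned",
  budget: true,
};
const pinnedCall: TranscribeAudioArgs = {
  file_path: "/home/user/recordings/meeting.mp3",
  mode: "custom",
  custom_prompt: "Transcribe and label each speaker.", // placeholder prompt
  model: "openai/gpt-audio-mini",
  budget: true,
};

const budgetModel = resolveModel(budgetCall); // "google/gemini-3.1-flash-lite-preview"
const pinnedModel = resolveModel(pinnedCall); // "openai/gpt-audio-mini" (budget is ignored)
```

In practice the MCP client passes these arguments for you; the sketch only makes the precedence between `model` and `budget` explicit.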
list_transcription_models
Lists all available audio transcription models.
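MCP clients such as Claude Code invoke both tools for you, but it can help to see the shape of the underlying request. MCP tool invocations are JSON-RPC 2.0 messages with the method `tools/call`. The sketch below builds such messages; the helper name `buildToolCall`, the request ids, and the example file path are illustrative, and passing an empty arguments object to `list_transcription_models` is an assumption based on the tool listing no parameters.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: wraps a tool name and its arguments in a request object.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown> = {},
): ToolCallRequest {
  return { jsonrpc: "2.0", id, method: "tools/call", params: { name, arguments: args } };
}

// Assumed to need no arguments, since none are documented for this tool.
const listRequest = buildToolCall(1, "list_transcription_models");

// A verbatim transcription request using the documented parameters.
const transcribeRequest = buildToolCall(2, "transcribe_audio", {
  file_path: "/home/user/recordings/meeting.mp3", // placeholder path
  mode: "verbatim",
});

// Serialized form, as it would be sent over the MCP transport.
const wireMessage = JSON.stringify(transcribeRequest);
```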
License
MIT