VoxFlow isn't just a web app — it's a full developer toolkit. The voxflow CLI lets you synthesize speech, generate podcasts, transcribe audio, and translate videos directly from your terminal. The MCP server brings these capabilities into Claude Code and other AI assistants.
CLI: Quick Start
Install
npm install -g voxflow
Login
voxflow login
This opens your browser for secure OAuth login. Your token is cached locally at ~/.config/voxflow/token.json.
Your First Synthesis
voxflow say "Hello, this is my first AI voiceover" -o hello.wav
That's it. One command, one audio file.
CLI Commands Overview
| Command | What It Does |
|---|---|
voxflow say "text" |
Synthesize text to speech |
voxflow narrate script.txt |
Multi-segment narration from file |
voxflow podcast "topic" |
Generate a full AI podcast |
voxflow story "prompt" |
AI story with narration |
voxflow dub subtitles.srt |
Dub video from subtitles |
voxflow asr audio.mp3 |
Transcribe audio to text |
voxflow translate subs.srt |
Translate subtitles |
voxflow video-translate video.mp4 |
End-to-end: transcribe + translate + dub |
voxflow voices |
Browse 200+ available voices |
voxflow present "topic" |
Generate narrated video slides |
Real-World Examples
Generate a podcast from a topic
voxflow podcast "The future of AI agents" \
--voice "authoritative-boss,cutesy-girl" \
--bgm jazz \
-o ai-agents-podcast.wav
Narrate a blog post
cat blog-post.md | voxflow narrate --voice gentle-female -o narration.wav
Translate a video end-to-end
voxflow video-translate presentation.mp4 \
--target-lang ja \
-o presentation-japanese.mp4
Transcribe a meeting recording
voxflow asr meeting.mp3 --format srt -o meeting.srt
MCP Server: Use VoxFlow in Claude Code
VoxFlow provides an MCP (Model Context Protocol) server that integrates with Claude Code, enabling AI-powered voice synthesis directly in your development workflow.
Setup
Add VoxFlow to your Claude Code MCP configuration:
{
"mcpServers": {
"voxflow": {
"command": "npx",
"args": ["-y", "voxflow", "mcp"]
}
}
}
What You Can Do
Once configured, Claude Code can:
- Synthesize speech — "Read this error message aloud"
- Generate podcasts — "Create a podcast about this PR"
- Search voices — "Find a Japanese female voice"
- Transcribe audio — "Transcribe this recording"
Example Workflow
In Claude Code, you can say:
"Generate a 3-minute podcast explaining the changes in this PR, using a professional male voice"
Claude will use the VoxFlow MCP tools to analyze your code changes, write a script, and produce the audio — all without leaving your terminal.
API Access
For direct API integration, VoxFlow provides REST endpoints:
# Synthesize text
curl -X POST https://api.voxflow.studio/api/tts/synthesize-stream \
-H "Authorization: Bearer $VOXFLOW_TOKEN" \
-H "Content-Type: application/json" \
-d '{"text": "Hello world", "voiceId": "v-female-R2s4N9qJ"}' \
-o output.mp3
# List voices
curl https://api.voxflow.studio/api/tts/voices
API access is included on the free tier — 10,000 credits per month.
What's Next?
- Browse Voice Library to find voices for your project
- Check the CLI documentation for full command reference
- Try AI Lecture to generate narrated slides
- Join Discord for developer support