VoiceMode

Enable voice conversations with Claude using any OpenAI-compatible STT/TTS service.

getvoicemode.com

Stars: 463
Forks: 62
Releases: 20

Overview

VoiceMode is an MCP server that brings natural voice conversations to Claude Code and other MCP clients. It captures microphone input, transcribes it with a speech-to-text (STT) service, and speaks the assistant's reply through a text-to-speech (TTS) service. Both services can be local OpenAI-compatible servers (e.g., Whisper.cpp for STT and Kokoro for TTS) or cloud providers reached through the OpenAI API, so audio can stay on-device when privacy matters. Transport is selected automatically for low latency, with optional LiveKit-based rooms as an alternative to the local microphone. Silence detection stops recording when you stop speaking, so the conversation flows back and forth without manual prompts.

VoiceMode is added to Claude Code with the claude mcp commands. It can be installed with Astral UV, as a NixOS flake, from source, or through package managers. The documentation includes guides for Getting Started, LiveKit, Whisper, and Kokoro.

Details

Owner: mbailey
Language: Python
License: MIT License
Updated: 2025-12-07

Features

Natural Voice Conversations

Allows users to ask questions and hear responses from Claude Code and other MCP-enabled AI assistants.

Supports Local Voice Models

Works with any OpenAI API-compatible STT/TTS service, which enables local processing and privacy-friendly setups.
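
To illustrate what "OpenAI API-compatible" means in practice, here is a minimal sketch that builds (but does not send) a text-to-speech request in the shape of the OpenAI /audio/speech endpoint. The function name build_tts_request, the local base URL and port, and the voice name are assumptions chosen for illustration; they are not taken from VoiceMode's code or configuration. Swapping the base URL is all it takes to point the same request at a local server or a cloud provider.

```python
import json
import urllib.request


def build_tts_request(base_url, text, voice, model="tts-1", api_key="not-needed-locally"):
    """Build a POST request for an OpenAI-style /audio/speech endpoint.

    Hypothetical helper for illustration only; not part of VoiceMode.
    """
    url = base_url.rstrip("/") + "/audio/speech"
    payload = {"model": model, "input": text, "voice": voice}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )


# Assumed local endpoint and voice name; adjust to whatever your TTS server exposes.
local_request = build_tts_request("http://127.0.0.1:8880/v1", "Hello from a local voice.", "af_sky")
# The same helper targets the cloud API by changing only the base URL.
cloud_request = build_tts_request("https://api.openai.com/v1", "Hello from the cloud.", "alloy")
```

Passing either request to urllib.request.urlopen would return the synthesized audio bytes, provided a compatible server is listening at that address.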

Real-time / Low-latency

Provides real-time voice interactions with automatic transport selection for low latency.

MCP Integration

Seamless integration with Claude Code and other MCP clients for voice-enabled workflows.

Silence Detection

Automatically stops recording when you stop speaking to create a natural conversation flow.
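
One common way to implement this kind of behavior is an energy threshold: recording ends once several consecutive audio frames fall below a loudness level after speech has been heard. The sketch below shows that idea only. It is not VoiceMode's actual detector, and the names rms and record_until_silence, the threshold, and the frame counts are all hypothetical.

```python
import math
from typing import Iterable, List, Sequence


def rms(frame: Sequence[int]) -> float:
    """Root-mean-square amplitude of one frame of PCM samples."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def record_until_silence(
    frames: Iterable[Sequence[int]],
    threshold: float = 500.0,      # assumed loudness cutoff for 16-bit samples
    max_silent_frames: int = 3,    # assumed number of quiet frames that ends a turn
) -> List[List[int]]:
    """Collect frames until enough consecutive quiet frames follow speech."""
    recorded: List[List[int]] = []
    heard_speech = False
    silent_run = 0
    for frame in frames:
        recorded.append(list(frame))
        if rms(frame) >= threshold:
            heard_speech = True
            silent_run = 0
        elif heard_speech:
            # Leading silence before any speech is ignored; only trailing silence counts.
            silent_run += 1
            if silent_run >= max_silent_frames:
                break
    return recorded


quiet, loud = [0, 0, 0, 0], [1000, -1000, 1000, -1000]
captured = record_until_silence([quiet, loud, loud, quiet, quiet, quiet, loud])
```

In this example the final loud frame is never captured, because three quiet frames after speech have already ended the recording.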

Multiple Transports

Supports the local microphone or, optionally, LiveKit room-based communication.
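
As a rough illustration of automatic transport selection between these two options, the sketch below prefers a LiveKit room when one is reachable and otherwise falls back to the local microphone. The function choose_transport, the preference values, and the fallback order are assumptions made for this example; the project's real selection logic may differ.

```python
def choose_transport(preference: str, livekit_available: bool) -> str:
    """Pick a transport name from a user preference and LiveKit reachability.

    Hypothetical helper: "auto" is assumed to prefer LiveKit and fall back to local.
    """
    if preference not in {"auto", "local", "livekit"}:
        raise ValueError(f"unknown transport preference: {preference!r}")
    if preference == "auto":
        return "livekit" if livekit_available else "local"
    if preference == "livekit" and not livekit_available:
        raise RuntimeError("LiveKit was requested but no room is reachable")
    return preference


selected = choose_transport("auto", livekit_available=False)
```

Keeping the decision in a small pure function like this makes the fallback rule easy to test without any audio hardware or network access.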

Audience

Developers: Integrate the VoiceMode MCP server with Claude Code and other MCP clients for voice-enabled AI workflows.
Claude Code users: Interact with Claude Code using natural voice conversations via MCP.
Privacy-conscious users: Run VoiceMode with local STT/TTS to preserve privacy and keep data on-device.

Tags

VoiceMode, MCP, Claude Code, STT, TTS, OpenAI-compatible, LiveKit, local, audio, voice, silence-detection, Whisper.cpp, Kokoro