Voice Vibe Code

Voice Input — Built for the AI Vibe Coding & Agent Era
VoiceVibeCode

Your voice knows
your code context

The only voice input tool that reads your active coding session and recognizes your project's variable names, class names, and API parameters in real-time.

Other voice tools break on your code

💥

"UserService" becomes "user service"

Generic dictation tools have zero awareness of your codebase. They guess at technical terms and get them wrong every time.

💬

No punctuation, no formatting

Raw speech-to-text dumps a stream of words with no punctuation, no capitalization, and no structure. You spend more time fixing than coding.

🌐

Mixed language is a nightmare

Speaking Chinese with English code terms? Most tools force you to pick one language and mangle everything else.

How VoiceVibeCode works

A two-stage pipeline that understands both your voice and your code.

1

Detect active coding context

VoiceVibeCode reads your active Claude Code / Codex session, extracts variable names, class names, function names, and project-specific terminology.

2

Multi-engine speech recognition

Choose WhisperKit (on-device), Qwen3-ASR (advanced Chinese), or Volcengine (cloud-based Flash API). Pick the engine that fits your workflow.

3

LLM post-processing (local or cloud)

An LLM polishes the raw transcription using your coding context: fixes homophones, adds punctuation, removes fillers, and ensures technical terms are correct. This step can also run locally via NovaMLX.

4

Direct insertion into your app

Text appears instantly in your terminal, IDE, browser, or any active application. No copy-paste needed.

System Permissions

VoiceVibeCode needs two system permissions to work. Both are standard macOS security prompts.

🔑

Accessibility

Required for global hotkeys (Push-to-Talk, language switching) and inserting text into other applications. VoiceVibeCode uses the macOS Accessibility API to type text directly at your cursor position.

How to grant: System Settings → Privacy & Security → Accessibility → enable VoiceVibeCode
🎙️

Microphone

Required for recording your voice. Audio is processed entirely on-device via WhisperKit — no audio data is ever sent to any server.

How to grant: System Settings → Privacy & Security → Microphone → enable VoiceVibeCode
Both permissions are requested on first launch. If you deny them, the menu bar icon will show ⚠️ and guide you to the correct System Settings page.

See the difference

Same voice input, different results. Active project: a React + TypeScript codebase.

Other voice tools
"Refactor the user service class to use the
authenticate method instead of login and add
error handling for time out exceptions"
VoiceVibeCode
"Refactor the UserService class to use the
authenticate method instead of login, and add
error handling for TimeoutException."

Everything you need, nothing you don't

🔗

Deep coding tool integration

Detects your active Claude Code or Codex session and extracts project-specific terminology. Works best with tmux for reliable working directory detection.

🧠

LLM-powered polishing

Post-processing via LLM removes fillers, fixes homophones, adds smart punctuation, and ensures technical terms match your project.

3 ASR engines to choose from

WhisperKit (on-device), Qwen3-ASR (advanced Chinese), or Volcengine (cloud Flash API). On-device or cloud — your choice.

🌐

Mixed-language input

Speak Chinese with English code terms, or any combination of 9 supported languages. Seamlessly.

🎤

Voice commands

Say "Enter" at the end to press Return. More commands coming soon.

🚀

Works everywhere

iTerm2, Terminal, VS Code, Xcode, Chrome, Safari, WeChat, Slack. Direct text insertion into any app.

🔒

Privacy-first

Speech recognition runs on-device. LLM post-processing can also run locally via NovaMLX — zero data leaves your Mac.

🎨

Tone adjustment

Switch between formal, casual, and concise modes depending on where you're typing.

100% Local, 100% Secure

Your voice data never has to leave your Mac. Both stages of the pipeline can run entirely on-device.

🎙️

Stage 1: Speech Recognition — On-device or Cloud

VoiceVibeCode offers 3 ASR engines: WhisperKit and Qwen3-ASR run entirely on your Mac. Volcengine uses cloud-based Flash API for instant results. Choose on-device for privacy, or cloud for convenience.

🧠

Stage 2: LLM Post-Processing — Local or Cloud

Pair with NovaMLX to run the LLM post-processing stage locally as well. With this setup, both stages run on your machine — your voice data never touches the internet.

Prefer cloud LLMs? No problem. VoiceVibeCode supports OpenAI, Anthropic, DeepSeek, and more for post-processing. Your speech recognition is always on-device regardless of which LLM mode you pick.

Not just for coding

VoiceVibeCode is purpose-built for Vibe Coding, but it's also an excellent voice input tool for everyday productivity. Smart punctuation, multi-language support, and direct text insertion work great in any application.

Email & Messages Documents & Notes Chat & Slack WeChat & DingTalk Browser & Forms Any text field

Works with your tools

Designed for the AI-powered development workflow.

Claude Code OpenAI Codex OpenClaw Hermes iTerm2 + tmux Terminal VS Code Xcode Chrome Safari

Free & Open Source

Free

All features, no restrictions. Download and start using.

Download from GitHub