Native recording & live waveform
High-fidelity capture via the Rust-native CPAL library, with a sleek floating overlay. Voice Activity Detection ends the take and starts transcribing the moment you stop speaking.
Simplevoice is a privacy-first speech-to-text app for the desktop that works fully offline. Press a key, talk, and your words land in any app: no cloud required, no account, no telemetry.
From a single keystroke to finished text: a complete dictation loop, running entirely on-device.
No setup ritual. No dashboard to learn. The whole loop takes a single keystroke — speak, and it's written.
Hit Ctrl / Cmd + Shift + Space from any app. A floating overlay appears and starts listening — wherever you are.
Talk at your own pace. Voice Activity Detection notices when you stop and transcribes on-device, automatically.
Your words are auto-pasted straight into the active field — or copied to the clipboard with a second hotkey.
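The stop-detection step above can be illustrated with a simple energy-based endpointing routine. This is a minimal sketch and not Simplevoice's actual Voice Activity Detection: the function names, the RMS threshold, and the "hangover" count of quiet frames are all assumptions chosen for illustration, and the app itself is written in Rust rather than Python.

```python
import math
from typing import Iterable, Optional, Sequence


def rms(frame: Sequence[float]) -> float:
    """Root-mean-square energy of one frame of samples in [-1.0, 1.0]."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def find_end_of_speech(
    frames: Iterable[Sequence[float]],
    threshold: float = 0.02,    # assumed energy level separating speech from silence
    hangover_frames: int = 25,  # assumed number of quiet frames that ends the take
) -> Optional[int]:
    """Return the index of the frame at which the take should end, or None.

    The take ends only after speech has been heard and is then followed by
    `hangover_frames` consecutive quiet frames, so leading silence and short
    pauses between words do not stop the recording.
    """
    heard_speech = False
    quiet_run = 0
    for i, frame in enumerate(frames):
        if rms(frame) >= threshold:
            heard_speech = True
            quiet_run = 0
        elif heard_speech:
            quiet_run += 1
            if quiet_run >= hangover_frames:
                return i
    return None
```

A real detector would typically be more robust to background noise than a fixed energy threshold, but the control flow (wait for speech, then wait for sustained silence, then hand the audio to the transcriber) is the same idea.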
A focused tool that does one job exceptionally well — capture your voice and turn it into finished text, wherever you work.
Multiple ASR backends run entirely on your machine, with hardware acceleration and a safe CPU fallback.
Optional bring-your-own-key transcription. Keys are stored only in the OS keyring — never on disk.
A global hotkey records from anywhere, and your dictation is typed into the active field. On Wayland, text is injected with a pure-Rust implementation.
Browse and download recommended models in-app with live progress, or import your own.
A full local history of transcriptions, durations and word counts in SQLite, with totals and 7-day / 30-day / all-time charts. Nothing leaves your machine.
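To make the local history concrete, here is a minimal sketch of how per-transcription rows and 7-day / 30-day / all-time totals could be kept in SQLite. The table name, column names, and helper functions are hypothetical and are not taken from Simplevoice's source; the sketch only shows that these statistics need nothing beyond a local database file.

```python
import sqlite3
import time

DAY = 86_400  # seconds in one day

# Hypothetical schema: one row per finished dictation.
SCHEMA = """
CREATE TABLE IF NOT EXISTS transcriptions (
    id          INTEGER PRIMARY KEY,
    created_at  INTEGER NOT NULL,  -- Unix time in seconds
    duration_s  REAL    NOT NULL,  -- length of the recording
    word_count  INTEGER NOT NULL,
    text        TEXT    NOT NULL
);
"""


def open_history(path=":memory:"):
    """Open (or create) the history database at `path`."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def record(conn, text, duration_s, created_at=None):
    """Store one transcription with its duration and word count."""
    ts = int(time.time()) if created_at is None else int(created_at)
    conn.execute(
        "INSERT INTO transcriptions (created_at, duration_s, word_count, text) "
        "VALUES (?, ?, ?, ?)",
        (ts, duration_s, len(text.split()), text),
    )
    conn.commit()


def totals(conn, days=None, now=None):
    """Return (count, total_seconds, total_words) for the last `days` days.

    Passing days=None returns all-time totals.
    """
    now = int(time.time()) if now is None else int(now)
    query = (
        "SELECT COUNT(*), COALESCE(SUM(duration_s), 0), "
        "COALESCE(SUM(word_count), 0) FROM transcriptions"
    )
    params = ()
    if days is not None:
        query += " WHERE created_at >= ?"
        params = (now - days * DAY,)
    return conn.execute(query, params).fetchone()
```

Calling `totals(conn, days=7)`, `totals(conn, days=30)`, and `totals(conn)` would give the three ranges the charts are described as covering.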
Choose the engine that fits your hardware and the model you want — all running locally — or route to a cloud provider with your own API key.
Anthropic Claude is intentionally not offered for transcription — Claude does not support audio input.
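The "hardware acceleration with a safe CPU fallback" behavior described for the local engines can be pictured as trying backends in order of preference. The sketch below is an assumption about the general pattern, not Simplevoice's code: the `transcribe_with_fallback` function and the backend names in the usage comment are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

# A backend is any callable that turns raw audio bytes into text.
Transcriber = Callable[[bytes], str]


def transcribe_with_fallback(
    audio: bytes,
    backends: Sequence[Tuple[str, Transcriber]],
) -> Tuple[str, str]:
    """Try each backend in order and return (backend_name, text).

    The first backend that completes without raising wins. If every
    backend fails, a RuntimeError lists each failure.
    """
    errors: List[str] = []
    for name, run in backends:
        try:
            return name, run(audio)
        except Exception as exc:  # e.g. missing GPU driver or unsupported device
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all backends failed: " + "; ".join(errors))


# Hypothetical usage: prefer an accelerated backend, keep the CPU as the last resort.
# name, text = transcribe_with_fallback(audio, [("gpu", run_on_gpu), ("cpu", run_on_cpu)])
```

Listing the CPU backend last means a machine without a supported accelerator still produces a transcript, just more slowly.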
Simplevoice is private by architecture, not by policy. Recognition runs fully on-device — nothing is sent anywhere unless you explicitly choose a cloud engine.
With a local engine, your audio is transcribed entirely on your device. No internet required.
No analytics, no tracking, no phone-home. There is nothing to opt out of, because we collect nothing.
No sign-up, no login, no profile. Install it and start talking — you stay anonymous.
If you opt into a cloud engine, API keys live only in your operating system's keyring, never in plain-text files on disk or in localStorage.
Every line is inspectable and auditable on GitHub. Trust the code; don't just take our word for it.
Free and open source. Pick your platform and architecture — every build lives on GitHub Releases.
Prefer to compile it yourself? Build from source.
Everything worth knowing before you start talking.