I built HyperVoice because typing was the slowest part of my day. Press one key, talk, and your words land at the cursor in any app — voice to text with the speech engine running locally on your machine, so your audio never leaves it.
Free forever — 500 words/day, no credit card required.
Apple Silicon · signed & notarized by Apple — opens in one click. Install guide →
Enter the showroom — a walkable 3D gallery →Already powering real dictation
How It Works
The whole product is one loop, about three seconds long. Here it is, step by step.

The floating pill — this is the actual animation from the app: orb while you speak, amber while it transcribes, green when your words have landed.
Ctrl+Shift+Space from any app — or rebind it. Tap to toggle, or hold to push-to-talk. The app sits in your tray; you never alt-tab to it.
Whisper or Parakeet transcribes on your machine — your audio never leaves it. With a GPU (Vulkan: NVIDIA, AMD, Intel) it's back in under a second.
The text pastes itself into whatever was focused — editor, email, terminal, chat. Want it cleaned up first? Optional AI styles rewrite it on the way through.
▍
500 words/day free forever. No credit card required.
The Math
I type about 40 words a minute on a good day. I talk at 150. That gap is the whole reason I built this.
Average typing speed
Average speaking speed
Voice vs. keyboard
If you type 3 hours daily
A 1,000-word email takes 25 minutes to type but only ~7 minutes to dictate.
Core Features
Everything here ships in the free download. The speech engine runs on your machine; the optional AI cleanup is the only cloud piece.
Press Ctrl+Shift+Space, talk, and the words land at your cursor — VS Code, Slack, Gmail, a terminal, anything with a text field.
Whisper and Parakeet run on your machine — your audio never leaves it, and transcription works with no internet at all.
Vulkan means any GPU — NVIDIA, AMD, Intel — on Windows and Linux. Sub-second results; falls back to CPU if there's no GPU.
A tiny always-on-top pill shows what's happening — orb while you speak, timer, green flash when the text lands. Drag it anywhere, or hide it.
Optional cloud cleanup: speak rough, land polished. 7 built-in styles plus your own prompts — via your API keys or HyperVoice Cloud on Pro.
Every dictation is searchable locally, your stats show how much typing you've skipped, and a custom dictionary handles jargon and names.
Works Everywhere
If you can type in it, you can talk into it — I dictate into Claude Code, Slack, and email all day. It's also a genuine relief if RSI or hand strain makes typing the painful part.
ChatGPT, Claude, Gemini
Slack, Teams, Discord
Cursor, VS Code, Claude Code
Email, browser, docs, any app
System-level text injection — HyperVoice works with every application on your computer.
By default you get the raw transcript. Flip on an AI style and a model rewrites it before it lands — ums removed, email formatted, ticket structured. This step runs in the cloud: your own OpenAI or Anthropic key, or HyperVoice Cloud on Pro.
Remove filler words, fix grammar, and tighten your speech into clean written text.
Speak your thoughts casually, get a polished professional email ready to send.
Speak naturally, get a casual but professional message ready for Teams or Slack.
Ramble about a meeting for 2 minutes — get structured notes with action items.
Convert a stream of consciousness into a concise, organized bullet-point list.
Describe your day's work and get a formatted status update with progress and next steps.
Describe a bug or task verbally and get a structured ticket ready for Jira or GitHub.
Plus unlimited custom styles — write your own system prompt once and it becomes a one-tap mode.
Your audio never leaves your machine.
Why HyperVoice
Most dictation tools are macOS-only, cloud-only, or subscription-only. I made the opposite calls on all three.
One of the very few dictation apps with GPU acceleration on both Windows and Linux. Most alternatives run CPU-only or are macOS-only.
Transcribe speech in 99 languages with state-of-the-art accuracy. Multiple model sizes to match your needs.
Buy once and own it forever, while most dictation tools are subscription-only. Prefer to subscribe? Pro starts at $6.67/mo — or bring your own API keys for zero-markup AI cleanup.
Tiny memory footprint, instant startup. No bloated runtimes. Debug panel shows all pipeline timings.
Custom hotkeys, push-to-talk or toggle mode, 11 AI models, GPU or CPU mode, dark/light theme, auto-paste toggle. Make it work the way you want.
Audio and transcription stay on-device; history is stored locally. We log only technical metadata (model, latency, errors) to keep the service healthy — never your audio or transcribed text. Cloud processing is entirely opt-in.
How We Compare
An honest look — the alternatives win some rows too. Full breakdowns on the comparison pages.
| HyperVoice | Wispr Flow | Superwhisper | Dragon | |
|---|---|---|---|---|
| Where it runs | On your device | Cloud servers | On your device | On your device |
| Platforms | Windows, Linux, macOS | Windows, Mac, mobile | macOS only | Windows only |
| GPU acceleration | Vulkan — any GPU | Cloud-side | Metal — Apple only | CPU only |
| Pay once (lifetime) | Yes — $49.99 | No — subscription | One-time option | Yes — $200–700 |
| Voice training | Not needed | Not needed | Not needed | Required |
| Free tier | 500 words/day | None | Limited | Trial only |
Competitor details from our comparison pages, current as of 2026. Products change — see each vendor for the latest.
Pricing
500 words a day, free forever — enough to know within a week whether this is for you. Upgrade when the cap pinches or you want the built-in AI cleanup.
I kept a true lifetime license: $49.99 once, yours forever. Most dictation tools won't sell you that.
500 words/day. No credit card required.
No credit card. No strings.
Unlimited dictation + built-in AI cleanup. 7-day free trial.
No commitment. Cancel anytime.
$6.67/mo — save 17%
Unlimited dictation + built-in AI cleanup. 7-day free trial.
No commitment. Cancel anytime.
Unlimited dictation, pay once. BYOK for AI cleanup.
Roadmap
I build HyperVoice in public — much of it live on camera in the “Vibe Coding” series on YouTube. Here's what I'm working toward next.
Cloud-based speech-to-text for higher accuracy on lower-end hardware.
Linux shipped (it's in beta now) — Apple platforms are next on the list.
Dictation already works in 99 languages — next up: speak in one language, paste in another.
Record and transcribe meetings with speaker diarization.
Drag and drop audio/video files for batch transcription.
FAQ
Download it, press Ctrl+Shift+Space, and say something. That's the whole onboarding.
Get HyperVoice — Free →