// yap — from the makers of voice mirror

Talk instead of type.
Anywhere on Windows.

Press a hotkey, speak, press again. Yap transcribes on your GPU — locally, privately, free — cleans up the "um"s and grammar with AI, and types the result into whatever app you're in.

🎤 your voice → Whisper / Parakeet (local, on your GPU)
→ AI cleanup (local model, optional) → typed into the focused app

Download for Windows Star on GitHub

● nightly Windows Free & open source · MIT Tauri 2 · Rust · Svelte 5

unsigned nightly build — SmartScreen: “More info” → “Run anyway” · all releases

37 seconds — hotkey, cleanup, done · unmute for sound

// why yap

Refuse the choice.

Free dictation tools transcribe locally but dump out raw, unpunctuated text. Polished apps charge ~$15/month and send your voice to the cloud. Yap does both halves — locally.

	Free / OSS tools	Paid apps (~$15/mo)	Yap
Transcribes locally — voice never uploaded	✅	❌ cloud	✅
AI cleanup — filler, punctuation, grammar	❌ raw text	✅	✅ even fully offline
Price	Free	~$15/mo	Free, forever
Open source	✅	❌	✅ MIT

// the real thing, unedited

An actual capture, not a mockup

Dictating into Notepad with AI cleanup on — filler words in, clean text out. Recorded straight off the screen.

Yap demo — dictating into Notepad, AI cleanup stripping the filler words

// small pill, deep toolbox

More than a hotkey and a mic

gpu

Any GPU. No CUDA.

NVIDIA, AMD, or Intel — Vulkan + DirectML cover them all, with CPU fallback. 14 models downloaded in-app, SHA-256 verified.

cleanup

Offline AI cleanup

One click downloads a small local model that strips filler words, fixes grammar, and resolves "no wait, I meant…" — entirely on your PC. Or bring your own endpoint.

routing

Per-app profiles

Yap detects the app you're dictating into and applies the right cleanup style — terse for Slack, formal for email, code-aware for your IDE.

rewrite

Edit selection by voice

Select text anywhere, hit the edit hotkey, and say what to change — Yap grabs the selection and rewrites it from your spoken instruction.

dictionary

Correction dictionary

Names, jargon, product terms — teach Yap the words the models get wrong and it fixes them on every dictation.

stats

History & streaks

Local-only history with words dictated, time saved vs typing, and a daily streak — clearable, cappable, never uploaded.

$ also: toggle or push-to-talk · live partial transcripts · portable mode · signed auto-updates · translate-to-English

// pick your engine

14 models. One small installer.

From a 55 MB Moonshine to Whisper Large v3 — including multilingual Parakeet, NVIDIA Canary, and CJK-tuned SenseVoice. Downloaded in-app when you want them, unloaded from VRAM when idle.

Parakeet TDT V3Parakeet V2Whisper Large v3Whisper Large v3 TurboDistil-Large v3.5Whisper MediumWhisper SmallCanary 1B v2Canary 180M FlashCohereSenseVoiceGigaAM v3Moonshine BaseBreeze-ASR

Yap settings — audio devices, pill and overlay appearance, model picker

The Yap pill — a minimal floating capsule

the whole UI: a pill, an overlay, and one settings window

// the context mirror family

Yap dictates. Voice Mirror builds.

Yap grew out of Voice Mirror's voice stack and turned dictation into its whole job. When you're ready to build entire apps by voice — with an AI that sees and drives them — that's Voice Mirror.

Meet Voice Mirror

Your voice never leaves your PC.

Free, MIT-licensed, no accounts, no telemetry. Grab the nightly and start talking.

Download for Windows Star on GitHub Join the Discord

Talk instead of type. Anywhere on Windows.