Wordbird is an open-source voice dictation app for macOS, powered by Nvidia Parakeet running locally on Apple Silicon via MLX. You can teach it project-specific terms like brand names or frameworks using a project-level WORDBIRD.md file.
Run it via `uvx wordbird`.
Press a hotkey, speak, and your words are transcribed and pasted into whatever app is focused. A small LLM, also running locally, post-processes the transcription to fix errors. If you use apps like VS Code, Zed, Terminal.app or iTerm, Wordbird will know which directory you're in, look up a project-specific WORDBIRD.md file, and if it exists, use it for post-processing the transcription. With the VS Code extension this even works via SSH.
My friend Till Hoffmann wrote this because he was "tired of yet another subscription", and standard dictation just gets too many technical terms wrong. I'm posting this here on his behalf because he's not as active on HN. I've really fallen in love with Wordbird, already made some small PRs, and I hope that many here will find it equally useful.
Really eager to hear feedback (I'll make sure Till monitors this thread as well)!