It supports both Parakeet and Whisper models, including Parakeet MLX on Apple Silicon, and works across Linux, macOS, and Windows (x86 and ARM).
One feature people seem to enjoy is wake word activation. You can trigger recording just by saying a phrase like:
“Dumbledore”
“Mr Anderson”
“Hey Jarvis”
Then just start speaking and your words are transcribed instantly.
Under the hood the app is written in pure Go with CGO, and the Whisper and Parakeet inference code is custom implemented and optimized for CPU performance, so it runs very fast even without a GPU.
This project was a real labor of love and a deep dive into speech inference and systems programming.