So I decided to see if I could vibe code it with 0 macOS app & Swift experience.
It uses a local binary of whisper.cpp (a fast implementation of OpenAI's Whisper voice-to-text model in C++).
Github: https://github.com/richardwu/openwhisper
I also decided to take this as an opportunity to compare 3 agentic coding harnesses:
Cursor w/ Opus 4.6: - Best one-shot UI by far - Didn't get permissioning correct - Had issues making the "Cancel recording" hotkey being turned on all the time
Claude Code w/ Opus 4.6: - Fewest turns to get main functionality right (recording, hotkeys, permissions) - Was able to get a decent UI with a few more turns
Codex App w/ Codex 5.3 Extra-High: - Worst one-shot UI - None of the functionality worked without multiple subsequent prompts