Show HN: Chirp – Local Windows dictation with ParakeetV3 no executable required

https://github.com/Whamp/chirp
6•whamp•1h ago
I’ve been working in fairly locked-down Windows environments where I’m allowed to run Python but not install or launch new `.exe` files. In addition, the built-in Windows dictation tools are blocked (and the only good one isn't local anyway). At the same time, I wanted accurate, fast dictation without sending audio to a cloud service and without needing a GPU. Most speech-to-text setups I tried either required special launchers or GPU access, or were awkward to run day to day.

To scratch that itch, I built Chirp, a Windows dictation app that runs fully locally, uses NVIDIA’s ParakeetV3 model, and is managed end‑to‑end with `uv`. If you can run Python on your machine, you should be able to run Chirp—no additional executables required.

Under the hood, Chirp uses the Parakeet TDT 0.6B v3 ONNX bundle. ParakeetV3 has accuracy in the same ballpark as Whisper-large-v3 (multilingual WER ~4.9 vs ~5.0 on the Open ASR Leaderboard), but it’s much faster and runs comfortably on CPU.
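For readers unfamiliar with the metric: WER (word error rate) is the word-level edit distance between a reference transcript and the model's output, divided by the number of reference words, so a WER of about 5 means roughly 5 errors per 100 reference words. Below is a minimal sketch of that calculation. The function name `word_error_rate` and the sample sentences are illustrative only and are not taken from Chirp; real evaluations usually also normalize casing and punctuation before scoring.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (the previous row) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (ref_word != hyp_word)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    ref = "open the pull request and assign it to me"
    hyp = "open the poll request and assign to me"
    # One substitution (pull -> poll) and one deletion (it) over 9 words.
    print(f"WER: {word_error_rate(ref, hyp):.1%}")
```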

The flow is:

- One-time setup that downloads and prepares the ONNX model: `uv run python -m chirp.setup`

- A long-running CLI process: `uv run python -m chirp.main`

- A global hotkey that starts/stops recording and injects text into the active window.
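The hotkey behavior in that last step amounts to a two-state toggle. The sketch below is a hypothetical illustration, not Chirp's code: `DictationToggle`, `start_recording`, `stop_recording`, `transcribe`, and `inject_text` are made-up names, and the audio, model, and keyboard pieces are passed in as plain callables so the logic runs without any hardware.

```python
from typing import Callable


class DictationToggle:
    """Hypothetical two-state toggle: idle -> recording -> idle."""

    def __init__(
        self,
        start_recording: Callable[[], None],
        stop_recording: Callable[[], bytes],
        transcribe: Callable[[bytes], str],
        inject_text: Callable[[str], None],
    ) -> None:
        self._start_recording = start_recording
        self._stop_recording = stop_recording
        self._transcribe = transcribe
        self._inject_text = inject_text
        self.recording = False

    def on_hotkey(self) -> None:
        """Call this every time the global hotkey fires."""
        if not self.recording:
            # First press: begin capturing audio.
            self._start_recording()
            self.recording = True
            return
        # Second press: stop, transcribe, and type the result.
        self.recording = False
        audio = self._stop_recording()
        text = self._transcribe(audio)
        if text:  # skip injection when nothing was recognized
            self._inject_text(text)


if __name__ == "__main__":
    typed: list[str] = []
    toggle = DictationToggle(
        start_recording=lambda: None,
        stop_recording=lambda: b"fake-pcm",
        transcribe=lambda audio: "hello world",
        inject_text=typed.append,
    )
    toggle.on_hotkey()  # start recording
    toggle.on_hotkey()  # stop, transcribe, inject
    print(typed)
```

Keeping the toggle free of audio and keyboard dependencies is a design choice of this sketch; it makes the start/stop logic easy to test separately from the model.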

A few details that might be interesting technically:

- Local‑only STT: Everything runs on your machine using ONNX Runtime; by default it uses CPU providers, with optional GPU providers if your environment allows.

- Config‑driven behavior: A `config.toml` file controls the global hotkey, model choice, quantization (`int8` option), language, ONNX providers, and threading. There’s also a simple `[word_overrides]` map so you can fix tokens that the model consistently mishears.

- Post‑processing pipeline: After recognition, there’s an optional “style guide” step where you can specify prompts like “sentence case” or “prepend: >>” for the final text.

- No clipboard gymnastics required on Windows: The app types directly into the focused window; there are options for clipboard‑based pasting and cleanup behavior for platforms where that makes more sense.

- Audio feedback: Start/stop sounds (configurable) let you know when the mic is actually recording.

So far I’ve mainly tested this on my own Windows machines with English dictation and CPU‑only setups. There are probably plenty of rough edges (different keyboard layouts, language settings, corporate IT policies, etc.), and I’d love feedback from people who:

- Work in restricted corporate environments and need local dictation.

- Have experience with Parakeet/Whisper or ONNX Runtime and see obvious ways to improve performance or robustness.

- Want specific features (e.g., better multi-language support, more advanced post-processing, or integrations with their editor/IDE).

Repo is here: `https://github.com/Whamp/chirp`

If you try it, I’d be very interested in:

- CPU usage and latency on your hardware,

- How well it behaves with your keyboard layout and applications,

- Any weird failure cases or usability annoyances you run into.

Happy to answer questions and dig into technical details in the comments.

Comments

whamp•1h ago
btw this is my first open-source project
lxe•26m ago
I've done something similar for Linux and Mac. I originally used Whisper and then switched to Parakeet. I much prefer Whisper after playing with both. Maybe I'm not configuring Parakeet correctly, but the transcription that comes out of Whisper is usually pretty much spot on. It automatically removes all the "ums" and all the "ahs", and it's just way more natural, in my opinion. I'm using whisper.cpp with CUDA acceleration. This whole comment is just written with me dictating to Whisper, and it's probably going to automatically add quotes correctly, there's going to be no ums, there's going to be no ahs, and everything's just going to be great.
