frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Whispering – Open-source, local-first dictation you can trust

https://github.com/epicenter-so/epicenter/tree/main/apps/whispering
63•braden-w•2h ago
Hey HN! Braden here, creator of Whispering, an open-source speech-to-text app.

I really like dictation. For years, I relied on transcription tools that were almost good, but they were all closed-source. Even a lot of them that claimed to be “local” or “on-device” were still black boxes that left me wondering where my audio really went.

So I built Whispering. It’s open-source, local-first, and most importantly, transparent with your data. All your data is stored locally on your device. For me, the features were good enough that I left my paid tools behind (I used Superwhisper and Wispr Flow before).

Productivity apps should be open-source and transparent with your data, but they also need to match the UX of paid, closed-software alternatives. I hope Whispering is near that point. I use it for several hours a day, from coding to thinking out loud while carrying pizza boxes back from the office.

Here’s an overview: https://www.youtube.com/watch?v=1jYgBMrfVZs, and here’s how I personally am using it with Claude Code these days: https://www.youtube.com/watch?v=tpix588SeiQ.

There are plenty of transcription apps out there, but I hope Whispering adds some extra competition from the OSS ecosystem (one of my other OSS favorites is Handy https://github.com/cjpais/Handy). Whispering has a few tricks up its sleeve, like a voice-activated mode for hands-free operation (no button holding), and customizable AI transformations with any prompt/model.

Whispering used to be in my personal GH repo, but I recently moved it as part of a larger project called Epicenter (https://github.com/epicenter-so/epicenter), which I should explain a bit...

I’m basically obsessed with local-first open-source software. I think there should be an open-source, local-first version of every app, and I would like them all to work together. The idea of Epicenter is to store your data in a folder of plaintext and SQLite, and build a suite of interoperable, local-first tools on top of this shared memory. Everything is totally transparent, so you can trust it.

Whispering is the first app in this effort. It’s not there yet regarding memory, but it’s getting there. I’ll probably write more about the bigger picture soon, but mainly I just want to make software and let it speak for itself (no pun intended in this case!), so this is my Show HN for now.

I just finished college and was about to move back with my parents and work on this instead of getting a job…and then I somehow got into YC. So my current plan is to cover my living expenses and use the YC funding to support maintainers, our dependencies, and people working on their own open-source local-first projects. More on that soon.

Would love your feedback, ideas, and roasts. If you would like to support the project, star it on GitHub here (https://github.com/epicenter-so/epicenter) and join the Discord here (https://go.epicenter.so/discord). Everything’s MIT licensed, so fork it, break it, ship your own version, copy whatever you want!

Comments

solarkraft•1h ago
Cool! I just started becoming interested in local transcription myself.

If you add Deepgram listen API compatibility, you can do live transcription via either Deepgram (duh) or OWhisper: https://news.ycombinator.com/item?id=44901853

(I haven’t gotten the Deepgram JS SDK working with it yet, currently awaiting a response by the maintainers)

braden-w•1h ago
Thank you for checking it out! Coincidentally, it's on the way:

https://github.com/epicenter-so/epicenter/pull/661

In the middle of a huge release that sets up FFMPEG integration (OWhisper needs very specifically formatted files), but hoping to add this after!

braden-w•1h ago
For those checking out the repo this morning, I'm in the middle of a release that adds Whisper C++ support!

https://github.com/epicenter-so/epicenter/pull/655

After this pushes, we'll have far more extensive local transcription support. Just fixing a few more small things :)

Johnny_Bonk•1h ago
Great work! I've been using Willow Voice but I think I will migrate to this (much cheaper) but they do have a great UI or UX just by hitting a key to start recording and the context goes into whatever text input you want. I haven't installed whispering yet but will do so. P.S
braden-w•44m ago
Amazing, thanks for giving it a try! Let me know how it goes and feel free to message me any time :) happy to add any features that you miss from closed-source altneratives!
newman314•1h ago
Does Whispering support semantic correction? I was unable to find confirmation while doing a quick search.
braden-w•49m ago
Hmm, we support prompts at both 1. the model level (the Whisper supports a "prompt" parameter that sometimes works) and 2. transformations level (inject the transcribed text into a prompt and get the output from an LLM model of your choice). Unsure how else semantic correction can be implemented, but always open expand the feature set greatly over the next few weeks!
joshred•38m ago
They might not now how whisper works. I suspect that the answer to their question is 'yes' and the reason they can't find a straightforward answer through your project is that the answer is so obvious to you that it's hardly worth documenting.

Whisper for transcription tries to transform audio data into LLM output. The transcripts generally have proper casing, punctuation and can usually stick to a specific domain based on the surrounding context.

dumbmrblah•47m ago
I’ve been using whispering for about a year now, it has really changed how I interact with the computer. I make sure to buy mice or keyboards that have programmable hotkeys so that I can use the shortcuts for whispering. I can’t go back to regular typing at this point, just feels super inefficient. Thanks again for all your hard work!
glial•47m ago
This is wonderful, thank you for sharing!

Do you have any sense of whether this type of model would work with children's speech? There are plenty of educational applications that would value a privacy-first locally deployed model. But, my understanding is that Whisper performs pretty poorly with younger speakers.

codybontecou•40m ago
Now we just need text to speech so we can truly interact with our computers hands free.
wkcheng•32m ago
Does this support using the Parakeet model locally? I'm a MacWhisper user and I find that Parakeet is way better and faster than Whisper for on-device transcription. I've been using push-to-transcribe with MacWhisper through Parakeet for a while now and it's quite magical.
satisfice•24m ago
Windows Defender says it is infected.
sa-code•9m ago
This needs to be higher, the installer on the README has a trojan.
barryfandango•7m ago
I'm no expert, but since it acts as a keyboard wedge it's likely to be unpopular with security software.
mrs6969•9m ago
am I not getting it correctly; it says local is possible but can't find any information about how to run it without any api key?

I get the whispers models, and do what? how to run in a device without internet, no documentation about it...

rpdillon•4m ago
The docs are pretty clear that you need to use speaches if you want entirely local operation.

https://speaches.ai/

Anna's Archive: An Update from the Team

https://annas-archive.org/blog/an-update-from-the-team.html
523•jerheinze•3h ago•189 comments

Show HN: We started building an AI dev tool but it turned into a Sims-style game

https://www.youtube.com/watch?v=sRPnX_f2V_c
30•max-raven•44m ago•13 comments

My Retro TVs

https://www.myretrotvs.com/
78•the-mitr•2h ago•15 comments

Show HN: Whispering – Open-source, local-first dictation you can trust

https://github.com/epicenter-so/epicenter/tree/main/apps/whispering
65•braden-w•2h ago•17 comments

How much do electric car batteries degrade?

https://www.sustainabilitybynumbers.com/p/electric-car-battery-degradation
41•xnx•1h ago•35 comments

Show HN: I built an app to block Shorts and Reels

https://scrollguard.app/
350•adrianhacar•2d ago•133 comments

FFmpeg Assembly Language Lessons

https://github.com/FFmpeg/asm-lessons
236•flykespice•5h ago•68 comments

The Cutaway Illustrations of Fred Freeman

https://5wgraphicsblog.com/2016/10/24/the-cutaway-illustrations-of-fred-freeman/
37•Michelangelo11•2d ago•3 comments

TREAD: Token Routing for Efficient Architecture-Agnostic Diffusion Training

https://arxiv.org/abs/2501.04765
26•fzliu•2h ago•3 comments

The Weight of a Cell

https://www.asimov.press/p/cell-weight
57•arbesman•4h ago•20 comments

Launch HN: Reality Defender (YC W22) – API for Deepfake and GenAI Detection

https://www.realitydefender.com/platform/api
45•bpcrd•4h ago•22 comments

Web apps in a single, portable, self-updating, vanilla HTML file

https://hyperclay.com/
541•pil0u•12h ago•192 comments

Who Invented Backpropagation?

https://people.idsia.ch/~juergen/who-invented-backpropagation.html
126•nothrowaways•3h ago•63 comments

Typechecker Zoo

https://sdiehl.github.io/typechecker-zoo/
97•todsacerdoti•3d ago•17 comments

Electromechanical reshaping, an alternative to laser eye surgery

https://medicalxpress.com/news/2025-08-alternative-lasik-lasers.html
191•Gaishan•10h ago•83 comments

Finding a Successor to the FHS

https://lwn.net/SubscriberLink/1032947/67e23ce1a3f9f129/
14•firexcy•12h ago•6 comments

Turning an iPad Pro into the Ultimate Classic Macintosh (2021)

https://blog.gingerbeardman.com/2021/04/17/turning-an-ipad-pro-into-the-ultimate-classic-macintosh/
56•rcarmo•2h ago•7 comments

A gigantic jet caught on camera: A spritacular moment for NASA astronaut

https://science.nasa.gov/science-research/heliophysics/a-gigantic-jet-caught-on-camera-a-spritacular-moment-for-nasa-astronaut-nicole-ayers/
364•acossta•3d ago•87 comments

The lottery ticket hypothesis: why neural networks work

https://nearlyright.com/how-ai-researchers-accidentally-discovered-that-everything-they-thought-about-learning-was-wrong/
9•076ae80a-3c97-4•2h ago•0 comments

Image Fulgurator (2011)

https://juliusvonbismarck.com/bank/index.php/projects/image-fulgurator/2/
33•Liftyee•2d ago•2 comments

Vibe coding tips and tricks

https://github.com/awslabs/mcp/blob/main/VIBE_CODING_TIPS_TRICKS.md
146•mooreds•6h ago•70 comments

T-Mobile claimed selling location data without consent is legal–judges disagree

https://arstechnica.com/tech-policy/2025/08/t-mobile-claimed-selling-location-data-without-consent-is-legal-judges-disagree/
7•Bender•10m ago•0 comments

SystemD Service Hardening

https://roguesecurity.dev/blog/systemd-hardening
218•todsacerdoti•14h ago•80 comments

Countrywide natural experiment links built environment to physical activity

https://www.nature.com/articles/s41586-025-09321-3
29•Anon84•2d ago•17 comments

Sky Calendar

https://abramsplanetarium.org/SkyCalendar/index.html
51•NaOH•3d ago•3 comments

The Lives and Loves of James Baldwin

https://www.newyorker.com/magazine/2025/08/18/baldwin-a-love-story-nicholas-boggs-book-review
79•Caiero•20h ago•11 comments

AWS pricing for Kiro dev tool dubbed 'a wallet-wrecking tragedy'

https://www.theregister.com/2025/08/18/aws_updated_kiro_pricing/
69•rntn•2h ago•47 comments

8x19 Text Mode Font Origins

https://www.os2museum.com/wp/8x19-text-mode-font-origins/
62•userbinator•2d ago•21 comments

Walkie-Textie Wireless Communicator

http://www.technoblogy.com/show?2AON
117•chrisjj•2d ago•79 comments

Class-action suit claims Otter AI records private work conversations

https://www.npr.org/2025/08/15/g-s1-83087/otter-ai-transcription-class-action-lawsuit
129•nsedlet•5h ago•30 comments