frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3

https://github.com/moonshine-ai/moonshine
61•petewarden•2h ago
I wanted to share our new speech to text model, and the library to use them effectively. We're a small startup (six people, sub-$100k monthly GPU budget) so I'm proud of the work the team has done to create streaming STT models with lower word-error rates than OpenAI's largest Whisper model. Admittedly Large v3 is a couple of years old, but we're near the top the HF OpenASR leaderboard, even up against Nvidia's Parakeet family. Anyway, I'd love to get feedback on the models and software, and hear about what people might build with it.

Comments

cyanydeez•1h ago
No LICENSE no go
bangaladore•1h ago
There is a license blurb in the readme.

> This code, apart from the source in core/third-party, is licensed under the MIT License, see LICENSE in this repository.

> The English-language models are also released under the MIT License. Models for other languages are released under the Moonshine Community License, which is a non-commercial license.

> The code in core/third-party is licensed according to the terms of the open source projects it originates from, with details in a LICENSE file in each subfolder.

altruios•1h ago
reading through readme.md "License This code, apart from the source in core/third-party, is licensed under the MIT License, see LICENSE in this repository.

The English-language models are also released under the MIT License. Models for other languages are released under the Moonshine Community License, which is a non-commercial license.

The code in core/third-party is licensed according to the terms of the open source projects it originates from, with details in a LICENSE file in each subfolder."

lostmsu•1h ago
How does it compare to Microsoft VibeVoice ASR https://news.ycombinator.com/item?id=46732776 ?
armcat•47m ago
This is awesome, well done guys, I’m gonna try it as my ASR component on the local voice assistant I’ve been building https://github.com/acatovic/ova. The tiny streaming latencies you show look insane
ac29•45m ago
No idea why 'sudo pip install --break-system-packages moonshine-voice' is the recommended way to install on raspi?

The authors do acknowledge this though and give a slightly too complex way to do this with uv in an example project (FYI, you dont need to source anything if you use uv run)

g-mork•41m ago
How does this compare to Parakeet, which runs wonderfully on CPU?
pzo•30m ago
haven't tested yet but I'm wondering how it will behave when talking about many IT jargon and tech acronyms. For those reason I had to mostly run LLM after STT but that was slowing done parakeet inference. Otherwise had problems to detect properly sometimes when talking about e.g. about CoreML, int8, fp16, half float, ARKit, AVFoundation, ONNX etc.
sroussey•24m ago
onnx models for browser possible?
asqueella•20m ago
For those wondering about the language support, currently English, Arabic, Japanese, Korean, Mandarin, Spanish, Ukrainian, Vietnamese are available (most in Base size = 58M params)
Karrot_Kream•15m ago
According to the OpenASR Leaderboard [1], looks like Parakeet V2/V3 and Canary-Qwen (a Qwen finetune) handily beat Moonshine. All 3 models are open, but Parakeet is the smallest of the 3. I use Parakeet V3 with Handy and it works great locally for me.

[1]: https://huggingface.co/spaces/hf-audio/open_asr_leaderboard

I'm helping my dog vibe code games

https://www.calebleak.com/posts/dog-game/
549•cleak•6h ago•162 comments

Mac mini will be made at a new facility in Houston

https://www.apple.com/newsroom/2026/02/apple-accelerates-us-manufacturing-with-mac-mini-production/
280•haunter•2h ago•278 comments

Show HN: Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3

https://github.com/moonshine-ai/moonshine
63•petewarden•2h ago•11 comments

Hacking an old Kindle to display bus arrival times

https://www.mariannefeng.com/portfolio/kindle/
144•mengchengfeng•4h ago•27 comments

Nearby Glasses

https://github.com/yjeanrenaud/yj_nearbyglasses
203•zingerlio•6h ago•85 comments

Cell Service for the Fairly Paranoid

https://www.cape.co/
24•0xWTF•1h ago•14 comments

Show HN: Emdash – Open-source agentic development environment

https://github.com/generalaction/emdash
89•onecommit•6h ago•38 comments

I pitched a roller coaster to Disneyland at age 10 in 1978

https://wordglyph.xyz/one-piece-at-a-time
380•wordglyph•11h ago•148 comments

Hugging Face Skills

https://github.com/huggingface/skills
124•armcat•6h ago•36 comments

Optophone

https://en.wikipedia.org/wiki/Optophone
23•Hooke•4d ago•3 comments

How we rebuilt Next.js with AI in one week

https://blog.cloudflare.com/vinext/
310•ghostwriternr•4h ago•94 comments

Fed's Cook says AI triggering big changes, sees possible unemployment rise

https://www.reuters.com/business/feds-cook-says-ai-triggering-big-changes-sees-possible-short-ter...
26•geox•39m ago•7 comments

Pi – a minimal terminal coding harness

https://pi.dev
101•kristianpaul•2h ago•44 comments

Build Your Own Forth Interpreter

https://codingchallenges.fyi/challenges/challenge-forth/
44•AlexeyBrin•3d ago•12 comments

IRS Tactics Against Meta Open a New Front in the Corporate Tax Fight

https://www.nytimes.com/2026/02/24/business/irs-meta-corporate-taxes.html
174•mitchbob•11h ago•190 comments

OpenAI, the US government and Persona built an identity surveillance machine

https://vmfunc.re/blog/persona/
411•rzk•5h ago•131 comments

We installed a single turnstile to feel secure

https://idiallo.com/blog/installed-single-turnstile-for-security-theater
259•firefoxd•2d ago•116 comments

The history of knocking on wood

https://resobscura.substack.com/p/neolithic-habits-machine-age-tools
7•benbreen•8h ago•0 comments

Steel Bank Common Lisp

https://www.sbcl.org/
134•tosh•5h ago•43 comments

Verge (YC S15) Is Hiring a Director of Computational Biology and AI Scientists/Eng

https://jobs.ashbyhq.com/verge-genomics
1•alicexzhang•7h ago

Mercury 2: The fastest reasoning LLM, powered by diffusion

https://www.inceptionlabs.ai/blog/introducing-mercury-2
6•fittingopposite•1h ago•1 comments

Looks like it is happening

https://www.math.columbia.edu/~woit/wordpress/?p=15500
126•jjgreen•2h ago•85 comments

Dream Recorder AI – a portal to your subconscious

https://dreamrecorder.ai/
9•level87•2h ago•9 comments

Ask HN: Programmable Watches with WiFi?

12•dakiol•3d ago•5 comments

We Are Changing Our Developer Productivity Experiment Design

https://metr.org/blog/2026-02-24-uplift-update/
28•ej88•4h ago•19 comments

Stripe reportedly makes offer to acquire PayPal

https://www.cnbc.com/2026/02/24/paypal-stock-stripe-acquisition-report.html
41•nodesocket•1h ago•25 comments

IDF killed Gaza aid workers at point blank range in 2025 massacre: Report

https://www.dropsitenews.com/p/israeli-soldiers-tel-sultan-gaza-red-crescent-civil-defense-massac...
1143•Qem•11h ago•427 comments

Show HN: Tag Promptless on any GitHub PR/Issue to get updated user-facing docs

26•prithvi2206•6h ago•5 comments

Show HN: Chaos Monkey but for Audio Video Testing (WebRTC and UDP)

https://github.com/MdSadiqMd/AV-Chaos-Monkey
30•MdSadiqMd•1d ago•2 comments

The Missing Semester of Your CS Education – Revised for 2026

https://missing.csail.mit.edu/
376•anishathalye•1d ago•113 comments