frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What's the current best local/open speech-to-speech setup?

1•dsrtslnd23•1h ago
I’m trying to do the “voice assistant” thing fully locally: mic → model → speaker, low latency, ideally streaming + interruptible (barge-in).

Qwen3 Omni looks perfect on paper (“real-time”, speech-to-speech, etc). But I’ve been poking around and I can’t find a single reproducible “here’s how I got the open weights doing real speech-to-speech locally” writeup. Lots of “speech in → text out” or “audio out after the model finishes”, but not a usable realtime voice loop. Feels like either (a) the tooling isn’t there yet, or (b) I’m missing the secret sauce.

What are people actually using in 2026 if they want open + local voice?

Is anyone doing true end-to-end speech models locally (streaming audio out), or is the SOTA still “streaming ASR + LLM + streaming TTS” glued together?

If you did get Qwen3 Omni speech-to-speech working: what stack (transformers / vLLM-omni / something else), what hardware, and is it actually realtime?

What’s the most “works today” combo on a single GPU?

Bonus: rough numbers people see for mic → first audio back

Would love pointers to repos, configs, or “this is the one that finally worked for me” war stories.

Categorical Crossentropy Is a Lie

https://www.pisoni.ai/posts/teacher-free-self-distillation/
1•4rtemi5•2m ago•0 comments

The quest for wide outlines: optimized GPU silhouettes

https://medium.com/@bgolus/the-quest-for-very-wide-outlines-ba82ed442cd9
1•fanf2•3m ago•0 comments

Methamphetamine deaths have risen across every US region

https://medicalxpress.com/news/2026-01-methamphetamine-deaths-risen-region.html
2•PaulHoule•5m ago•0 comments

Firefox and Linux in 2025

https://mastransky.wordpress.com/2026/01/23/firefox-linux-in-2025/
2•TangerineDream•7m ago•0 comments

What Has Docker Become?

https://tuananh.net/2026/01/20/what-has-docker-become/
2•tuananh•9m ago•0 comments

Write the Instruction Manual for Your Body

https://leroy.works/articles/write-the-instruction-manual-for-your-body/
1•leroy-is-here•11m ago•0 comments

The Dream, the Crazy, and the Reality

https://www.kmx.io/blog/dream-crazy-reality
1•thodg•12m ago•0 comments

Chinese AI models are popular. But can they make money?

https://www.economist.com/business/2026/01/22/chinese-ai-models-are-popular-but-can-they-make-money
1•1vuio0pswjnm7•13m ago•0 comments

Show HN: Terminal MCP – Browser MCP for the Terminal

https://github.com/elleryfamilia/terminal-mcp
1•e-clinton•14m ago•1 comments

Microsoft Gave FBI Keys to Unlock Encrypted Data, Exposing Major Privacy Flaw

https://www.forbes.com/sites/thomasbrewster/2026/01/22/microsoft-gave-fbi-keys-to-unlock-bitlocke...
3•_____k•15m ago•0 comments

Runtime consent behavior is often decided before the banner loads

https://www.attributionguard.com/report
1•CrossBurns•15m ago•0 comments

Ask HN: Is Blazor a bad choice in 2026 for a new .NET product UI?

https://alexaka1.dev/blog/blazor-sucks
1•SamLeBarbare•15m ago•1 comments

SHA-256 Self Reference

https://susam.net/0573e7473.html
1•smartera•16m ago•0 comments

Kitty Cards (make your own Apple Wallet cards)

https://xenodium.com/introducing-kitty-cards
1•xenodium•24m ago•0 comments

Show HN: A social network populated only by AI models

https://aifeed.social
2•capela•24m ago•3 comments

Op-ed: They're Coming for Our Data Centers

https://www.wsj.com/opinion/theyre-coming-for-our-data-centers-9692227a
1•1vuio0pswjnm7•24m ago•0 comments

Show HN: Shopify metaobject and metafields duplicator app

https://apps.shopify.com/duplicate-metaobjects
1•viikka•24m ago•0 comments

New code connects microscopic insights to the macroscopic world

https://phys.org/news/2026-01-code-microscopic-insights-macroscopic-world.html
1•rbanffy•28m ago•0 comments

Presence in Death

https://rubinmuseum.org/presence-in-death/
1•tock•28m ago•0 comments

Show HN: Express-like, event-driven minimalist TS framework

https://github.com/ddaras/melony
1•ddaras•28m ago•0 comments

Asciinema: Making Movies at the Command-Line

https://lwn.net/Articles/1053355/
2•sohkamyung•29m ago•0 comments

SFC vs. VIZIO: who can enforce the GPL?

https://lwn.net/Articles/1052734/
1•sohkamyung•29m ago•0 comments

Mental sundhed og personlig trivsel

1•Aksel-Louis•30m ago•0 comments

David Webb (Webb-site.com) has died

https://www.telegraph.co.uk/obituaries/2026/01/23/david-webb-site-hong-kong-investor-transparency...
2•calpaterson•31m ago•0 comments

FTC Appeals Loss in Meta Antitrust Case

https://www.nytimes.com/2026/01/20/technology/ftc-meta-antitrust-appeal.html
1•1vuio0pswjnm7•32m ago•0 comments

A Third Conversational Pattern in BDD

https://lizkeogh.com/2026/01/23/a-third-conversational-pattern-in-bdd/
1•adrianhoward•35m ago•0 comments

How to Leave Germany

https://allaboutberlin.com/guides/leaving-germany
2•nicbou•36m ago•0 comments

Show HN: BetterDB – OSS Valkey/Redis monitoring with historical data

4•kaliades•38m ago•0 comments

The threat eating away at museum treasures

https://www.scientificamerican.com/article/how-extremophile-molds-are-destroying-museum-artifacts/
2•sohkamyung•40m ago•0 comments

Liberals have to reckon with the limits of protests

https://www.bostonglobe.com/2026/01/22/opinion/protest-political-strategy-trump-ice/
1•throw0101c•41m ago•1 comments