Ask HN: What's the current best local/open speech-to-speech setup?

4•dsrtslnd23•5h ago
I’m trying to do the “voice assistant” thing fully locally: mic → model → speaker, low latency, ideally streaming + interruptible (barge-in).

Qwen3 Omni looks perfect on paper (“real-time”, speech-to-speech, etc). But I’ve been poking around and I can’t find a single reproducible “here’s how I got the open weights doing real speech-to-speech locally” writeup. Lots of “speech in → text out” or “audio out after the model finishes”, but not a usable realtime voice loop. Feels like either (a) the tooling isn’t there yet, or (b) I’m missing the secret sauce.

What are people actually using in 2026 if they want open + local voice?

Is anyone doing true end-to-end speech models locally (streaming audio out), or is the SOTA still “streaming ASR + LLM + streaming TTS” glued together?
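To make the "glued together" option concrete, here is a minimal sketch of one cascaded turn with barge-in. Everything in it is a stand-in: `fake_asr`, `fake_llm`, `fake_tts`, and `voice_turn` are hypothetical names, not any real library's API, and plain strings stand in for audio chunks. The only point is the shape of the loop: stream partial transcripts, stream reply tokens, synthesize each token as it arrives, and stop as soon as the user talks over the output.

```python
from typing import Callable, Iterable, Iterator, List


def fake_asr(chunks: Iterable[str]) -> Iterator[str]:
    """Hypothetical streaming ASR: yields a growing partial transcript per chunk."""
    words: List[str] = []
    for chunk in chunks:
        words.append(chunk)
        yield " ".join(words)


def fake_llm(prompt: str) -> Iterator[str]:
    """Hypothetical streaming LLM: yields reply tokens one at a time."""
    for token in f"you said: {prompt}".split():
        yield token


def fake_tts(token: str) -> bytes:
    """Hypothetical streaming TTS: turns one token into one 'audio' chunk."""
    return token.encode()


def voice_turn(mic_chunks: Iterable[str], interrupted: Callable[[], bool]) -> List[bytes]:
    """Run one mic -> ASR -> LLM -> TTS turn and return the audio chunks played.

    `interrupted` is polled before each token is synthesized; when it returns
    True (the user started talking again), playback stops immediately.
    """
    transcript = ""
    for transcript in fake_asr(mic_chunks):
        pass  # keep only the last partial as the final transcript

    played: List[bytes] = []
    for token in fake_llm(transcript):
        if interrupted():
            break  # barge-in: drop the rest of the reply
        played.append(fake_tts(token))
    return played
```

In a real setup each stage would run concurrently and `interrupted` would be driven by voice activity detection on the mic; this sketch runs the stages one after another just to show where the barge-in check sits.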

If you did get Qwen3 Omni speech-to-speech working: what stack (transformers / vLLM-omni / something else), what hardware, and is it actually realtime?

What’s the most “works today” combo on a single GPU?

Bonus: rough latency numbers people see for mic → first audio back.
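So the numbers are comparable, this is roughly how I'd measure it. It is a sketch under assumptions: `time_to_first_audio` is a hypothetical helper, the pipeline is any callable that returns an iterator of output audio chunks, and the clock starts when the pipeline is invoked (which I'm treating as the moment the user stops speaking).

```python
import time
from typing import Callable, Iterator


def time_to_first_audio(
    pipeline: Callable[[], Iterator[bytes]],
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Return seconds from invoking `pipeline` to its first non-empty audio chunk.

    Assumption: calling `pipeline()` corresponds to end of user speech, so the
    result covers ASR finalization, LLM first token, and TTS first chunk.
    """
    start = clock()
    for chunk in pipeline():
        if chunk:  # skip empty chunks; only real audio counts
            return clock() - start
    raise RuntimeError("pipeline produced no audio")
```

Passing `clock` as a parameter keeps the helper testable with a fake clock; with a real pipeline the default `time.perf_counter` is enough.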

Would love pointers to repos, configs, or “this is the one that finally worked for me” war stories.
