frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Evolving Verifiable Trust: Bringing Binary Transparency to the Android Ecosystem

https://blog.google/security/bringing-binary-transparency-to-the-android-ecosystem/
1•concinds•2m ago•0 comments

Welcome to Gas City

https://steve-yegge.medium.com/welcome-to-gas-city-57f564bb3607
2•teruakohatu•3m ago•0 comments

New kew v4.0 "Love is gonna save us edition" [video]

https://www.youtube.com/watch?v=Ql5ZKeaX2MQ
1•ravachol•5m ago•1 comments

Pulitzer Prize Winners 2026

https://www.pulitzer.org/prize-winners-by-year/2026
2•brightbeige•8m ago•0 comments

Individual efficiency vs. administrative efficiency (2024)

https://longform.asmartbear.com/tension-autonomy-admin/
1•mooreds•9m ago•0 comments

SAP Acquiring Dremio

https://www.dremio.com/blog/sap-intends-to-acquire-dremio/
1•jamesblonde•10m ago•0 comments

Blepping in Cats

https://archcreekanimalclinic.com/blepping-in-cats/
1•mooreds•10m ago•0 comments

Bring Back the Jedi Knights

https://www.realcleardefense.com/articles/2026/05/04/bring_back_the_jedi_knights_1180442.html
1•tolerance•10m ago•1 comments

Operation Midway Blitz: How immigration raids changed Chicago

https://www.chicagotribune.com/2025/12/28/chicago-immigration-operation-midway-blitz-2/
1•mooreds•11m ago•1 comments

Building AGI Using Language Models (2020)

https://bmk.sh/2020/08/17/Building-AGI-Using-Language-Models/
1•rzk•14m ago•0 comments

Our Story – Meet the Founders of Thaura – Thaura

https://thaura.ai/story
1•abdelhousni•15m ago•0 comments

Omarchy 3.7 Linux Distribution Overhauls Gaming Support, Adds Unified CLI

https://www.phoronix.com/news/Omarchy-3.7-Released
2•breve•16m ago•0 comments

Claude Is Dead

https://www.javiertordable.com/claude-is-dead/
2•KnuthIsGod•16m ago•0 comments

We Can Do Hard Things

https://allenpike.com/2026/we-can-do-hard-things/
1•apike•17m ago•0 comments

Wine 11.8 Improves VBScript Compatibility Fixes Microsoft Golf 1999

https://www.phoronix.com/news/Wine-11.8-Released
2•breve•17m ago•0 comments

Pentagon Delays Put 150 Wind Projects on Ice as Trump Targets Wind Power

https://www.nytimes.com/2026/05/04/climate/wind-power-delays-trump-pentagon.html
4•giwook•18m ago•2 comments

Ask HN: Hypothetical to robotics/autonomy engineers working on FSD

2•Isuckatcode•19m ago•0 comments

I can't follow "don't use HN for promotion" due to my lifestyle – OK or not?

3•JamesEvery•20m ago•3 comments

Show HN: ByAllo – the online bookstore that runs itself

https://byallo.com/
3•averyintel•22m ago•0 comments

Trump's threats are giving Iran a reason for wanting a nuclear weapon

https://asiaviewnews.com/gigabots/threads?p=100058
2•mark336•23m ago•2 comments

Links to CSS Colour Palettes

https://jvns.ca/blog/2026/05/04/css-colour-palettes/
2•chmaynard•23m ago•0 comments

You've heard about the vulnpocalypse. let's talk about the slopdemic

https://cje.io/2026/05/04/thoughts-on-the-slopdemic/
4•caseyjohnellis•23m ago•1 comments

Anthropic's Boris Cherny: Coding is solved what's next

https://www.youtube.com/watch?v=JGubyPD_EU0
3•danebalia•25m ago•0 comments

When Networking Doesn't Work

https://www.os2museum.com/wp/when-networking-doesnt-work/
2•kencausey•27m ago•0 comments

VC and Accelerator Calendar for Early-Stage SaaS Founders

https://raaghavcodes.github.io/vc-fundraising-calendar/
3•raaghavcodes•28m ago•0 comments

EU accused of wasting €20B on AI computing dreams

https://www.politico.eu/article/eu-accused-wasting-20-billion-euro-ai-computing-dreams/
2•momentmaker•28m ago•0 comments

Offload MCP – Offload tasks to free models via API and save tokens

https://github.com/peterhadorn/offload-mcp
2•diioo•28m ago•0 comments

Ask HN: When did you move from AI agentic loops to simpler deterministic system?

4•laxmena•28m ago•0 comments

3D Print Flexible–Rigid Transition Mechanism for Rapid and Reversible Assembly

https://dl.acm.org/doi/10.1145/3772318.3790723
3•gnabgib•29m ago•0 comments

Breed96 – 30 years later the Amiga 500 game is back [video]

https://www.youtube.com/watch?v=E8hlHHGRCj8
2•doener•29m ago•0 comments
Open in hackernews

How OpenAI delivers low-latency voice AI at scale

https://openai.com/index/delivering-low-latency-voice-ai-at-scale/
89•Sean-Der•1h ago

Comments

AIorNot•1h ago
so is the answer

WebRTC + Kubernetes

anzerarkin•1h ago
I hate the voice ai though, it's so much dumber
NikolaNovak•1h ago
Fwiw - I found the advanced AI voice feature to be actually detrimental. It's good if you just want a single sentence answer. I've turned it off though when I want a more detailed, structured, considered answer.
drusepth•1h ago
Interestingly, that kind of parallels the real world too: if you want a quick and high level answer, talk to someone in person; if you want something detailed and info-dense, get them to write it down.
cdrnsf•1h ago
It's missing the part where they explain how they obtained the training data for their voice AI.
thimabi•1h ago
> Voice AI only feels natural if conversation moves at the speed of speech […] At OpenAI’s scale, that translates into three concrete requirements: Global reach for more than 900 million weekly active users

Surely the number refers to the total users of ChatGPT overall, and the fraction of those who use voice features is considerably smaller, is it not?

That’s the kind of thing that influences business decisions like knowing how much hardware and software optimization to throw at a problem.

stuartmemo•1h ago
Yeah, that's why they've used "reach" - the total number of users who could be exposed to the feature regardless of engagement.
Aeroi•1h ago
if anyone is looking to get into this. pipecat is a great open-source repo and community. https://github.com/pipecat-ai/pipecat
BoxedEmpathy•1h ago
I've been looking at this! Great project.
pncnmnp•1h ago
I wish I had known about Pipecat a lot sooner. I found out about it a few weeks back, and since Gemma 4 launched, I've been building my own entirely local voice assistant using Gemma 4 + Kokoro TTS + Whisper from scratch - https://github.com/pncnmnp/strawberry.

Pipecat's smart turn model is really good for VAD - https://huggingface.co/pipecat-ai/smart-turn-v3

AnthOlei•55m ago
What do you have going on the hardware side? I want to plug this into hass but don’t know what hardware I need for reasonable latency
Sean-Der•27m ago
Check out [0]. You can do 'Voice AI' on small/cheap hardware. It's the most fun you can have in the space ATM :) It's been a while, but posted a demo here [1]

[0] https://github.com/pipecat-ai/pipecat-esp32

[1] https://www.youtube.com/watch?v=6f0sUEUuruw

AnthOlei•7m ago
beautiful demo - is it running fully locally or talking to 3rd party API’s? That box was jaw dropping small
doctorpangloss•1h ago
what i learned from making a webrtc+kubernetes game streaming product:

- openai is wrong. almost of the issues they described are issues with libwebrtc, not with webrtc, kubernetes, network architecture, etc. the clue was when they said "the conventional one-port-per-session WebRTC model."

- there are no alternatives worth trying. everything else open source in the ecosystem, like pion, coturn, stunner, are too immature.

- libwebrtc is the only game in town.

- they haven't discovered libwebrtc feature flags or how it works with candidates, which directly fix a bunch of latency issues they are discovering. a correct feature flag can instantly reduce latency for free, compared to pay for twilio network traversal style solutions

- 99% of low latency voice END USERS will be in a network situation that can eliminate relays, transceivers, etc. it is totally first class on kubernetes. but you have to know something :)

this is the first time i'm experiencing gell mann amnesia with openai! look those guys are brilliant, but there is hardly anyone in the world who is doing this stuff correctly.

jiggawatts•34m ago
Something I noticed is that companies that are vibe-coding their products miss out on the intelligence that (still) only humans can bring to bear. Just the knowledge cutoff alone puts AI at a serious disadvantage in any rapidly changing field.
fragmede•7m ago
GPT 5.5's knowledge cutoff is August 2025. Which aspect of WebRTC has meaningfully changed since then?
Sean-Der•26m ago
Did you use libwebrtc on the backend? When you say `libwebrtc` is the only game in town are you talking about clients or servers?

Even for clients you have things like libpeer that libwebrtc can't hit.

flakiness•47m ago
Should I or shouldn't I be glad to see zero mention on Codex.
mock-possum•37m ago
Shouldn’t, I think - advanced voice is a surprisingly slick feature, and if you’re someone who feels that they can think and speak more naturally than when they think and type, AI voice transcription is kind of huge.
gyanchawdhary•22m ago
100% .. as a product designer/developer, i use it heavily for early feature ideation .. i’ll do a loose, exploratory back and forth on a long walk .. then pass the transcript to claude to validate and turn into a spec ..
furyofantares•45m ago
> Global reach for more than 900 million weekly active users

lol, definitely didn't need to know there's 900M weekly users for this post. I mean yeah, there's a lot of users and they serve globally, that's relevant. But this is just pulling out your biggest stat because you can. How many voice users you have would actually be relevant and interesting but, to baselessly speculate on motivation here, might be a number that doesn't add as much fuel to an upcoming IPO as reminded people that you're almost at a billion users does.

didibus•36m ago
I wouldn't mind waiting longer for answers that would go through a better model with more thinking. As long as it has good support for interrupting and also it doesn't start answering as soon as I pause for 1 second and it's smart about knowing I'm done speaking.
charisma123•34m ago
If a transceiver crashes during a stream, how is the active session recovered? Does the system automatically re-establish the context in a new WebRTC session?
Sean-Der•29m ago
It doesn't today, but you could with sometime like this [0]. You can save/suspend all WebRTC state and bring it back with the next process.

[0] https://github.com/pion/webrtc-zero-downtime-restart

legohead•31m ago
The low latency is more of a pain point than a good thing, the way they have it implemented. Trying to have a casual conversation with it, as humans we naturally pause, and GPT will take this as you are "done" and start blabbing away.

I also suffer from finding the appropriate word I want as I've gotten older and slower, and this fast-voice-gpt just ends up frustrating me more than helping. I have to sit there and think out the whole sentence in my head before I say anything -- not very natural.

zamadatix•18m ago
I think these are 2 different layers of "latency". The latency in the article is referring to the transport of the audio stream itself while the latency in your scenario is about how quickly to start responding inside the audio stream.
richardw•18m ago
Hard problem. I find myself adding in filler to stop the thing from jabbering.

I also think it spends most of its iq on sounding good rather than thinking about the problem. “Yeah absolutely I can see why you’d like to…” etc. This is likely because it’s on a timer and maybe voice is more expensive to process? Text responses spend more time on the task.

MagicMoonlight•10m ago
It’s possible to change the amount of time it waits if you’re using the API
Sean-Der•30m ago
Very grateful that OpenAI published the article/publicized their usage of Pion[0] a library I work on. If you aren't familiar with WebRTC it's a super fun space. I work on a book WebRTC for the Curious [1] that details how it works.

[0] https://github.com/pion/webrtc

[1] https://webrtcforthecurious.com

thatxliner•22m ago
slightly unrelated but what’s with storing the entire codebase in the root directory instead of a nested src folder? It makes getting to the README a lot more difficult
nemothekid•10m ago
Thats the default for go projects. Go imports are repository strings (e.g.):

     import ("github.com/go-sql-driver/mysql")
so it's standard to have the library files in the root directory.
a456463•6m ago
This is valid criticism. Go fanbois don't like listening to any go criticism. They were all like who needs templates in go. and now go has templates.

To me go code looks like somebody vomitted stuff in the root dir and i have to wade through that every time. No namespacing. nothing

dtran•6m ago
Thanks for WebRTC for the Curious and for Pion! Not using the latter directly, but have used both to better understand WebRTC
CrzyLngPwd•27m ago
It's bad enough having to speed-read the waffle of its written answers; even when told to be concise, the thought of having to listen to it waffle on in its smarmy, sycohpantic fashion makes me want to reach for the sick bag.
qrush•24m ago
Am I reading this right that OpenAI is not using Livekit for WebRTC/audio anymore?
rvz•23m ago
OpenAI uses Go for the networking implementation for the relays and the services, which makes a ton of sense, instead of something immature as TypeScript / Node or whatever.

Yet another reason to not consider anything else like that for low-latency networking. Golang (or even Rust and C++) is unmatched for this use-case.

nvarsj•13m ago
Can golang do zero copy networking nowadays? In the past golang was terrible at this kind of thing due to allocations and copies of all relayed data.
fragmede•6m ago
And the GC!
jonahs197•17m ago
Who cares? Their company is dying.