Ask HN: What makes it so hard to keep LLMs online?

2•realberkeaslan•1h ago
It feels like every few days one of the big AI services is down, degraded, or just slow. I don't mean this as a complaint. I'm just genuinely curious. These are well-funded companies with smart people. What is it about running these models that makes reliability so elusive? Is it just demand nobody predicted, or is there something fundamentally different about serving AI vs. a normal web app?

Comments

zipy124•1h ago
One large contributor is likely that when a normal service goes down, the fix is as simple as re-routing to another instance, and there is a practically unlimited number of CPU servers around the world to spin up on demand. GPU servers are much harder to spin up on demand because supply is so constrained.

Another factor is that it's a new field, and "move fast and break things" is still the go-to approach because competition is intense and the financial stakes are incredibly high.

A pessimistic but perhaps true theory is that vibe-coding/slop is reducing their reliability.

A counterpoint is that regular services like GitHub seem to go down almost as frequently.

angarrido•34m ago
Most people think it's just GPU cost. In practice it's coordination: model latency variance + queueing + retries under load. You don't scale linearly; you get cascading slowdowns.
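A rough way to see the non-linear effect described above is a textbook M/M/1 queue, where mean time in system is 1 / (service rate - arrival rate). This is a simplified stand-in, not a model of any real LLM serving stack: the single server, the 100 requests/s capacity, the timeout fraction, and the function names are all assumptions made for illustration.

```python
import math


def mean_latency(arrival_rate: float, service_rate: float) -> float:
    """Mean time in system (seconds) for an M/M/1 queue: W = 1 / (mu - lambda).

    Returns infinity when the offered load meets or exceeds capacity,
    because the queue then grows without bound.
    """
    if arrival_rate >= service_rate:
        return math.inf
    return 1.0 / (service_rate - arrival_rate)


def effective_load(base_rate: float, timeout_fraction: float, max_retries: int) -> float:
    """Offered load when a fixed fraction of attempts time out and are retried.

    Each retry round adds base_rate * timeout_fraction**k more requests,
    so the total is a truncated geometric series. The fixed timeout
    fraction is an illustrative assumption; in practice it rises with load.
    """
    return base_rate * sum(timeout_fraction ** k for k in range(max_retries + 1))


if __name__ == "__main__":
    capacity = 100.0  # hypothetical requests/s one replica can serve

    # Latency grows hyperbolically, not linearly, as utilization approaches 1.
    for rate in (50.0, 90.0, 99.0):
        print(f"{rate / capacity:.0%} utilization: {mean_latency(rate, capacity) * 1000:.0f} ms")

    # Retries push a load that would have fit (80 req/s) past capacity.
    load = effective_load(base_rate=80.0, timeout_fraction=0.5, max_retries=2)
    print(f"offered load with retries: {load:.0f} req/s, latency: {mean_latency(load, capacity)}")
```

Under these assumptions, going from 50% to 99% utilization raises mean latency from 20 ms to 1 s, and two rounds of retries turn 80 req/s of real demand into 140 req/s of offered load, which is more than the replica can ever drain. That feedback loop (slow responses cause timeouts, timeouts cause retries, retries cause slower responses) is one plausible reading of "cascading slowdowns".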
