frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

LLMs are breaking 20 year old system design

https://zknill.io/posts/llms-are-breaking-20-year-old-system-design/
27•zknill•2h ago

Comments

throwaway27448•1h ago
? Yes if you treat llms like deterministic computation you'll get fucked, news at eleven. In terms of apps "shitty but uncannily useful search" seems like a better fit
mattjoyce•1h ago
Durable is used 13 times in this article.
NitpickLawyer•1h ago
> LLMs just make this problem more visible.

This theme keeps popping up everywhere. Lots of things were "the way we did things" because a lot of reasons. LLMs just amplify some things and they get enhanced visibility. It can be a good thing, if you're able to understand what/why/how changed, or it can be a bad thing if you insist that "this is how we do things, because this is how we've always done things".

endofreach•1h ago
> or it can be a bad thing if you insist that "this is how we do things, because this is how we've always done things"

Or... maybe... just maybe... it can be a bad thing, because it's a bad thing.

NitpickLawyer•1h ago
Many things can be wrong, for many reasons. The problem is when people think LLMs make it wrong, instead of understanding that LLMs just expose the thing for what it was. It's like shooting the messenger just because the messenger is an LLM. That was my point, in case I worded it badly.
cowl•1h ago
or maybe it is a bad thing, because right now the model is "throw it against the wall and see what sticks or how many billions we need to make it stick"?
NitpickLawyer•54m ago
> right now the model is "throw it against the wall and see what sticks

When was it not? We've been doing this for decades. Something usually sticks.

manueltgomes•1h ago
> Long running work: an agent doing a 10 minute task isn’t a ‘request’, it’s a long-running async process.

Correct, but we solved this a long time ago when we started sending files to servers to be converted, for example. We either got a 'job_id' or a call to a webhook when the job was finished."

foo42•57m ago
It feels like the virtual actors are the primitive the author is reaching for. As an erstwhile Elixir hobbyist I've often found myself wishing for the simplicity of actors when solving problems in my day job. I tend to work in an AWS environment, but I believe over in Azure they have something like it. I think it was called Orleans when I read about it but I think it's got a more corporate name now.
cronin101•53m ago
It’s still Orleans! https://learn.microsoft.com/en-us/dotnet/orleans/overview?pi...
pmargam•57m ago
Using Cloudflare's Durable Objects https://developers.cloudflare.com/durable-objects/concepts/w... for this and works pretty well.
ahofmann•54m ago
To me this makes no sense. Nothing in web development changes because of long running requests, there are plenty of solutions for this. The most easy one is to just listen long enough on a http request for the answer. The routing problem can be mitigated with session pinning. Http2 and 3 have solutions for streaming data, websockets can be used, and pub/sub also. Heck, we could push the LLM response in a k2v system/redis and read it from there. "State is in the DB" is running strong and will be for decades to come.
haileys•53m ago
The premise is incorrect and ignorant of the history - this is sticky sessions and the idea has been around longer than 20 years.

The "cloud native" (as the author refers to it) idea that app servers should be stateless is actually the new idea.

The industry eventually reached consensus on sticky sessions being a bad idea a lot of the time. That's why stateless app servers became the norm.

bilbo-b-baggins•51m ago
Claude code runs as a nearly stateless server using session JSONL files as a conversation database, sending stateless API requests to Anthropic, etc.

This post doesn’t seem to understand how these systems work at the core of agent harnesses.

ventana•47m ago
If I'm reading it correctly, the TL;DR of the article is: given the client and the server, we need to be able to ingest messages to the client-server communication channel, and this channel should survive a disconnection. The article suggests using named pub/sub channels for communication, so that the “connection” between a given client and a given (cloud) server had a name and it was possible to ingest data chunks into that named channel.

I would suggest that there is a much, much older technology than pub/sub that can be used for such kind of data transfer: it's UDP, documented in 1980.

I can't stop thinking how overcomplicated our software engineering reality is so we need to reinvent layers and layers of stuff on top of the other stuff. We must make applications for browsers; browsers disallow basic network communication for the code they execute; so sending a chunk of data from a client to a server becomes a real adventure.

lxgr•19m ago
UDP and nothing layered on top?

Then you'll be reimplementing host discovery (i.e. how do clients find the host that has context on their request), retransmissions, flow control, congestion control, and many other things on top of it, and suddenly it doesn't sound so simple anymore.

skywhopper•41m ago
This article is clearly written by someone who’s never done any work on actually complex web applications. Nothing here is a new problem nor unsolved. The pattern identified as being “LLM specific” (long-running async jobs) is not particularly unusual.
grugdev42•19m ago
Article doesn't make sense. Some of the "horizontally scaled" servers have their own state. A local cache, a temporary filesystem etc.

Also, has teh author never heard of long running queued jobs? Or long running scheduled jobs? They ultimately report back into the DB (updating their status etc).

This article reeks of someone using AI to make huge leaping jumps of logic. The "single source of truth" rule has survived this long for a reason. It works!

Claude for Small Business

https://www.anthropic.com/news/claude-for-small-business
208•neilfrndes•4h ago•136 comments

Scorched Earth 2000 – Web

http://www.scorch2000.com/web/
238•meshko•8h ago•95 comments

Linux gaming is faster because Windows APIs are becoming Linux kernel features

https://www.xda-developers.com/linux-gaming-is-getting-faster-because-windows-apis-are-becoming-l...
701•haunter•3d ago•454 comments

Classic 7 is a Windows 10 LTSC mod to look 1:1 to Windows 7

https://classic7.lol/
18•jandeboevrie•1h ago•8 comments

Arena AI Model ELO History

https://mayerwin.github.io/AI-Arena-History/
55•mayerwin•5h ago•38 comments

Setting up a free *.city.state.us locality domain (2025)

https://fredchan.org/blog/locality-domains-guide/
557•speckx•17h ago•173 comments

What the Hell Was Going on with Cigarette Ads in the 70s?

https://tohippo.com/what-the-hell-was-going-on-with-cigarette-ads-in-the-70s/
8•Vasbarlog•1h ago•7 comments

MacBook Neo Deep Dive: Benchmarks, Wafer Economics, and the 8GB Gamble

https://www.jdhodges.com/blog/macbook-neo-benchmarks-analysis/
208•tosh•14h ago•218 comments

A History of IDEs at Google

https://laurent.le-brun.eu/blog/a-history-of-ides-at-google
360•laurentlb•4d ago•241 comments

Extraordinary Ordinals

https://text.marvinborner.de/2026-04-09-17.html
21•marvinborner•2d ago•11 comments

The Emacsification of Software

https://sockpuppet.org/blog/2026/05/12/emacsification/
291•rdslw•1d ago•191 comments

Twin brothers wipe 96 government databases minutes after being fired

https://arstechnica.com/tech-policy/2026/05/drop-database-what-not-to-do-after-losing-an-it-job/
409•jnord•1d ago•313 comments

Show HN: Nibble

https://github.com/glouw/nibble
51•glouwbug•6h ago•5 comments

Cisco workforce reductions

https://blogs.cisco.com/news/our-path-forward
188•ahmedomran8•7h ago•175 comments

delta time

https://www.deltatime.life/
47•mxfh•7h ago•23 comments

Avoiding and reducing microplastic false positives from dry glove contact

https://pubs.rsc.org/en/content/articlelanding/2026/ay/d5ay01801c
40•efavdb•7h ago•1 comments

Chess puzzle I found in my dad's old book

https://ardoedo.it/kempelen/
151•Eswo•2d ago•41 comments

Notes from Optimizing CPU-Bound Go Hot Paths

https://blog.andr2i.com/posts/2026-05-03-notes-from-optimizing-cpu-bound-go-hot-paths
10•nnx•2d ago•1 comments

The Deathbed Notes of Henry James (1968)

https://www.theatlantic.com/past/docs/unbound/flashbks/james/jnote.htm
4•Hooke•1d ago•0 comments

Microsoft BitLocker – YellowKey zero-day exploit

https://www.tomshardware.com/tech-industry/cyber-security/microsoft-bitlocker-protected-drives-ca...
149•cookiengineer•5h ago•79 comments

The US is winning the AI race where it matters most: commercialization

https://avkcode.github.io/blog/us-winning-ai-race.html
196•akrylov•18h ago•535 comments

Xs of Y – roguelike that names itself every run. Written in 4kLoC

https://github.com/nooga/xsofy
183•andsoitis•4d ago•76 comments

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

https://github.com/cactus-compute/needle
671•HenryNdubuaku•1d ago•188 comments

Princeton mandates proctoring for in-person exams, upending 133 year precedent

https://www.dailyprincetonian.com/article/2026/05/princeton-news-adpol-proctoring-in-person-exami...
322•bookofjoe•12h ago•478 comments

Heritability of human life span is ~50% when heritability is redefined

https://dynomight.net/lifespan/
97•surprisetalk•1d ago•54 comments

Launch HN: Ardent (YC P26) – Postgres sandboxes in seconds with zero migration

https://www.tryardent.com/
85•vc289•15h ago•34 comments

How can Apple deal with the memory shortage?

https://asymco.com/2026/05/11/the-great-memory-panic-of-2026/
94•tambourine_man•2d ago•94 comments

A Claude Code and Codex Skill for Deliberate Skill Development

https://github.com/DrCatHicks/learning-opportunities
33•cdrnsf•5h ago•10 comments

Golden Testing a CAD Library

https://doscienceto.it/blog/posts/2026-04-27-golden-testing-cad.html
22•PaulHoule•2d ago•5 comments

Reverting the incremental GC in Python 3.14 and 3.15

https://discuss.python.org/t/reverting-the-incremental-gc-in-python-3-14-and-3-15/107014
228•curiousgal•4d ago•92 comments