frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Scaling long-running autonomous coding

https://simonwillison.net/2026/Jan/19/scaling-long-running-autonomous-coding/
49•srameshc•4h ago
Related: Scaling long-running autonomous coding - https://news.ycombinator.com/item?id=46624541 - Jan 2026 (187 comments)

Comments

anilgulecha•2h ago
That's a wild idea-a browser from scratch! And ladybird has been moving at snails pace for a long time..

I think a good abstractions design and good test suite will make it break success of future coding projects.

simonw•1h ago
One of the big open questions for me right now concerns how library dependencies are used.

Most of the big ones are things like skia, harfbuzz, wgpu - all totally reasonable IMO.

The two that stand out for me as more notable are html5ever for parsing HTML and taffy for handling CSS grids and flexbox - that's vendored with an explanation of some minor changes here: https://github.com/wilsonzlin/fastrender/blob/19bf1036105d4e...

Taffy a solid library choice, but it's probably the most robust ammunition for anyone who wants to argue that this shouldn't count as a "from scratch" rendering engine.

I don't think it detracts much if at all from FastRender as an example of what an army of coding agents can help a single engineer achieve in a few weeks of work.

sealeck•1h ago
I think the other question is how far away this is from a "working" browser. It isn't impossible to render a meaningful subset of HTML (especially when you use external libraries to handle a lot of this). The real difficulty is doing this (a) quickly, (b) correctly and (c) securely. All of those are very hard problems, and also quite tricky to verify.

I think this kind of approach is interesting, but it's a bit sad that Cursor didn't discuss how they close the feedback loop: testing/verification. As generating code becomes cheaper, I think effort will shift to how we can more cheaply and reliably determine whether an arbitrary piece of code meets a desired specification. For example did they use https://web-platform-tests.org/, fuzz testing (e.g. feed in random webpages and inform the LLM when the fuzzer finds crashes), etc? I would imagine truly scaling long-running autonomous coding would have an emphasis on this.

Of course Cursor may well have done this, but it wasn't super deeply discussed in their blog post.

I really enjoy reading your blog and it would be super cool to see you look at approaches people have to ensuring that LLM-produced code is reliable/correct.

simonw•1h ago
Yeah, I'm hoping they publish a lot more about this project! It deserves way more then the few sentences they've shared about it so far.
janoelze•1h ago
Any views on the nature of "maintainability" shifting now? If a fleet of agents demonstrated the ability to bootstrap a project like that, would that be enough indication to you that orchestration would be able to carry the code base forward? I've seen fully llm'd codebases hit a certain critical weight where agents struggled to maintain coherent feature development, keeping patterns aligned, as well as spiralling into quick fixes.
simonw•1h ago
Almost no idea at all. Coding agents are messing with all 25+ years of my existing intuitions about what features cost to build and maintain.

Features that I'd normally never have considered building because they weren't worth the added time and complexity are now just a few well-structured prompts away.

But how much will it cost to maintain those features in the future? So far the answer appears to be a whole lot less than I would previously budget for, but I don't have any code more than a few months old that was built ~100% by coding agents, so it's way too early to judge how maintenance is going to work over a longer time period.

brianjeong•38m ago
I think there's a somewhat valid perspective that the Nth+1 model can simply clean up the previous models mess.

Essentially a bet that the rate of model improvement is going to be faster than the rate of decay from bad coding.

Now this hurts me personally to see as someone who actually enjoys having quality code but I don't see why it doesn't have a decent chance of holding

tinyhouse•1h ago
Well, software is measured over time. The devil is always in the details.
vivzkestrel•1h ago
I am waiting for that guy or a team that uses LLMs to write the most optimal version of Windows in existence, something that even surpasses what Microsoft has done over the years and honestly looking at the current state of Windows 11, it really feels like it shouldn't even be that hard to make something more user friendly
kimixa•31m ago
Considering Microsoft's significant (and vocal) investment in LLMs, I fear the current state of Windows 11 is related to a team trying to do exactly that.
halfcat•45m ago
So AI makes it cheaper to remix anything already-seen, or anything with a stable pattern, if you’re willing to throw enough resources at it.

AI makes it cheap (eventually almost free) to traverse the already-discovered and reach the edge of uncharted territory. If we think of a sphere, where we start at the center, and the surface is the edge of uncharted territory, then AI lets you move instantly to the surface.

If anything solved becomes cheap to re-instantiate, does R&D reach a point where it can’t ever pay off? Why would one pay for the long-researched thing when they can get it for free tomorrow? There will be some value in having it today, just like having knowledge about a stock today is more valuable than the same knowledge learned tomorrow. But does value itself go away for anything digital, and only remain for anything non-copyable?

The volume of a sphere grows faster than the surface area. But if traversing the interior is instant and frictionless, what does that imply?

F-16 Falcon Strike, modern combat flight SIM for Atari XL/XE

https://webchrono.pl/F16FalconStrike/index.html
50•starkparker•1h ago•3 comments

Level S4 solar radiation event

https://www.swpc.noaa.gov/news/g4-severe-geomagnetic-storm-levels-reached-19-jan-2026
312•WorldPeas•8h ago•111 comments

Nova Launcher Added Facebook and Google Ads Tracking

https://lemdro.id/post/lemdro.id/35049920
117•celsoazevedo•3h ago•45 comments

Nearly a third of social media research has undisclosed ties to industry

https://www.science.org/content/article/nearly-third-social-media-research-has-undisclosed-ties-i...
261•bikenaga•10h ago•110 comments

Porsche sold more electrified cars in Europe in 2025 than pure gas-powered cars

https://newsroom.porsche.com/en/2026/company/porsche-deliveries-2025-41516.html
195•m463•3h ago•196 comments

Reticulum, a secure and anonymous mesh networking stack

https://github.com/markqvist/Reticulum
87•brogu•4h ago•15 comments

Understanding ZFS Scrubs and Data Integrity

https://klarasystems.com/articles/understanding-zfs-scrubs-and-data-integrity/
17•zdw•5d ago•0 comments

Nanolang: A tiny experimental language designed to be targeted by coding LLMs

https://github.com/jordanhubbard/nanolang
103•Scramblejams•7h ago•71 comments

What came first: the CNAME or the A record?

https://blog.cloudflare.com/cname-a-record-order-dns-standards/
321•linolevan•11h ago•111 comments

Scaling long-running autonomous coding

https://simonwillison.net/2026/Jan/19/scaling-long-running-autonomous-coding/
49•srameshc•4h ago•11 comments

Opening the AWS European Sovereign Cloud

https://aws.amazon.com/blogs/aws/opening-the-aws-european-sovereign-cloud/
24•notmine1337•3d ago•31 comments

The coming industrialisation of exploit generation with LLMs

https://sean.heelan.io/2026/01/18/on-the-coming-industrialisation-of-exploit-generation-with-llms/
99•long•20h ago•65 comments

Legal Structures for Latin American Startups (2021)

https://latamlist.com/legal-structures-for-latin-american-startups/
14•walterbell•3h ago•3 comments

The assistant axis: situating and stabilizing the character of LLMs

https://www.anthropic.com/research/assistant-axis
65•mfiguiere•7h ago•12 comments

From Nevada to Kansas by Glider

https://www.weglide.org/flight/978820
124•sammelaugust•4d ago•37 comments

Use Social Media Mindfully

https://danielleheberling.xyz/blog/mindful-social-media/
46•mooreds•7h ago•29 comments

How we made Python's packaging library 3x faster

https://iscinumpy.dev/post/packaging-faster/
47•rbanffy•3d ago•6 comments

British redcoat's lost memoir reveals realities of life as a disabled veteran

https://phys.org/news/2026-01-british-redcoat-lost-memoir-reveals.html
53•wglb•4d ago•43 comments

Show HN: An interactive physics simulator with 1000’s of balls, in your terminal

https://github.com/minimaxir/ballin
42•minimaxir•11h ago•8 comments

Notes on Apple's Nano Texture (2025)

https://jon.bo/posts/nano-texture/
159•dsr12•10h ago•88 comments

Selling SaaS in Japan

https://embedworkflow.com/blog/what-saas-founders-should-know-about-entering-the-japanese-market/
36•ewf•4d ago•22 comments

Sending Data over Offline Finding Networks

https://cc-sw.com/find-my-and-find-hub-network-research/
75•findmysanity•5d ago•8 comments

Go 1.26 Interactive Tour

https://antonz.org/go-1-26/
41•phren0logy•2h ago•4 comments

San Francisco coyote swims to Alcatraz

https://www.sfgate.com/local/article/san-francisco-coyote-alcatraz-21302218.php
169•kaycebasques•1d ago•51 comments

Conditions in the Intel 8087 floating-point chip's microcode

https://www.righto.com/2025/12/8087-microcode-conditions.html
95•diogotozzi•4d ago•34 comments

Harvard legal scholars debate the state of the U.S. constitution (2025)

https://www.harvardmagazine.com/social-sciences/is-the-constitution-broken
34•KnuthIsGod•3h ago•77 comments

CSS Web Components for marketing sites (2024)

https://hawkticehurst.com/2024/11/css-web-components-for-marketing-sites/
107•zigzag312•13h ago•51 comments

Weight Transfer for RL Post-Training in under 2 seconds

https://research.perplexity.ai/articles/weight-transfer-for-rl-post-training-in-under-2-seconds
30•jxmorris12•8h ago•1 comments

Graphics In Flatland – 2D ray tracing [video]

https://www.youtube.com/watch?v=WYTOykSqf2Y
71•evakhoury•3d ago•12 comments

Show HN: Pipenet – A Modern Alternative to Localtunnel

https://pipenet.dev/
89•punkpeye•12h ago•16 comments