frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

2025: The Year in LLMs

https://simonwillison.net/2025/Dec/31/the-year-in-llms/
67•simonw•2h ago

Comments

AndyNemmity•1h ago
These are excellent every year, thank you for all the wonderful work you do.
waldrews•54m ago
Remember, back in the day, when a year of progress was like, oh, they voted to add some syntactic sugar to Java...
throwup238•49m ago
> they voted to add some syntactic sugar to Java...

I remember when we just wanted to rewrite everything in Rust.

Those were the simpler times, when crypto bros seemed like the worst venture capitalism could conjure.

OGEnthusiast•30m ago
Crypto bros in hindsight were so much less dangerous than AI bros. At least they weren't trying to construct data centers in rural America or prop up artificial stocks like $NVDA.
sanreau•46m ago
> Vendor-independent options include GitHub Copilot CLI, Amp, OpenHands CLI, and Pi

...and the best of them all, OpenCode[1] :)

[1]: https://opencode.ai

simonw•39m ago
Good call, I'll add that. I think I mentally scrambled it with OpenHands.
the_mitsuhiko•33m ago
Thanks for adding pi to it though :)
nineteen999•22m ago
How did I miss this until now! Thank you for sharing.
the_mitsuhiko•44m ago
> The (only?) year of MCP

I like to believe, but MCP is quickly turning into an enterprise thing so I think it will stick around for good.

simonw•40m ago
I think it will stick around, but I don't think it will have another year where it's the hot thing it was back in January through May.
npalli•40m ago
Great summary of the year in LLMs. Is there a predictions (for 2026) blogpost as well?
simonw•37m ago
Given how badly my 2025 predictions aged I'm probably going to sit that one out! https://simonwillison.net/2025/Jan/10/ai-predictions/
skydhash•34m ago
Pretty much a whole year of nothing really. Just coming with a bunch of abstraction and ideas trying to solve an unsolvable problem. Getting reliable results from an unreliable process while assuming the process is reliable.

At least when herding cats, you can be sure that if the cats are hungry, they will try to get where the food is.

MattRix•27m ago
I’m not sure how to tell you how obvious it is you haven’t actually used these tools.
skydhash•20m ago
Why do people assume negative critique is ignorance?
dmd•17m ago
People denied that bicycles could possibly balance even as others happily pedaled by. This is the same thing.
measurablefunc•12m ago
Bicycles don't balance, the human on the bicycle is the one doing the balancing.
moralestapia•10m ago
Yikes.
dmd•8m ago
Yes, that is the analogy I am making. People argued that bicycles (a tool for humans to use) could not possibly work - even as people were successfully using them.
skydhash•5m ago
Please tell me which one of the headings is not about increased usage o LLMs and derived tools and is about some improvement in the axes of reliability or or any kind of usefulness.

Here is the changelog for OpenBSD 7.8:

https://www.openbsd.org/78.html

There's nothing here that says: We make it easier to use it more of it. It's about using it better and fixing underlying problems.

castwide•33m ago
2025: The Year in LLMs

I will never stop treating hallucinations as inventions. I dare you to stop me. i double dog dare y

aussieguy1234•31m ago
> The year of YOLO and the Normalization of Deviance #

On this including AI agents deleting home folders, I was able to run agents in Firejail by isolating vscode (Most of my agents are vscode based ones, like Kilo Code).

I wrote a little guide on how I did it https://softwareengineeringstandard.com/2025/12/15/ai-agents...

Took a bit of tweaking, vscode crashing a bunch of times with not being able to read its config files, but I got there in the end. Now it can only write to my projects folder. All of my projects are backed up in git.

amelius•28m ago
YOLO means something else in neural network parlance.
agentifysh•23m ago
What an amazing progress in just short time. The future is bright! Happy New Year y'all!
sho_hn•18m ago
Not in this review: Also the record year in intelligent systems aiding in and prompting human users into fatal self-harm.

Will 2026 fare better?

simonw•15m ago
I really hope so.

The big labs are (mostly) investing a lot of resources into reducing the chance their models will trigger self-harm and AI psychosis and suchlike. See the GPT-4o retirement (and resulting backlash) for an example of that.

But the number of users is exploding too. If they make things 5x less likely to happen but sign up 10x more people it won't be good on that front.

measurablefunc•14m ago
The people working on this stuff have convinced themselves they're on a religious quest so it's not going to get better: https://x.com/RobertFreundLaw/status/2006111090539687956
andai•14m ago
Also essential self-fulfilment.

But that one doesn't make headlines ;)

sho_hn•13m ago
Sure -- but that's fair game in engineering. I work on cars. If we kill people with safety faults I expect it to make more headlines than all the fun roadtrips.

What I find interesting with chat bots is that they're "web apps" so to speak, but with safety engineering aspects that type of developer is typically not exposed to or familiar with.

simonw•7m ago
One of the tough problems here is privacy. AI labs really don't want to be in the habit of actively monitoring people's conversations with their bots, but they also need to prevent bad situations from arising and getting worse.
websiteapi•16m ago
I'm curious how all of the progress will be seen if it does indeed result in mass unemployment (but not eradication) of professional software engineers.
simonw•12m ago
I nearly added a section about that. I wanted to contrast the thing where many companies are reducing junior engineering hires with the thing where Cloudflare and Shopify are hiring 1,000+ interns. I ran out of time and hadn't figured out a good way to frame it though so I dropped it.

C#-Style Property in C++

https://vorbrodt.blog/2025/12/05/c-style-property-in-c/
1•PaulHoule•4m ago•0 comments

Show HN: A local-first financial auditor using IBM Granite, MCP, and SQLite

https://github.com/simplynd/expense-ai
1•simplynd•4m ago•1 comments

Show HN: Browse your Claude Code history

https://github.com/kamranahmedse/claude-run
1•kamranahmedse•5m ago•0 comments

CodeWeavers CrossOver coupon code for 2026

1•twickline•6m ago•0 comments

<fencedframe>: The Fenced Frame element

https://developer.mozilla.org/en-US/docs/Web/HTML/Reference/Elements/fencedframe
1•jcbhmr•7m ago•0 comments

Most of Iran Shuts Down as Government Grapples with Protests and Economy

https://www.nytimes.com/2025/12/31/world/middleeast/iran-shutdown-protests.html
1•JumpCrisscross•8m ago•0 comments

I'm Trying #100DaysToOffload

https://www.autodidacts.io/100daystooffload/
1•Curiositry•9m ago•0 comments

Oil Tanker Fleeing the Coast Guard Now Listed in Russian Ship Database

https://www.nytimes.com/2025/12/31/us/politics/russia-oil-tanker-venezuela.html
1•JumpCrisscross•9m ago•0 comments

Zara uses AI to dress models virtually rather than book new photo shoots

https://www.cityam.com/zara-turns-to-ai-edited-models-amid-shop-closures/
1•Vaslo•13m ago•0 comments

2025 Year End Report on the Federal Judiciary – Chief Justice John Roberts [pdf]

https://www.supremecourt.gov/publicinfo/year-end/2025year-endreport.pdf
1•everybodyknows•19m ago•0 comments

Saks Prepares for Bankruptcy After Missing Debt Payment

https://www.wsj.com/finance/saks-prepares-for-bankruptcy-after-missing-debt-payment-ff3df6d2
1•JumpCrisscross•21m ago•0 comments

A man taking over the Large Hadron Collider – only to switch it off

https://www.theguardian.com/science/2025/dec/31/large-hadron-collider-head-of-cern-mark-thomson
2•pseudolus•23m ago•0 comments

Nerd: A language for LLMs, not humans

https://www.nerd-lang.org/about
22•gnanagurusrgs•43m ago•40 comments

Future of space exploration depends on better biology

https://www.economist.com/leaders/2025/12/30/the-future-of-space-exploration-depends-on-better-bi...
1•smurda•43m ago•0 comments

Show HN: Open-source AI agent Framework

https://github.com/claude-php/claude-php-agent
1•dalemhurley•49m ago•0 comments

Writing a performant autograd on tenstorrent wormhole p1

https://mewtwo.bearblog.dev/wormhole-autograd-p1/
3•csirak1528•49m ago•1 comments

The Struggle for Sudan

https://www.merip.org/the-struggle-for-sudan/
2•mhb•51m ago•0 comments

We need to reassess our relationship to digital tech

https://disconnect.blog/we-need-to-reassess-our-relationship-to-digital-tech/
2•bovermyer•51m ago•0 comments

California’s billionaire tax, explained

https://sfstandard.com/2025/12/30/california-s-billionaire-tax-explained/
4•donsupreme•52m ago•1 comments

Be aware when opening "take home challenges" from untrusted recruiters

https://www.reddit.com/r/cscareerquestions/s/qIYFSd4lUW
6•satvikpendem•57m ago•0 comments

To the people who've helped me become the way that I am

https://acknowledgements.aadillpickle.com/
1•dependency_2x•57m ago•1 comments

Interview with Steve Wozniak After Jobs' Departure (1985)

https://computeradsfromthepast.substack.com/p/interview-with-steve-wozniak-after
2•rbanffy•58m ago•0 comments

Roadmap to Java

https://nemorize.com/roadmaps/java
2•reverseblade2•1h ago•0 comments

Ray Bradbury S3E5 The Pedestrian (Transcript)

https://subslikescript.com/series/The_Ray_Bradbury_Theater-88591/season-3/episode-5-The_Pedestrian
2•raybadbury•1h ago•1 comments

What to know about latest rupture of the Bearspaw South feeder main

https://calgaryherald.com/news/feeder-main-break-2-0-q
1•petethomas•1h ago•0 comments

Runtime invariant to rule count in a single-pass boundary execution model

https://targetedwebresults.com/pounce-demo-final.gif
1•MKuykendall•1h ago•2 comments

Rembg: Remove Image Backgrounds

https://github.com/danielgatis/rembg
1•Olshansky•1h ago•0 comments

Leaks Predict $5000 RTX 5090 GPUs in 2026 Thanks to AI Industry Demand

https://www.techpowerup.com/344578/leaks-predict-usd-5000-rtx-5090-gpus-in-2026-thanks-to-ai-indu...
2•linksbro•1h ago•0 comments

Meta AI chief Alexandr Wang says will have kids only after Elon Musk's Neuralink

https://www.businessinsider.com/scale-ai-founder-alexandr-wang-meta-neuralink-kids-elon-musk-2025-6
1•radeeyate•1h ago•0 comments

Show HN: Chat with people who share the same Internet connection (= IP address)

https://ipchat.org
3•kkovacs•1h ago•3 comments