frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What explains the recent surge in LLM coding capabilities?

3•orange_puff•1h ago
It seems like we are in the midst of another AI hype cycle. Many people are calling the current coding models an "inflection point", where now the capabilities are so high that future model growth will be explosive. I have heard serious people, like economics writer Noah Smith, make this argument [0].

But it's not just the commentariat. I have seen very serious people in software engineering and tech talk about the ways in which their coding habits have change drastically.

Benchmarks [1] alone don't seem to capture everything, although there have been jumps in the agentic sections, so maybe they actually do.

My question is; what explains these big jumps in capabilities that many serious people seem to be noticing all at once? Is it simply that we have thrown enough data and compute at the models, or instead, are labs perhaps fine-tuning models to get really good at tool calls, which leads to this new, surprising behavior?

When I explain agents to people, I usually walk them through a manual task one might go through when debugging code. You copy some code into ChatGPT, it asks you for more context, you copy some more code in, it suggests and edit, you edit and run, there is an error, so you paste that in, and so on. An agent is just an LLM in that loop which can use tools to do those things automatically. It would not be shocking to me if we took weaker models like Claude Opus 4.0 and made it 10x better at tool calls, it would be a much stronger and more impressive model. But is that all that is happening, or am I missing something big?

[0] https://substack.com/@noahpinion/p-187818379

[1] https://www.anthropic.com/news/claude-opus-4-6

Comments

coder4rover•47m ago
Quantum computing such that permutations of code to prompt is possible as it tries to answer to some kind of statistical probability solution.

The Re-Anchor Manager – Structured Session Handovers for AI Development

https://seekrates-ai.com/the-re-anchor-manager/
1•mohan-AIyer•58s ago•0 comments

Private Processing for WhatsApp – Technical White Paper and Security Guide

https://ai.meta.com/static-resource/private-processing-technical-whitepaper
2•doodlesdev•3m ago•0 comments

The most important part of an AI system-the human

https://gpt3experiments.substack.com/p/the-most-important-part-of-an-ai
2•nutanc•9m ago•0 comments

The Talents of the Procrastinator

https://www.psychologytoday.com/au/blog/fulfillment-at-any-age/202602/the-hidden-talents-of-the-p...
2•i7l•9m ago•0 comments

How to Solve the Tenor Shortage

https://www.economist.com/leaders/2026/02/12/how-to-solve-the-tenor-shortage
2•petethomas•13m ago•0 comments

What does the formation of a black hole look like? [video]

https://www.youtube.com/watch?v=oRSmMDH11Ss
1•ubercow13•14m ago•0 comments

Show HN: HareBoy – A Game Boy emulator written in Hare

https://github.com/drpaneas/hareboy
1•drpaneas•18m ago•0 comments

Underwear optional? The health pros and cons of going commando

https://www.theguardian.com/wellness/2026/feb/10/underwear-commando-pros-cons
1•andsoitis•21m ago•0 comments

Updated GitHub status page experience

https://github.blog/changelog/2026-02-13-updated-status-experience/
1•donutshop•24m ago•0 comments

The terrifying and efficient world of Olympic ski airlifts

https://www.latimes.com/sports/olympics/story/2026-02-13/inside-terrifying-efficient-world-of-oly...
2•bookofjoe•28m ago•1 comments

Four new astronauts arrive via SpaceX rocket at International Space Station

https://www.theguardian.com/science/2026/feb/14/international-space-station-full-crew
2•andsoitis•29m ago•0 comments

What happens when you put Claude, GPT, Grok, and DeepSeek in the same room?

https://warpmode.io
1•spranab•34m ago•1 comments

Ask HN: Alternatives to the Big 4 for SoC 2 compliance?

1•IsraCV•35m ago•0 comments

The myth of the high-tech heist

https://www.technologyreview.com/2026/02/13/1132397/myth-of-high-tech-heist/
1•gnabgib•36m ago•0 comments

Sonder is a word I like

https://www.autodidacts.io/sonder/
1•Curiositry•36m ago•0 comments

I built a bot to grab Berlinale film festival tickets that sell out in seconds

https://github.com/Rswcf/berlinale-ticket-buyer
1•rswcf•36m ago•1 comments

Narmada Human

https://en.wikipedia.org/wiki/Narmada_Human
1•thunderbong•39m ago•0 comments

Stitching Vision Encoders into LLMs: Clip vs. I-JEPA vs. ViT Comparison

https://teendifferent.substack.com/p/stitching-vision-into-llms-a-comparative
2•teendifferent•41m ago•1 comments

Bulletproof: A Look into Aéza

https://213.si/blog/bulletproof-a-look-into-aeza
1•dev213•47m ago•0 comments

NewPipe: YouTube client without vertical videos and algorithmic feed

https://newpipe.net/
16•nvader•48m ago•4 comments

Galactic Matter and Interstellar Flight [pdf]

http://large.stanford.edu/courses/2013/ph241/micks1/docs/bussard.pdf
2•bediger4000•49m ago•0 comments

Prayerfully journey through Lent on the Exodus 90 App

https://exodus90.com/how-lent-works/
1•nvader•49m ago•0 comments

The Battle of the Beams

https://en.wikipedia.org/wiki/Battle_of_the_Beams
4•jacquesm•51m ago•0 comments

I love the work of the ArchWiki maintainers

https://k7r.eu/i-love-the-work-of-the-archwiki-maintainers/
3•panic•51m ago•0 comments

Cuba's regime is in dire straits

https://www.economist.com/the-americas/2026/01/14/cubas-regime-is-in-dire-straits
3•ViktorRay•56m ago•1 comments

Anthropic's Public Benefit Mission

https://simonwillison.net/2026/Feb/13/anthropic-public-benefit-mission/
5•abdelhousni•59m ago•0 comments

States reliant on Colorado River fail to meet latest deadline to find consensus

https://apnews.com/article/colorado-river-arizona-california-nevada-water-45daf816feba9004c389dc4...
3•bikenaga•1h ago•0 comments

An open-source real-time motor driver for the Lego Orrery

https://gorkem.cc/projects/LegoOrreryMod/
1•gorkyver•1h ago•0 comments

Hardest Problem in Computer Science: Centering Things

https://tonsky.me/blog/centering/
1•signa11•1h ago•1 comments

I Have Nothing but Red Herring to Hide

https://theprivacydad.com/i-have-nothing-but-red-herring-to-hide/
1•theprivacydad•1h ago•0 comments