frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Inside An LLM

1•TimeForAChange•1d ago
I’ve always been amazed by children.

They are sponges.

Give them something to learn and they learn it quickly. Too quickly.

Psychologists call this memory plasticity.

A child can absorb sensory information, hold it together, and make sense of it almost immediately.

Learning doesn’t arrive one piece at a time. It happens in parallel.

Many impressions, held at once, until patterns begin to stand out on their own.

As we grow older, that plasticity fades. We stop absorbing so easily.

We carry more. But we change less.

In 2017, a Google research paper helped ignite the current wave of AI. Its title was simple:

All You Need Is Attention.

The idea was not to hand-build understanding. Not to carefully specify every connection in advance.

Instead: turn experience into tokens, examine their relationships all at once, and let structure emerge.

Up to that point, much of AI had tried to design intelligence explicitly. Representations. Connections. Rules.

It worked. But slowly. At enormous cost.

The new proposal was different. Just throw everything at it. Let the system figure it out.

In other words: teach the system the way a baby learns.

But the environments are not the same.

Children learn by being immersed in the world. Large language models learn by being immersed in the internet.

One of these environments contains playgrounds, stories, and banged knees.

The other contains comment sections. At scale.

And then there is a hard boundary.

At some point, the learning must stop.

The figuring-out is frozen into place— for better or worse— so the system can be used.

An LLM may have learned a great deal. But it has learned only what was present in its training.

This is what developers mean when they say a model is stateless.

It does not progress. It does not accumulate.

It resets.

Each time you use it, you are meeting the same frozen system again.

It may be intelligent. But it cannot learn more than it already knows— except for what you place in the prompt.

And when the session ends, that too disappears.

This has become a quiet frustration for many users.

Because the question isn’t whether these systems are intelligent.

It’s whether intelligence without the ability to change is learning at all.

---

Also on Medium: https://medium.com/@roger_gale/where-mistakes-go-to-learn-51a82a6f1187

If you enjoyed this, I'm writing a series on AI limitations and learning.

Language Modeling, Part 2: Training Dynamics

https://connorjdavis.substack.com/p/language-modeling-part-2-training
1•cjamsonhn•18s ago•0 comments

Global Revolution: Sven Beckert's "Capitalism"

https://www.jdbreport.com/p/sven-beckert-capitalism
1•JameDB•1m ago•0 comments

The Concerning Amount of Malware on the VS Code Marketplace

https://github.com/microsoft/vsmarketplace/blob/main/RemovedPackages.md
1•shadow-ninja•1m ago•0 comments

The JDB Report

https://www.jdbreport.com/
1•JameDB•2m ago•0 comments

Nvidia wants to power robotaxi fleets with chips, software by 2027

https://www.cnbc.com/2026/01/05/nvidia-plans-to-test-a-robotaxi-service-in-2027-in-self-driving-p...
1•gmays•5m ago•0 comments

Apple Watch blood sugar monitoring a step closer as new tech launches

https://9to5mac.com/2026/01/08/apple-watch-blood-sugar-monitoring-a-step-closer-as-new-tech-launc...
1•brandonb•9m ago•0 comments

GPT-4o selected optimal CT protocols more frequently than Radiologists

https://doi.org/10.1148/radiol.252105
1•hdoMRIphysics•11m ago•1 comments

I'm making a game engine based on dynamic signed distance fields (SDFs) [video]

https://www.youtube.com/watch?v=il-TXbn5iMA
3•camilleroux•12m ago•0 comments

Mole: Deep clean and optimize your Mac

https://github.com/tw93/Mole
1•handfuloflight•13m ago•0 comments

Manim Has Been Hacked

https://manim.community/
1•aizk•14m ago•4 comments

Show HN: Various Shape Regularization Algorithms

https://github.com/nickponline/shreg
1•nickponline•14m ago•0 comments

Delve AI Audit Fraud

1•mrteflon•25m ago•0 comments

In 2026, I Resolve to Friction-Maxx

https://www.thecut.com/article/brooding-friction-maxxing-new-years-2026-resolution.html
1•walterbell•26m ago•1 comments

System Design for Production Diffusion LLM Serving with Limited Memory Footprint

https://arxiv.org/abs/2512.17077
1•PaulHoule•26m ago•0 comments

A website that auctions itself daily. Winner gets control of a Codex editor

https://www.thedailyauction.com/
1•nsomani•28m ago•1 comments

Why AI Boosts Creativity for Some Employees but Not Others

https://hbr.org/2026/01/why-ai-boosts-creativity-for-some-employees-but-not-others
3•apparent•31m ago•0 comments

Trapiche – A cloud dos devs brasileiros

https://trapiche.cloud/
1•acfilho•36m ago•0 comments

Surprising Places Where Ada Is Used

https://www.adacore.com/blog/surprising-places-where-ada-is-used
2•GhosT078•37m ago•1 comments

My Electronics Blog Article: Why I Loathe TTL

https://mecrisp-stellaris-folkdoc.sourceforge.io/ttl-7400-series.html
2•oldguy101•40m ago•0 comments

The Godfather – Vogue Movie Review – May 1972

https://archive.vogue.com/article/1972/5/the-godfather
3•georgecmu•44m ago•1 comments

Nvidia CEO Jensen Huang says robots could be 'AI immigrants'

https://www.tomshardware.com/tech-industry/nvidia-ceo-jensen-huang-says-robots-are-ai-immigrants-...
3•panic•46m ago•1 comments

Logistics Is Dying; Or – Dude, Where's My Mail?

https://lagomor.ph/2026/01/logistics-is-dying-or-dude-wheres-my-mail/
3•ChilledTonic•47m ago•0 comments

Angular Version 21 EStore and Shopping Cart Prototype

https://horsecode-e216d.web.app/signup-login
1•lstanikmas•50m ago•1 comments

Asus on integrated graphics in gaming laptops being the norm

https://www.tomsguide.com/computing/i-asked-asus-about-integrated-graphics-on-gaming-laptops-beco...
7•not4uffin•50m ago•1 comments

iOS 26 still struggles to gain traction with iPhone users

https://www.cultofmac.com/news/ios-26-adoption-struggles-with-iphone-users
8•layer8•51m ago•4 comments

The most influential game design articles

https://www.reddit.com/r/gamedesign/s/OPbhQRnWHT
3•azhenley•55m ago•0 comments

Zirgen: Compiler for a Domain-Specific Language

https://github.com/risc0/zirgen
1•0xkato•58m ago•0 comments

Test your MCP Server for spec compliance, security, and agent-friendliness

https://mcpscan.dev/
2•norcalkc•58m ago•0 comments

Avoiding TanStack Form Pitfalls

https://matthuggins.com/blog/posts/avoiding-tanstack-form-pitfalls
1•matthuggins•59m ago•0 comments

Nvidia Brute-Force Bubble: Why 90% of Physics AI Compute Is a Mathematical Waste

https://github.com/isaac-sim/IsaacSim/discussions/394
3•ZuoCen_Liu•1h ago•1 comments