A Theory of Deep Learning

https://elonlit.com/scrivings/a-theory-of-deep-learning/
52•elonlit•1d ago

Comments

refulgentis•1h ago
This is a beautifully written way of saying “Some parts of what the network memorizes affect test behavior, and some don’t.” But that’s not a theory of deep learning; a grand unified theory would explain that.

We're given a signal channel and a reservoir. Signal lives in the channel, noise lives in the reservoir, and the reservoir supposedly doesn’t show up at test time.

Okay, but then the question is: why would SGD put the right things in the right bucket?

If the answer is “because the reservoir is defined as the stuff that doesn’t transfer to test,” then this is close to circular.

The Borges/Lavoisier stuff is a tell. “We have unified the field” rhetoric should come after nontrivial predictions and results. Claiming to solve benign overfitting, double descent, grokking, implicit bias, risk of training on population, how to avoid a validation set, and, last but not least, skipping training by analytically jumping to the end amounts to 6 theory papers, 3 NeurIPS winners, and a $10B startup. Let's get some results before we tell everyone we unified the field. :) I hope you're right.

dwrodri•1h ago
Admittedly there is probably some self-aggrandizing here, but I think empirical verification of that Adam modification alone would be a meaningful contribution, unless that's prior work?
airza•1h ago
A very fascinating read.

As a fellow Tufte CSS enjoyer, why is user-select turned off on the sidenotes? I would very much like to be able to copy and paste them.

prideout•57m ago
This is a fascinating mathematical framework, but the post title might be a bit of an overreach. I often wonder if "a theory of deep learning" could exist that could be stated succinctly and that could predict (1) scaling laws and (2) the surprising reliability of gradient descent.

Note that I said "predict" not "describe". It feels like we're still in the era of Kepler, not Newton.

jdw64•53m ago
Does anyone happen to know what font this site is using? It looks really elegant.
airza•43m ago
It is a modified version of ET_Book called ET_Bembo:

https://github.com/DavidBarts/ET_Bembo

jdw64•38m ago
I love u. thanks!
DataDaoDe•40m ago
Apparently it's the font used in Edward Tufte's books. It's on GitHub: https://edwardtufte.github.io/et-book/
smokel•29m ago
This essay seems to be related to the paper "There Will Be a Scientific Theory of Deep Learning" [1], which was discussed here recently [2].

[1] https://arxiv.org/pdf/2604.21691

[2] https://news.ycombinator.com/item?id=47893779

ks2048•29m ago
The relevant paper: "A Theory of Generalization in Deep Learning". https://arxiv.org/abs/2605.01172
