After 8 years, I rewrote my open-source PyTorch curvature library

https://github.com/noahgolmant/pytorch-hessian-eigenthings
16•noahgolmant•2d ago

Comments

noahgolmant•2d ago
Back in 2018 I published pytorch-hessian-eigenthings, a niche open-source package for GPU-accelerated curvature analysis of PyTorch models. Loss-landscape curvature metrics like the eigenvalues of the Hessian have been implicated in many generalization properties of neural networks (flat-minima hypotheses, low-rank Hessian claims, etc.). But materializing the full Hessian costs memory quadratic in the parameter count, which is usually infeasible. This library uses Hessian-vector products plus iterative methods (Lanczos, power iteration) to get the top eigenvalues and eigenvectors in memory linear in the parameter count instead. I stepped away from the project for years, but it ended up being used by other researchers doing curvature analysis. I noticed the original implementation had aged, so I thought I'd revisit it, this time with more professional engineering experience under my belt to inform the design.
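As a rough illustration of the Hessian-vector product plus power iteration idea (not the library's actual code or API), here is a minimal pure-Python sketch. Everything in it is an assumption made for illustration: the toy quadratic loss, the helper names `grad`, `hvp`, and `power_iteration`, and the use of central finite differences of the gradient in place of the autograd-based Hessian-vector product a PyTorch implementation would normally use.

```python
import math
import random

def grad(w):
    """Gradient of a toy loss 0.5 * w^T A w with A = [[3, 1], [1, 2]] (hypothetical example)."""
    return [3.0 * w[0] + 1.0 * w[1], 1.0 * w[0] + 2.0 * w[1]]

def hvp(grad_fn, w, v, eps=1e-3):
    """Approximate H @ v by central differences of the gradient.

    Only gradients are evaluated, so the Hessian is never stored.
    """
    plus = grad_fn([wi + eps * vi for wi, vi in zip(w, v)])
    minus = grad_fn([wi - eps * vi for wi, vi in zip(w, v)])
    return [(p - m) / (2.0 * eps) for p, m in zip(plus, minus)]

def power_iteration(grad_fn, w, steps=200, seed=0):
    """Estimate the largest-magnitude Hessian eigenvalue and its eigenvector at w."""
    rng = random.Random(seed)
    v = [rng.gauss(0.0, 1.0) for _ in w]
    norm = math.sqrt(sum(x * x for x in v))
    v = [x / norm for x in v]
    eigenvalue = 0.0
    for _ in range(steps):
        hv = hvp(grad_fn, w, v)
        # Rayleigh quotient v^T H v for the current unit vector v.
        eigenvalue = sum(a * b for a, b in zip(v, hv))
        norm = math.sqrt(sum(x * x for x in hv))
        v = [x / norm for x in hv]
    return eigenvalue, v

top_eigenvalue, top_vector = power_iteration(grad, [0.5, -0.25])
print(top_eigenvalue)  # close to (5 + sqrt(5)) / 2, about 3.618
```

Memory use here scales with the length of `w`, since the loop only ever holds a handful of vectors; that is the point of avoiding the explicit Hessian.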

I just shipped a v1.0 rewrite. The new version adds curvature operators (Generalized Gauss-Newton, empirical Fisher) and algorithms (Hutchinson and Hutch++ trace estimation, spectral density via Stochastic Lanczos Quadrature). It also has a fused Triton/torch.compile cross-entropy Hessian-vector kernel for foundation-model-scale vocabularies (where standard implementations blow up). More importantly, it adds a lot of numerical validation of the operators: closed-form correctness checks on linear/logistic regression, where the Hessian is known analytically, and cross-library tests against curvlinops to catch regressions.
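For readers unfamiliar with Hutchinson trace estimation, the idea can be sketched in a few lines of plain Python. This is only an illustrative sketch, not the library's implementation: the names `matvec` and `hutchinson_trace` and the 2x2 matrix are hypothetical, and `matvec` stands in for whatever Hessian-vector (or GGN-vector) product is available. The estimator averages z^T A z over random sign vectors z, which equals the trace in expectation.

```python
import random

def matvec(v):
    """Stand-in for a curvature-vector product; here A = [[2, 1], [1, 3]], trace 5."""
    return [2.0 * v[0] + 1.0 * v[1], 1.0 * v[0] + 3.0 * v[1]]

def hutchinson_trace(matvec_fn, dim, num_samples=2000, seed=0):
    """Estimate trace(A) using only matrix-vector products with A."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(num_samples):
        # Rademacher probe: each entry is +1 or -1 with equal probability.
        z = [rng.choice((-1.0, 1.0)) for _ in range(dim)]
        az = matvec_fn(z)
        total += sum(zi * ai for zi, ai in zip(z, az))
    return total / num_samples

estimate = hutchinson_trace(matvec, dim=2)
print(estimate)  # a stochastic estimate near the true trace of 5
```

The estimate is noisy, and the noise comes from the off-diagonal entries of the matrix; roughly speaking, Hutch++ reduces that variance by first handling the dominant eigendirections with a low-rank approximation.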

https://github.com/noahgolmant/pytorch-hessian-eigenthings

I'm hoping to use it for some follow-up analysis. For example, right now I'm looking at the agreement between various optimizer updates (Muon, K-FAC, natural gradient descent) on Pythia checkpoints.

Very open to suggestions or requests from anyone who's been working in this space. I've been out of the field for a while, so pointers to recent work I should be aware of are very welcome.

SANA-WM, a 2.6B open-source world model for 1-minute 720p video

https://nvlabs.github.io/Sana/WM/
122•mjgil•3h ago•53 comments

Accelerate

https://github.com/AccelerateHS/accelerate
25•tosh•2h ago•3 comments

Δ-Mem: Efficient Online Memory for Large Language Models

https://arxiv.org/abs/2605.12357
130•44za12•6h ago•27 comments

Accelerando (2005)

https://www.antipope.org/charlie/blog-static/fiction/accelerando/accelerando.html
119•eamag•4h ago•58 comments

Moving away from Tailwind, and learning to structure my CSS

https://jvns.ca/blog/2026/05/15/moving-away-from-tailwind--and-learning-to-structure-my-css-/
145•mpweiher•6h ago•73 comments

My Favorite Bugs: Invalid Surrogate Pairs

https://george.mand.is/2026/05/my-favorite-bugs-invalid-surrogate-pairs/
32•meysamazad•2h ago•10 comments

How an Australian Teen Team Is Making Radio Astronomy Affordable for Schools

https://mag.openrockets.com/p/how-an-australian-teen-team-is-making-radio-astronomy-affordable-fo...
4•openrockets•32m ago•0 comments

Greek Alphabet Cards

https://labs.randomquark.com/alphabet_cards/
34•ricochet11•3h ago•8 comments

DeepSeek-V4-Flash means LLM steering is interesting again

https://www.seangoedecke.com/steering-vectors/
22•Brajeshwar•46m ago•0 comments

Futhark by Example

https://futhark-lang.org/examples.html
75•tosh•5h ago•20 comments

Project Gutenberg – keeps getting better

https://www.gutenberg.org/
1056•JSeiko•23h ago•228 comments

Kyber (YC W23) Is Hiring a Founding Marketer

https://www.ycombinator.com/companies/kyber/jobs/1rLQAro-founding-marketer-content-community
1•asontha•3h ago

Points are a weird and inconsistent unit of measure

https://buttondown.com/hillelwayne/archive/points-are-a-weird-and-inconsistent-unit-of/
38•danborn26•2d ago•23 comments

Nearly 50 Years Later, WKRP in Cincinnati Becomes a Real Radio Station

https://www.openculture.com/2026/05/nearly-50-years-later-wkrp-in-cincinnati-becomes-a-real-radio...
56•bookofjoe•3d ago•28 comments

Tesla reveals two Robotaxi crashes involving teleoperators

https://techcrunch.com/2026/05/15/tesla-reveals-two-robotaxi-crashes-involving-teleoperators/
9•Brajeshwar•23m ago•1 comment

I believe there are entire companies right now under AI psychosis

https://twitter.com/mitchellh/status/2055380239711457578
1602•reasonableklout•19h ago•826 comments

Ploopy Bean: a trackpoint for every computer

https://ploopy.co/shop/bean-pointing-stick/
142•jibcage•3d ago•61 comments

Gaining control of every projector and camera on campus

https://www.edna.land/blogs/posts/scanning/
77•ednaordinary•2d ago•23 comments

The bird eye was pushed to an evolutionary extreme

https://www.quantamagazine.org/how-the-bird-eye-was-pushed-to-an-evolutionary-extreme-20260513/
173•sohkamyung•2d ago•61 comments

Frontier AI has broken the open CTF format

https://kabir.au/blog/the-ctf-scene-is-dead
246•frays•8h ago•215 comments

Orthrus-Qwen3: up to 7.8×tokens/forward on Qwen3, identical output distribution

https://github.com/chiennv2000/orthrus
160•FranckDernoncou•17h ago•25 comments

Fecal transplants for autism deliver success in clinical trials

https://refractor.io/adhd-autism/fecal-transplants-for-autism-delivers-success-in-clinical-trials/
164•breve•6h ago•117 comments

The Physics–and Physicality–Of Extreme Juggling (2018)

https://www.wired.com/story/the-physicsand-physicalityof-extreme-juggling/
14•ColinWright•3d ago•2 comments

What Were Ancient Greco-Roman Curse Tablets?

https://www.history.com/articles/what-were-ancient-roman-curse-tablets
3•speckx•3d ago•0 comments

Where to buy a non-Apple, non-Google smartphone

https://www.theregister.com/on-prem/2026/05/01/where-to-buy-a-non-apple-non-google-smartphone/521...
145•_____k•7h ago•90 comments

The main thing about P2P meth is that there's so much of it (2021)

https://dynomight.net/p2p-meth/
161•tomjakubowski•16h ago•188 comments

A 0-click exploit chain for the Pixel 10

https://projectzero.google/2026/05/pixel-10-exploit.html
407•happyhardcore•1d ago•221 comments

The sigmoids won't save you

https://www.astralcodexten.com/p/the-sigmoids-wont-save-you
251•Tomte•1d ago•238 comments

Naturally Occurring Quasicrystals

https://johncarlosbaez.wordpress.com/2026/05/14/naturally-occurring-quasicrystals/
118•lukeplato•2d ago•10 comments