frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

After 8 years, I rewrote my open-source PyTorch curvature library

https://github.com/noahgolmant/pytorch-hessian-eigenthings
1•noahgolmant•1h ago

Comments

noahgolmant•1h ago
Back in 2018 I published pytorch-hessian-eigenthings, a niche open source package for GPU-accelerated curvature analysis of PyTorch models. Loss landscape curvature metrics like the eigenvalues of the Hessian have been implicated in many generalization properties of neural networks (like flat-minima hypotheses, low-rank Hessian claims, etc.). But the full Hessian costs memory quadratic in the parameter count, which is usually infeasible. This library uses Hessian-vector products + iterative methods (Lanczos, power iteration) to get the eigendecomposition in linear memory instead. I stepped away from the project for years, but it ended up being used by other researchers doing curvature analysis work. I noticed the original implementation had aged so I thought I'd revisit it. I also have more professional engineering experience under my belt to inform the design.

I just shipped a v1.0 rewrite. The new version adds new curvature operators (Generalized Gauss-Newton, empirical Fisher), and new algorithms (Hutchinson + Hutch++ trace estimation, spectral density via Stochastic Lanczos Quadrature). It also has a fused Triton/torch.compile cross-entropy Hessian-vector kernel for foundation-model-scale vocabularies (where standard implementations blow up). More importantly it adds a lot of numerical analysis validating the operators: closed-form correctness on linear/logistic regression where the Hessian is known analytically, and cross-library tests against curvlinops to catch any regressions.

https://github.com/noahgolmant/pytorch-hessian-eigenthings

I'm hoping to use it for some follow-up analysis. For example right now I'm looking at inter-agreement between various optimizer updates (Muon, K-FAC, Natural Gradient Descent) on Pythia checkpoints.

Very open to suggestions or requests from anyone who's been working in this space. I've been out of the field for a while, so pointers to recent work I should be aware of are very welcome.

AionDB: PostgreSQL-compatible SQL, graph, and vector database in Rust

https://aiondb.xyz/
1•ayoubnabil•1m ago•0 comments

PSVL 1.0 – The most comprehensive source-visible license (276 clauses)

https://github.com/BMBOMICH/PSVL
1•BMBOMICH•5m ago•0 comments

AutoScientist: Automating the Science of Model Training

https://www.adaptionlabs.ai/blog/autoscientist
1•sethbannon•7m ago•0 comments

Economic Futures – Anthropic

https://www.anthropic.com/economic-futures
2•gurjeet•8m ago•0 comments

2D map of 26,741M/CV papers from CVPR, NeurIPS, ICML, ICLR (2024–2025)

https://matejgazda.com/posts/paper-map.html
1•matog•9m ago•1 comments

Waiting for Website Changes in the Browser

https://alexwlchan.net/2026/livereload-in-browser/
1•ingve•12m ago•0 comments

Claude subscriptions no longer include Agent SDK and Claude -p usage

https://www.xda-developers.com/anthropics-claude-subscriptions-no-longer-include-agent-sdk-and-cl...
1•gitaarik•17m ago•0 comments

Silicon Valley Is Bracing for a Permanent Underclass

https://www.nytimes.com/2026/04/30/opinion/ai-labor-work-force-silicon-valley.html
2•imartin2k•17m ago•0 comments

Best TTS in 2026: Blind Benchmark

https://techstackups.com/articles/best-tts-2026-blind-benchmark/
1•ritzaco•17m ago•0 comments

Bun's rewrite in Rust was merged

https://github.com/oven-sh/bun/commit/23427dbc12fdcff30c23a96a3d6a66d62fdc091d
5•maxloh•23m ago•3 comments

Show HN: I built an AI travel camera because I was tired of Google Lens

https://apps.apple.com/us/app/spotter-the-travel-companion/id6761238251
1•XanMan•23m ago•0 comments

Show HN: AGEF, an open evidence format for AI agent sessions

https://github.com/radotsvetkov/agef
1•radotsvetkov•24m ago•0 comments

Solar Impulse 2 Crashes into Gulf

https://dronexl.co/2026/05/13/solar-impulse-2-crashes-gulf/
1•pingou•24m ago•0 comments

PHP RFC: Bound-Erased Generic Types

https://wiki.php.net/rfc/bound_erased_generic_types
2•choult•25m ago•0 comments

Rewrite Bun in Rust has been merged

https://github.com/oven-sh/bun/pull/30412
3•Chaoses•26m ago•1 comments

An Experimental PS1 emulator written in Zig

https://github.com/maxpoletaev/nupsx
2•Einenlum•30m ago•0 comments

People Would Rather Have Nuclear Power Plants in Their Area Than AI Data Centers

https://www.forbes.com/sites/maryroeloffs/2026/05/13/people-would-rather-have-nuclear-power-plant...
3•robtherobber•32m ago•3 comments

The gen on the family of 'vi' clones

https://jdebp.uk/FGA/vi-family.html
1•JdeBP•32m ago•1 comments

49,000 Lake Tahoe residents will lose power to data centers

https://www.shacknews.com/article/149126/lake-tahoe-residents-lose-power-data-centers
2•vrganj•34m ago•0 comments

RFV-0001: Request for Vibes

https://github.com/Request-For-Vibes/rfv
1•tomaytotomato•35m ago•0 comments

Dependency free charting library for .NET with over 40 charts, diagrams, etc.

https://github.com/EvotecIT/ChartForgeX
1•themadboy•36m ago•0 comments

Ask HN: I've been using AI for 3 years,I've lost the ability to think for myself

2•snasan•36m ago•5 comments

Gloop – A Self-Modifying AI Agent and TS Library

https://gloop.codes/
1•hypendev•38m ago•0 comments

Benchmarking Quant Backtesting Engines

https://medium.com/@DolphinDB_Inc/benchmarking-quant-backtesting-engines-dolphindb-vs-backtrader-...
1•CrazyTomato•40m ago•0 comments

BBC announces David Attenborough is returning to narrate Blue Planet III

https://themanc.com/trending/bbc-announces-david-attenborough-is-returning-to-narrate-blue-planet...
1•thunderbong•40m ago•0 comments

Dissatisfied: Three-fourths of AI customer service rollouts are a letdown

https://www.theregister.com/ai-ml/2026/05/13/ai-customer-service-bots-get-rolled-back-at-74-of-fi...
2•dijksterhuis•41m ago•0 comments

How to Pick Your Life Partner – Part 1 (2014)

https://waitbutwhy.com/2014/02/pick-life-partner.html
1•downbad_•42m ago•0 comments

Bridge Launches Computer Agent Beta

https://twitter.com/bridge_surf/status/2054600056263623046
1•Johnson8053•42m ago•0 comments

State media control shapes LLM behaviour by influencing training data

https://www.nature.com/articles/d41586-026-01486-9
3•XzetaU8•43m ago•0 comments

Save Your Tears

https://www.youtube.com/watch?v=xOsX5matrbE
1•ninjahawk1•45m ago•1 comments