Leaving Meta and PyTorch

https://soumith.ch/blog/2025-11-06-leaving-meta-and-pytorch.md.html
167•saikatsg•2h ago

Comments

msmd74•1h ago
Sounds like you had a momentous run.

If you take advice from reformed Internet trolls, consider turning off all your devices and trying to give yourself at least a week, but ideally a month offline staring at your new baby. You'll never get that time back and there's nothing your brain will appreciate more than loading up those memories as they grow.

Good luck.

qmatch•1h ago
As a loyal JAX user, I hope they can play catch-up. PyTorch has dominated the AI scene since TF1 fumbled the ball at the 10-yard line. What Matt Johnson has done turning Autograd into JAX will hopefully be worthy of as much praise as Soumith has received.
n_u•1h ago
> PyTorch has dominated the AI scene since TF1 fumbled the ball at the 10-yard line

can you explain why you think TensorFlow fumbled?

zapnuk•1h ago
For me it was about 8 years ago. Back then TF was already bloated and had two weaknesses: its bet on static compute graphs made writing code verbose and debugging difficult.

The few people I knew back then used Keras instead. I switched to PyTorch for my next project, which was more "batteries included".

intermerda•1h ago
Do you have experience in both JAX and PyTorch? Why do you prefer JAX?
chopete3•1h ago
>>Every major AI company and hardware vendor are on a speed dial. This kind of power is really hard to give up. But curiosity ultimately won out in my head.

Such a simple feeling has so much power. May he get the opportunity to create one more powerful tool before retiring.

perfmode•1h ago
Respect.
mxkopy•1h ago
PyTorch is one of those tools that’s so simple and easy to take apart that you feel like you might’ve been able to make it yourself. I can’t imagine how much engineering effort was behind all those moments where I thought to myself, “of course it should work like that, how can it be any other way?”
TechnicolorByte•1h ago
Can anyone recommend a technical overview describing the design decisions PyTorch made that led it to win out?
huevosabio•1h ago
I don't know the full list, but back when it came out, TF felt like a crude set of bindings to the underlying C++/CUDA workhorse. PyTorch, in contrast, felt Pythonic. It was much closer in feel to NumPy.
puttycat•1h ago
I think it was mostly eager evaluation, which made it possible to debug every step of the network's forward and backward passes. TensorFlow didn't have that at the time, which made debugging practically impossible.
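
To make "debug every step" concrete, here is a minimal sketch of eager, define-by-run automatic differentiation in plain Python. It is a toy written for illustration, not PyTorch's implementation: the `Value` class, the `forward` function, and the `trace` list are all hypothetical names. The point it shows is that every intermediate result is an ordinary number the moment it is computed, so it can be printed or inspected in a debugger mid-forward, and gradients can be read off each node after the backward pass.

```python
# Toy eager-mode autodiff on scalars; illustrative only, not PyTorch's code.
class Value:
    def __init__(self, data, parents=(), grad_fns=()):
        self.data = data            # concrete number, available immediately
        self.grad = 0.0
        self._parents = parents     # graph edges recorded as the code runs
        self._grad_fns = grad_fns   # local derivative rule for each parent

    def __add__(self, other):
        return Value(self.data + other.data, (self, other),
                     (lambda g: g, lambda g: g))

    def __mul__(self, other):
        return Value(self.data * other.data, (self, other),
                     (lambda g: g * other.data, lambda g: g * self.data))

    def backward(self):
        # Topologically order the recorded graph, then apply the chain rule.
        order, seen = [], set()

        def visit(v):
            if id(v) not in seen:
                seen.add(id(v))
                for p in v._parents:
                    visit(p)
                order.append(v)

        visit(self)
        self.grad = 1.0
        for v in reversed(order):
            for p, fn in zip(v._parents, v._grad_fns):
                p.grad += fn(v.grad)


def forward(x, w, b, trace):
    h = x * w
    trace.append(("h", h.data))     # intermediate is inspectable mid-forward
    y = h + b
    trace.append(("y", y.data))
    return y


x, w, b = Value(2.0), Value(3.0), Value(1.0)
trace = []
y = forward(x, w, b, trace)
y.backward()
print(trace, x.grad, w.grad, b.grad)
```

Dropping a `breakpoint()` call anywhere inside `forward` would pause with real values in scope, which is the property the comment above is describing.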
GistNoesis•1m ago
PyTorch's choice of a dynamic computation graph [1] made it easier to debug and to implement models, leading to higher adoption, even though running speed was initially slower (and training cost therefore higher).

Other decisions follow from this one.

TensorFlow started with static graphs and had to move to dynamic ones in version 2.0, which broke everything. The result was fragmentation between TensorFlow 1, TensorFlow 2, Keras, and JAX.

PyTorch's compilation of this computation graph erased TensorFlow's remaining edge.

Is the battle over? From a purely computational standpoint, PyTorch's solution is very far from optimal, and billions of dollars of electricity and GPUs are burned every year, but major players are happy with circular deals to entrench their positions. So at the current pace of AI code development, it will probably be one or two years before PyTorch is old history.

[1] https://www.geeksforgeeks.org/deep-learning/dynamic-vs-stati...
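
The static versus dynamic distinction discussed above can be sketched in a few lines of plain Python. This is a simplified illustration under stated assumptions, not the API of TensorFlow or PyTorch: `Node`, `placeholder`, `mul`, `add`, `run`, and `dynamic` are hypothetical names. In the static style the program first builds a symbolic structure and only later feeds numbers through it; in the dynamic style the computation is ordinary code, so control flow can depend on the values themselves.

```python
# Static ("define-then-run"): build a symbolic graph first, evaluate later.
class Node:
    def __init__(self, op, inputs=(), name=None):
        self.op, self.inputs, self.name = op, inputs, name


def placeholder(name):
    return Node("placeholder", name=name)


def mul(a, b):
    return Node("mul", (a, b))


def add(a, b):
    return Node("add", (a, b))


def run(node, feed):
    # Values only exist here, far from the line that defined the operation.
    if node.op == "placeholder":
        return feed[node.name]
    args = [run(i, feed) for i in node.inputs]
    return args[0] * args[1] if node.op == "mul" else args[0] + args[1]


x = placeholder("x")
graph = add(mul(x, x), x)          # no numbers yet, only structure
static_result = run(graph, {"x": 3.0})


# Dynamic ("define-by-run"): ordinary Python, executed as it is written.
def dynamic(value):
    steps = 0
    while value < 100.0:           # loop length is decided by the data
        value = value * value + value
        steps += 1
    return value, steps


dynamic_result, steps = dynamic(3.0)
print(static_result, dynamic_result, steps)
```

In the static version, an error surfaces inside `run`, with no direct link back to the line that built the offending node, which is one way to read the debugging complaints in this thread. The trade-off is that a fully known graph is easier to optimize ahead of time.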

BoredPositron•1h ago
The last few years must have been incredibly exhausting. Thanks for your work, good luck, and 73.
vintermann•1h ago
That man has an infectious enthusiasm. I remember the DCGAN paper inspired me to try getting the (Lua) Torch code to work, and I tried it on the Oxford flowers dataset early on. It worked surprisingly well, and Soumith Chintala even shared it around on social media, surprised at how well it worked on such a small dataset. Of course, back then we didn't really appreciate the problem of mode collapse.

PyTorch and old Lua Torch were a pleasure to work with compared to the contemporary TensorFlow. Lots of S.C.'s code was copied around liberally. It had its quirks (I remember the DCGAN code had a pretty odd way of doing parameter passing), but it was also really easy to understand and made random people like me feel like we had suddenly stumbled onto something crazy powerful (which we had!). It was wonderfully hackable.

utopiah•8m ago
What I find most interesting about this is that it shows they believe there is nothing unique at Meta related to AI. There is no resource, whether people or computing power, that they can't get elsewhere for whatever they believe would be more interesting to them.

I mention this because it feels analogous to military research, where people "dream" of how advanced the military is, how forward they are compared to public research... and yet, it seems to be a recurring myth they love to sustain.

So the signal I get here is that AI "labs" in Big Tech have nothing worth waiting for around the corner; it's just more of the same, and boring for the people who stick around.

aabhay•2m ago
For anyone that’s curious, the underlying Torch library is also a joy to work with, as are the many other torch bindings. For example, Rust has tch and Burn which both work with libtorch.

PyTorch of course has the benefit of being dynamically debuggable. I can’t forget the first time I set a breakpoint in my PyTorch model and wrote PyTorch calls in the terminal to inspect its behavior. That’s still something I miss a lot now that I’m working with only “fast” compiled code.
