frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Why Fei-Fei Li and Yann LeCun Are Both Betting on "World Models"

https://entropytown.com/articles/2025-11-13-world-model-lecun-feifei-li/
62•signa11•1h ago

Comments

techblueberry•1h ago
I don’t know enough about this to be sure, but this feels like a white whale.
andrewflnr•1h ago
Human-level language was a white whale just a few years ago.
krainboltgreene•59m ago
A.L.I.C.E. was published in '95.
CrackerNews•1h ago
I think video and agentic and multimodal models have led to this point, but actually making a world model may provide to be long and difficult.

I feel LeCun is correct that LLMs as of now have limitations where it needs an architectural overhaul. LLMs now have a problem with context rot, and this would hamper with an effective world model if the world disintegrates and becomes incoherent and hallucinated over time.

It'd doubtful whether investors would be in for the long haul, which may explain the behavior of Sam Altman in seeking government support. The other approaches described in this article may be more investor friendly as there is a more immediate return with creating a 3D asset or a virtual simulation.

Fricken•1h ago
A trillion dollars are now riding on that white whale. An entire naval fleet is being raised for the purposes of chasing down that whale. LeCun and Fei-Fei merely believe that the whale is in a different ocean.
andrewflnr•1h ago
If I was smarter, I would have predicted that not only would everyone else figure out that world models are a critical step, but that as a direct consequence the term "world model" would lose all meaning. Maybe next time. That said, Le Cunn's concept in the blog post is the only one worthy of the title.
Marshferm•43m ago
Control theory and cog-sci are impaired ideas. There is no mind, and cog sci is a post hoc retrofit narrated onto brains, rather than experience as events integrated. Cog sci is words sportscasting synthetic categories.

LeCun's model will fail as the idea of world model is oxymoronic, brains don't need them and the world isn't modeled, all models are wrong, the world is experienced instantaneously in optic flow that's built atop of olfaction.

https://www.eneuro.org/content/7/4/ENEURO.0069-20.2020

Any real AI that veers at control will have to adopt a neurobio path

https://tbrnewsmedia.com/sbus-sima-mofakham-chuck-mikell-des...

That's built paradoxically from unpredictability

https://pubmed.ncbi.nlm.nih.gov/38579270/

MangoToupe•28m ago
All abstraction of reality are bound to fail, but some abstractions are more convincing (or indeed more useful) than others.

> Any real AI that veers at control will have to adopt a neurobio path

Maybe. Or maybe it's a useless distraction. Only time will tell what signals are meaningful.

Marshferm•22m ago
Neuro is the experience integrating allo/egocentric. We've already crossed that threshold in vision depth meets allocortex behaviors in entertainment. Ie there's more intelligence in The Shining than anything in current folk science AI/cog sci. It's a resounding flop, so will the Gaussian and the psychobabble of LeCuns as it is a psychological approach.
ryandv•15m ago
> There is no mind

Interesting. What is your response to the cogito?

benatkin•1h ago
Whether or not this is exactly the same thing, I find this glossary entry from NVIDIA interesting: https://www.nvidia.com/en-us/glossary/world-models/
ChrisArchitect•1h ago
Earlier: https://news.ycombinator.com/item?id=45914363
IntrepidPig•1h ago
I always felt like one of reasons LLMs are so good is that they piggyback on the many years that have gone into developing language as an information representation/compression format. I don’t know if there’s anything similar a world model can take advantage of.

That being said there have been models which are pretty effective at other things that don’t use language, so maybe it’s a non issue.

ares623•45m ago
I will gladly take $10B to find out for you.
allenleee•1h ago
With all due respect, AI is ultimately a capital game. World models aren’t where real B2B customer revenue comes from—at least compared to today’s LLMs; they’re mainly a better story for raising huge amounts of private capital. Hopefully they figure out how to build the next-gen AI architecture along the way.
echelon•57m ago
The most useful models are image, video, and audio models. It makes sense that we'd make the video models more 4D aware.

Text really hogged all the attention. Media is where AI is really going to shine.

Some of the most profitable models right now are in music, image, and video generation. A lot of people are having a blast doing things they could legitimately never do before, and real working professionals are able to use the tools to get 1000x more done - perhaps providing a path to independence from bigger studios, and certainly more autonomy for those not born into nepotism.

As long as companies don't over-raise like OpenAI, there should be a smooth gradient from next gen media tools to revolutionary future stuff like immersive VR worlds that you can bend like the Matrix or Holodeck.

And I'll just be exceedingly chuffed if we get open source and highly capable world models from the Chinese that keep us within spitting distance of the unicorns.

Aperocky•28m ago
That just sounds like text with extra steps.

Fundamentally what AGI is trying to do is to encode ability to logic and reason. Tokens, images, video and audio are all just information of different entropy density that is the output of that logic reasoning process or emulation of logic reasoning process.

ryandv•19m ago
> Fundamentally what AGI is trying to do is to encode ability to logic and reason.

No? The Wason selection task has shown that logic and reason are not really core nor essential to human cognition.

It's really verging on speculation, but see chapter 2 of Jaynes 1976 - in particular the section on spatialization and the features of consciousness.

danielmarkbruce•27m ago
>> The most useful models are image, video, and audio models

This is wrong. The vast majority of revenue is being generated by text models because they are so useful.

MangoToupe•31m ago
> World models aren’t where real B2B customer revenue comes from

You could say the same thing about AGI. Ultimately capital will realize intelligence is a drawback.

philipkiely•1h ago
I played with Marble yesterday, Fei-Fei/World Labs' new product.

It is the most impressed I've been with an AI experience since the first time I saw a model one-shot material code.

Sure, its an early product. The visual output reminds me a lot of early SDXL. But just look at what's happened to video in the last year and image in the last three. The same thing is going to happen here, and fast, and I see the vision for generative worlds for everything from gaming/media to education to RL/simulation.

CrackerNews•51m ago
Marble appears to be like HunyuanWorld to me, but this time they marketed it as a first step to a world model, and it has multimodal capabilities.
nmfisher•16m ago
I wasn't actually able to use it because the servers were overloaded. What exactly impressed you (or more generally, what does it actually let you do at the moment?).
IAmGraydon•58m ago
The LLM grift is burned up, so this is the next thing. It has just enough new magic tricks to wow the VCs who don't really get what's going on here. I think this comment from the article says it all:

“Taking images and turning them into 3D environments using gaussian splats, depth and inpainting. Cool, but that’s a 3D GS pipeline, not a robot brain.”

ares623•44m ago
It’s for the VCs who missed out early. Now’s their chance!
skywhopper•17m ago
Because they are smart enough to realize current LLM tech is nearing a dead end and cannot serve as a full AGI, even ignoring context and hallucination issues, without actual knowledge of the real world.
lumost•15m ago
Most world models so far are based on transformers, no?
ripe•11m ago
And the pendulum swings back toward representation. It is becoming clear that the LLM approach is not adequate to reach what John McCarthy called human-level intelligence:

Between us and human-level intelligence lie many problems. They can be summarized as that of succeeding in the "common-sense informatic situation". [1]

And the search continues...

[1] https://www-formal.stanford.edu/jmc/human.pdf

FakeBlueSamurai•4m ago
Le Cunn's talk at Harvard informs how far behind he is.

Nano Banana can be prompt engineered for nuanced AI image generation

https://minimaxir.com/2025/11/nano-banana-prompts/
519•minimaxir•10h ago•135 comments

Zed is our office

https://zed.dev/blog/zed-is-our-office
513•sagacity•12h ago•255 comments

Why Fei-Fei Li and Yann LeCun Are Both Betting on "World Models"

https://entropytown.com/articles/2025-11-13-world-model-lecun-feifei-li/
64•signa11•1h ago•29 comments

Copyright winter is coming (to Wikipedia?)

https://authorsalliance.substack.com/p/copyright-winter-is-coming-to-wikipedia
51•the-mitr•1h ago•35 comments

650GB of Data (Delta Lake on S3). Polars vs. DuckDB vs. Daft vs. Spark

https://dataengineeringcentral.substack.com/p/650gb-of-data-delta-lake-on-s3-polars
115•tanelpoder•7h ago•31 comments

Launch HN: Tweeks (YC W25) – Browser extension to deshittify the web

https://www.tweeks.io/onboarding
199•jmadeano•12h ago•149 comments

How to Get a North Korea / Antarctica VPS

https://blog.lyc8503.net/en/post/asn-5-worldwide-servers/
24•uneven9434•3h ago•8 comments

OpenMANET Wi-Fi HaLow open-source project for Raspberry Pi–based MANET radios

https://openmanet.net/
83•hexmiles•7h ago•24 comments

Fannie Mae officials ousted after sounding alarm on sharing confidential data

https://apnews.com/article/fannie-mae-freddie-mac-firing-pulte-data-a4f8c53df74fef83ec7fd07e3d524746
62•consumer451•1h ago•21 comments

I Built a One File Edge Probe to Tell Me When Time Is Lying

https://physical-ai.ghost.io/a-one-file-pwa-to-tell-you-when-time-is-lying/
23•boulevard•1w ago•2 comments

Kubernetes Ingress Nginx is retiring

https://www.kubernetes.dev/blog/2025/11/12/ingress-nginx-retirement/
61•TheApplicant•6h ago•18 comments

Think in math, write in code (2019)

https://www.jmeiners.com/think-in-math/
127•alabhyajindal•4d ago•47 comments

Apple Mini Apps Partner Program

https://developer.apple.com/programs/mini-apps-partner/
83•soheilpro•3h ago•55 comments

Blue Origin lands New Glenn rocket booster on second try

https://techcrunch.com/2025/11/13/blue-origin-lands-new-glenn-rocket-booster-on-second-try/
286•perihelions•7h ago•147 comments

SIMA 2: An agent that plays, reasons, and learns with you in virtual 3D worlds

https://deepmind.google/blog/sima-2-an-agent-that-plays-reasons-and-learns-with-you-in-virtual-3d...
182•meetpateltech•13h ago•72 comments

Blender Lab

https://www.blender.org/news/introducing-blender-lab/
212•radeeyate•14h ago•44 comments

SlopStop: Community-driven AI slop detection in Kagi Search

https://blog.kagi.com/slopstop
349•msub2•9h ago•165 comments

Show HN: DBOS Java – Postgres-Backed Durable Workflows

https://github.com/dbos-inc/dbos-transact-java
58•KraftyOne•8h ago•33 comments

Disrupting the first reported AI-orchestrated cyber espionage campaign

https://www.anthropic.com/news/disrupting-AI-espionage
190•koakuma-chan•10h ago•122 comments

Texas A&M to restrict faculty from advocating "race and gender ideology"

https://www.texastribune.org/2025/11/13/texas-am-regents-race-gender-ideology-course-audit/
9•geox•1h ago•0 comments

Piramidal (YC W24) Hiring: Front End Engineer

https://www.ycombinator.com/companies/piramidal/jobs/i9yNX5s-front-end-engineer-user-interface
1•dsacellarius•7h ago

Itiner-E – The Digital Atlas of Ancient Roads

https://itiner-e.org/
25•beatthatflight•1w ago•1 comments

Why I'm Learning Sumerian

https://mindthenerd.com/why-im-learning-sumerian-and-what-it-taught-me-about-hard-work-burnout-an...
5•surprisetalk•1w ago•0 comments

The Eggstraordinary Fortress

https://ahmed1011001.github.io/Notes/stories/eggstrodinary.html
51•tippa123•10h ago•19 comments

A Brutal Look at Balanced Parentheses, Computing Machines, and Pushdown Automata

https://raganwald.com/2019/02/14/i-love-programming-and-programmers.html
7•warrenm•1w ago•1 comments

How to fix subsystem request failed on channel 0

https://blog.x-way.org/Linux/2025/11/06/How-to-fix-subsystem-request-failed-on-channel-0.html
27•speckx•1w ago•9 comments

The emergence and diversification of dog morphology

https://www.science.org/doi/10.1126/science.adt0995
30•Marshferm•5h ago•17 comments

Android developer verification: Early access starts

https://android-developers.googleblog.com/2025/11/android-developer-verification-early.html
1295•erohead•1d ago•616 comments

Steam Machine

https://store.steampowered.com/sale/steammachine
2668•davikr•1d ago•1294 comments

Remind: A sophisticated calendar and alarm program

https://dianne.skoll.ca/projects/remind/
45•n3t•1w ago•9 comments