Why AI systems don't learn – On autonomous learning from cognitive science

18•aanet•3h ago

Comments

aanet•3h ago

by Emmanuel Dupoux, Yann LeCun, Jitendra Malik

"he proposed framework integrates learning from observation (System A) and learning from active behavior (System B) while flexibly switching between these learning modes as a function of internally generated meta-control signals (System M). We discuss how this could be built by taking inspiration on how organisms adapt to real-world, dynamic environments across evolutionary and developmental timescales. "

dasil003•2h ago

If this was done well in a way that was productive for corporate work, I suspect the AI would engage in Machievelian maneuvering and deception that would make typical sociopathic CEOs look like Mister Rogers in comparison. And I'm not sure our legal and social structures have the capacity to absorb that without very very bad things happening.

marsten•1h ago

Agents playing the iterated prisoner's dilemma learn to cooperate. It's usually not a dominant strategy to be entirely sociopathic when other players are involved.

ehnto•23m ago

You don't get that many iterations in the real world though, and if one of your first iterations is particularly bad you don't get any more iterations.

iFire•16m ago

https://github.com/plastic-labs/honcho has the idea of one sided observations for RAG.

beernet•3h ago

The paper's critique of the 'data wall' and language-centrism is spot on. We’ve been treating AI training like an assembly line where the machine is passive, and then we wonder why it fails in non-stationary environments. It’s the ultimate 'padded room' architecture: the model is isolated from reality and relies on human-curated data to even function.

The proposed System M (Meta-control) is a nice theoretical fix, but the implementation is where the wheels usually come off. Integrating observation (A) and action (B) sounds great until the agent starts hallucinating its own feedback loops. Unless we can move away from this 'outsourced learning' where humans have to fix every domain mismatch, we're just building increasingly expensive parrots. I’m skeptical if 'bilevel optimization' is enough to bridge that gap or if we’re just adding another layer of complexity to a fundamentally limited transformer architecture.

jdkee•1h ago

LeCun has been talking about his JEPA models for awhile.

https://ai.meta.com/blog/yann-lecun-ai-model-i-jepa/

A Decade of Slug

Python 3.15's JIT is now back on track

Microsoft's 'unhackable' Xbox One has been hacked by 'Bliss'

Get Shit Done: A Meta-Prompting, Context Engineering and Spec-Driven Dev System

Mistral AI Releases Forge

Show HN: Sub-millisecond VM sandboxes using CoW memory forking

Launch HN: Kita (YC W26) – Automate credit review in emerging markets

Kagi Small Web

Launch an autonomous AI agent with sandboxed execution in 2 lines of code

Electron microscopy shows 'mouse bite' defects in semiconductors

It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video]

Unsloth Studio

Chrome extension adjusts video speed based on how fast the speaker is talking

Show HN: Fatal Core Dump – A debugging murder mystery played with GDB

Torturing Rustc by Emulating HKTs

Edge.js: Run Node apps inside a WebAssembly sandbox

Ryugu asteroid samples contain all DNA and RNA building blocks

Honda is killing its EVs

Node.js needs a virtual file system

'The Secret Agent': Exploring a Vibrant, yet Violent Brazil (2025)

Why AI systems don't learn – On autonomous learning from cognitive science

Meta and TikTok let harmful content rise to drove engagement, say whistleblowers

Spice Data (YC S19) Is Hiring a Product Specialist

OpenSUSE Kalpa

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

Show HN: I built an interactive 3D three-body problem simulator in the browser

Java 26 is here

Meta Horizon Worlds on Meta Quest is being discontinued

Reverse-engineering Viktor and making it open source

Show HN: Crust – A CLI framework for TypeScript and Bun

A Decade of Slug

Python 3.15's JIT is now back on track

Microsoft's 'unhackable' Xbox One has been hacked by 'Bliss'

Get Shit Done: A Meta-Prompting, Context Engineering and Spec-Driven Dev System

Mistral AI Releases Forge

Show HN: Sub-millisecond VM sandboxes using CoW memory forking

Launch HN: Kita (YC W26) – Automate credit review in emerging markets

Kagi Small Web

Launch an autonomous AI agent with sandboxed execution in 2 lines of code

Electron microscopy shows 'mouse bite' defects in semiconductors

It Took Me 30 Years to Solve This VFX Problem – Green Screen Problem [video]

Unsloth Studio

Chrome extension adjusts video speed based on how fast the speaker is talking

Show HN: Fatal Core Dump – A debugging murder mystery played with GDB

Torturing Rustc by Emulating HKTs

Edge.js: Run Node apps inside a WebAssembly sandbox

Ryugu asteroid samples contain all DNA and RNA building blocks

Honda is killing its EVs

Node.js needs a virtual file system

'The Secret Agent': Exploring a Vibrant, yet Violent Brazil (2025)

Why AI systems don't learn – On autonomous learning from cognitive science

Meta and TikTok let harmful content rise to drove engagement, say whistleblowers

Spice Data (YC S19) Is Hiring a Product Specialist

OpenSUSE Kalpa

Show HN: Horizon – GPU-accelerated infinite-canvas terminal in Rust

Show HN: I built an interactive 3D three-body problem simulator in the browser

Java 26 is here

Meta Horizon Worlds on Meta Quest is being discontinued

Reverse-engineering Viktor and making it open source

Show HN: Crust – A CLI framework for TypeScript and Bun

Why AI systems don't learn – On autonomous learning from cognitive science

Comments