> If we want to understand the human mind in its entirety, we must move from domain-specific theories to an integrated one
I would prefer that the feasibility of a unified, integrated theory be established first.
> An important step towards a unified theory of cognition is to build a computational model that can predict and simulate human behaviour in any domain
Why is this step important? Do LLMs qualify as a valid computational model that explains cognition? Which other steps lead to this one?
> Centaur was designed in a data-driven manner by fine-tuning a state-of-the-art large language model
These statements are contradictory. You don't design a large language model; you design its inference engine. Calling it a "design" implies you had blueprint-like plans for its desired outcome. And this isn't only a definitional argument: language models are fine-tuned on selected input data, and that selection practice, in a research setting, raises a red flag.
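To make that concrete, here is a minimal sketch (mine, not the paper's actual recipe) of what "designing" a fine-tuned model amounts to in practice: pick a base checkpoint and curate the text it sees, then run an off-the-shelf training loop. The model name and transcripts below are placeholders:

```python
# Minimal sketch, not the paper's pipeline: the only real "design" levers are
# (a) the base checkpoint and (b) which transcripts make it into the data.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

base = "sshleifer/tiny-gpt2"  # tiny stand-in, not the model the paper used
tok = AutoTokenizer.from_pretrained(base)
tok.pad_token = tok.eos_token

# (b) the actual "design" decision: data curation.
transcripts = [
    "You press <<J>> and receive 54 points.",
    "You press <<F>> and receive 12 points.",
]

def encode(batch):
    enc = tok(batch["text"], truncation=True, padding="max_length", max_length=32)
    enc["labels"] = [ids.copy() for ids in enc["input_ids"]]
    return enc

ds = (Dataset.from_list([{"text": t} for t in transcripts])
      .map(encode, batched=True, remove_columns=["text"]))

# (a) plus a generic training loop; there is no blueprint for the outcome.
model = AutoModelForCausalLM.from_pretrained(base)
Trainer(
    model=model,
    args=TrainingArguments(output_dir="out", num_train_epochs=1, report_to=[]),
    train_dataset=ds,
).train()
```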
> We transcribed each of these experiments into natural language, which provides a common format for expressing vastly different experimental paradigms
The references for this translation process come from the same authors and build on models that are not open-weight. This raises a red flag.
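For context, this is roughly the kind of transcription I understand is being done: a structured trial flattened into a natural-language string. The schema, wording, and choice markers below are my guesses, not the authors' published pipeline:

```python
# Sketch of the transcription step as I read it; schema and wording invented.
trial = {"options": ("F", "J"), "choice": "J", "reward": 54}

def transcribe(trial: dict) -> str:
    a, b = trial["options"]
    return (f"You can choose between machine {a} and machine {b}. "
            f"You press <<{trial['choice']}>> and receive {trial['reward']} points.")

print(transcribe(trial))
# You can choose between machine F and machine J. You press <<J>> and receive 54 points.
```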
> simplifications were made where appropriate
What were the criteria for determining when a simplification was appropriate? I could not find any mention of simplification procedures in the supplementary material.
> Finally, we verified that Centaur fails at predicting non-human behaviour.
Is it actually failing at predicting non-human behavior, or is the result leaning on how poorly characterized LLM behavior still is?
Let me put it better: if you recruited participants experienced in exploiting LLMs, would Centaur fare differently? That skill is squarely within the realm of human cognition (e.g. getting an LLM to hallucinate). This question is important.
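Here is a sketch of the probe I have in mind: compare average per-token log-likelihood on transcripts from a plausible human, a random agent, and a human deliberately playing "against" the model. The model below is a tiny stand-in (a real test would load Centaur's actual weights) and the transcripts are invented:

```python
# Probe sketch: does the model separate human from non-human transcripts for
# the right reason? Placeholder model and invented transcripts throughout.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "sshleifer/tiny-gpt2"  # stand-in; swap in Centaur for the real test
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id).eval()

def avg_logprob(text: str) -> float:
    ids = tok(text, return_tensors="pt").input_ids
    with torch.no_grad():
        loss = model(ids, labels=ids).loss  # mean cross-entropy per token
    return -loss.item()  # higher means better predicted

cases = {
    "human":     "You press <<J>> and receive 54 points. You press <<J>> again.",
    "random":    "You press <<F>>. You press <<J>>. You press <<F>>.",
    "adversary": "You press whichever key you believe the model expects least.",
}
for name, text in cases.items():
    print(f"{name:10s} {avg_logprob(text):.3f}")
```

If the adversarial transcripts end up scoring like the random agent's, the "fails at predicting non-human behavior" claim would need a sharper operationalization.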
alganet•7h ago