frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

A foundation model to predict and capture human cognition

https://www.nature.com/articles/s41586-025-09215-4
2•crocowhile•6h ago

Comments

crocowhile•6h ago
- the only real comparisons they make are with the parental model, llama 3.7-70b without fine tuning. That tells us what is the added value of fine tuning the dataset but it is hardly state of the art. I guess it should be seen as an indication of how difficult it is to stay afloat in this world when you are in academia and the tech barrier stands billion dollars tall.

- Fig 4a shows Centaur clusters more closely to humans than any other model in a cognitive benchmark (CogBench) but also shows that parental llama cluster closer than claude and openAI thinking models which makes me a bit sceptical of using this measurement at all and reinforces the need for further comparisons.

- the fMRI stuff makes no sense and transforms the paper into a propaganda stunt, IMHO.

- At the end of the paper, the comparison with an "informed" Deepseek-R1 (not shown in data?) shows that a modern reasoning model matches Centaur-performance even without any fine tuning.

The latter point is incredibly interesting in principle but it has nothing to do with the claims of the paper. It basically concludes that a modern reasoning model with CoT can outperform out of the box a "simpler" model that was specifically fine-tuned with a huge dataset of human cognitive behaviours. Bigger claim than the title itself basically IMO.

Video conferencing that's simply fun

https://opentalk.eu/en
1•doener•1m ago•0 comments

Shoelace 3.0 WebAwesome Released

https://github.com/shoelace-style/webawesome
1•awesomewebawe•1m ago•0 comments

Show HN: San Francisco Events Map

https://whereshouldwego.co/san-francisco-events-map
1•thelazyorcas•2m ago•0 comments

Big Tech's Mixed Response to U.S. Treasury Sanctions

https://krebsonsecurity.com/2025/07/big-techs-mixed-response-to-u-s-treasury-sanctions/
1•todsacerdoti•2m ago•0 comments

Microsoft: Sequential Diagnosis with Language Models

https://arxiv.org/abs/2506.22405
1•blopker•2m ago•0 comments

Critics blast Microsoft's limited reprieve for those stuck on Windows 10

https://www.theregister.com/2025/07/01/windows_10_updates_criticism/
2•doener•4m ago•0 comments

Hugging Your Cactus

https://www.hugyourcactus.com/2023/01/11/intro-to-hugging-your-cactus/
3•ozgrakkurt•9m ago•0 comments

The Tech Billionaire to Fascist Pipeline [video]

https://www.youtube.com/watch?v=olhu9UhFGl4
2•NdMAND•9m ago•0 comments

One Billion Cells

https://cells.andersmurphy.com/
1•vyrotek•10m ago•0 comments

Daniel Gross Left SSI. Ilya Is the CEO Now

https://twitter.com/ilyasut/status/1940802278979690613
2•tzury•11m ago•0 comments

Practical Retrofitting for Obsolete Devices [pdf]

https://computingwithinlimits.org/2025/papers/limits2025-lafrechoux-retrofitting.pdf
1•_rpxpx•11m ago•0 comments

The Tandy Corporation, Part 2

https://www.abortretry.fail/p/the-tandy-corporation-part-2
1•rbanffy•12m ago•0 comments

Show HN: Epoch – build and backtest trading strategies with plain English

https://www.epoch.trade/
1•aadesola•15m ago•0 comments

Automated news aggregation that generates reports from a multitude of sources

https://github.com/tvanderb/Cronkite
1•Extropy_•15m ago•0 comments

Collected vehicle registration data

https://robbieandrew.github.io/carsales/
1•pier25•17m ago•0 comments

EBAF – eBPF Based Ad Firewall

https://github.com/Kazedaa/eBAF
2•ARob109•19m ago•0 comments

Tell HN: Google says "not vuln", fixes hours later without attribution

6•Eikon•19m ago•2 comments

Encoding Jake Gyllenhaal into one million checkboxes (2024)

https://ednamode.xyz/blogs/2.html
4•chilipepperhott•20m ago•0 comments

xAI data center gets air permit to run 15 turbines, but imaging shows 24 on site

https://arstechnica.com/tech-policy/2025/07/xai-gets-an-air-permit-to-power-its-supercomputer-but-pollution-fears-remain/
2•voxadam•20m ago•0 comments

Build Your Own Color Search Engine

https://lui.ie/guides/semantic-search-colors
2•io84•21m ago•1 comments

Taming agentic engineering – Prompts are code, .json/.md files are state

https://mariozechner.at/posts/2025-06-02-prompts-are-code/
2•badlogic•22m ago•0 comments

Turn Any Website into a Viral Slideshow in Minutes (AI-Powered)

https://shortgen.io
1•Fr1tz1707•22m ago•2 comments

What's So Great About Sudoedit?

http://www.wingtiplabs.com/blog/posts/2013/03/13/sudoedit/
1•thunderbong•23m ago•0 comments

Printing the web: making webpages look good on paper

https://piccalil.li/blog/printing-the-web-making-webpages-look-good-on-paper/
2•PaulHoule•26m ago•1 comments

Show HN: KLogger – Quick CLI to grab all deployment logs from a K8s namespace

https://www.github.com/christensen/klogger
2•christensen143•27m ago•0 comments

Stop Killing Games consumer movement hits some major milestones

https://www.gamingonlinux.com/2025/07/stop-killing-games-consumer-movement-hits-some-major-milestones/
3•doener•27m ago•0 comments

OpenNebula 7.0 "Phoenix" is out

https://opennebula.io/blog/announcements/opennebula-7-0-phoenix-released/
1•francjp•27m ago•1 comments

Customer Service Representative's Perception of the AI Assistant in Call Centers

https://arxiv.org/abs/2507.00513
1•rntn•29m ago•0 comments

Tesla Is in Disarray. Musk Has Moved Beyond Caring About Cars

https://www.wsj.com/business/autos/tesla-elon-musk-robotaxi-robots-95ae80a4
4•bookofjoe•29m ago•2 comments

Reason's Overreach

https://www.overcomingbias.com/p/reasons-overreach
1•sebg•29m ago•0 comments