frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Deficient executive control in transformer attention

https://academic.oup.com/pnasnexus/article/5/6/pgag149/8698838
9•derbOac•1h ago

Comments

ivanvoid•1h ago
this is a nice study but i don’t think it’s actually good argument
quotemstr•1h ago
The first thing I do when I see a paper that claims transformers fundamentally can't do X or Y is to look at the models under test:

> To evaluate generalizability, we conducted tests of GPT-5 (41), Claude Opus 4.1 (42), and Gemini 2.5 Pro (43) from 2025 September

The problem with empirical negative results on LLMs is that they can't rule out that the alleged deficiencies disappear with increased scale and the right fine-tuning. It's like saying my dog has trouble with subject-verb agreement, so meat brains are "fundamentally limited in their capacity for grammar".

I can accept that current LLMs (even latest generation) might exhibit cognitive gaps similar to those we see in humans with deficient executive function, I can't accept these gaps as evidence of fundamental limits of the transformer architecture. LLMs are universal function approximators. Executive function is a function. Yes, yes, it's well-known that transformers have a circuit complexity limit set by layer count and whatever. The limit disappears once you allow for autoregression. Nobody cares about the limits of AI inside a single forward pass.

I have high confidence that with the right sort of training, executive function gaps in LLM can be addressed. I'm not convinced that the problem is the architecture per se.

I Built a Hazel Alternative for Mac with AI Rule Generation

https://medium.com/@jamal_davis/i-built-a-hazel-alternative-for-mac-with-ai-rule-generation-heres...
1•Gotoorbitapp•1m ago•0 comments

Auto-geo – open-source CLI for GEO that helps get your brand mentioned by LLMs

https://github.com/shadowresearch/auto-geo
1•jessen-gibbs•6m ago•1 comments

The Parable of the Talents

https://slatestarcodex.com/2015/01/31/the-parable-of-the-talents/
1•shadow28•9m ago•0 comments

Manus registered my domain in their own name and won't release it

1•AeonCa•9m ago•0 comments

Co-Existence and the End of Co-Intelligence

https://www.oneusefulthing.org/p/co-existence-and-the-end-of-co-intelligence
1•paulpauper•9m ago•0 comments

The Labor Share Fell. So What?

https://marginalrevolution.com/marginalrevolution/2026/06/the-labor-share-fell-so-what.html
1•paulpauper•10m ago•0 comments

I've Solved Content Discovery Conditions May Apply

https://philosophybear.substack.com/p/ive-solved-content-discovery-conditions
1•paulpauper•10m ago•0 comments

Windows 11 sucks slightly less due to June update

https://www.engadget.com/2191909/windows-11-sucks-slightly-less-now-thanks-to-a-june-update/
2•NordStreamYacht•16m ago•0 comments

China-linked operatives used ChatGPT to influence data centers debate

https://www.axios.com/2026/06/10/openai-china-ai-data-center-tariffs-chatgpt
1•alephnerd•18m ago•1 comments

The Social Reckoning (official teaser trailer) [video]

https://www.youtube.com/watch?v=gM4LkaXwGuY
1•Fricken•21m ago•0 comments

WebODM: The Missing Guide

https://webodmbook.com
1•pierotofy•22m ago•0 comments

Plants Could Be Used to Grow Medicines in Space

https://today.ucsd.edu/story/plants-could-be-used-to-grow-medicines-in-space-study-shows
1•gmays•28m ago•0 comments

Starlink: The Constellation, Live

https://sheets.works/data-viz/starlink
1•jonbaer•28m ago•0 comments

Ask HN: Someone started a company same name, same city, industry

1•bxclltkfz•29m ago•0 comments

AdBreak – Jailbreaking the Kindle

https://kindlemodding.org/jailbreaking/AdBreak/
1•nivethan•30m ago•0 comments

The First 100 Wikipedia Pages

https://en.wikipedia.org/wiki/Wikipedia:First_100_pages
2•bananamogul•30m ago•2 comments

Return on Tokens (Rot)

https://www.notboring.co/p/return-on-tokens-rot
1•thedreammachine•31m ago•0 comments

Stop the Surveillance State [pdf]

https://epic.org/wp-content/uploads/2026/04/EPIC-Stop-the-Surveillance-State-5.pdf
1•Cider9986•32m ago•0 comments

Few things in DC are more predictable than Congress renewing surveillance powers

https://xcancel.com/RepThomasMassie/status/2064849178249892220
4•Cider9986•33m ago•0 comments

China's BYD aims to be biggest car firm within five years

https://www.theguardian.com/business/2026/jun/10/china-byd-car-firm-ev-maker-toyota
4•teleforce•33m ago•0 comments

A short history of Cerro Torre, the most controversial mountain

https://www.markhorrell.com/blog/2012/a-short-history-of-cerro-torre/
3•joebig•40m ago•0 comments

Vector memory database remembers everything. That's the issue

https://medium.com/@vektormemory/your-vector-memory-database-remembers-everything-thats-exactly-t...
2•vektormemory•41m ago•0 comments

AWS Graviton5's improved design increases speed and energy efficiency

https://www.amazon.science/blog/graviton5s-improved-design-increases-speed-and-energy-efficiency-...
3•tanelpoder•42m ago•0 comments

I was tired of repos that say they run but don't

https://github.com/rossbuckley1990-hash/bootproof
6•Bucko1•42m ago•1 comments

David Sinclair plans to test whole-body rejuvenation drugs in xPrize competition

https://www.technologyreview.com/2026/06/09/1138545/david-sinclair-plans-to-test-whole-body-rejuv...
3•bookofjoe•43m ago•1 comments

Show HN: Catalyst Maze: biotech trading game

https://rnpv.baybridgebio.com/maze/
3•aaavl2821•43m ago•0 comments

Show HN: Black Hole in Your Ghostty

https://twitter.com/s13k_/status/2064705517264552274
2•s13k•44m ago•0 comments

Shopee cuts jobs in Singapore amid AI push

https://www.channelnewsasia.com/singapore/shopee-job-cuts-layoff-employees-software-engineers-617...
2•kelt•46m ago•0 comments

We Saw What AI Data Centers Don't Want You to See [video][22 Mins]

https://www.youtube.com/watch?v=5p426fSlYH4
2•Bender•48m ago•0 comments

Show HN: Pacman AI – Generated with Claude Fable 5

https://pacmanai.com/
4•javierluraschi•49m ago•1 comments