Deficient executive control in transformer attention

https://academic.oup.com/pnasnexus/article/5/6/pgag149/8698838
16•derbOac•1h ago

Comments

ivanvoid•1h ago
This is a nice study, but I don’t think it’s actually a good argument.
quotemstr•1h ago
The first thing I do when I see a paper that claims transformers fundamentally can't do X or Y is to look at the models under test:

> To evaluate generalizability, we conducted tests of GPT-5 (41), Claude Opus 4.1 (42), and Gemini 2.5 Pro (43) from 2025 September

The problem with empirical negative results on LLMs is that they can't rule out that the alleged deficiencies disappear with increased scale and the right fine-tuning. It's like saying my dog has trouble with subject-verb agreement, so meat brains are "fundamentally limited in their capacity for grammar".

I can accept that current LLMs (even the latest generation) might exhibit cognitive gaps similar to those we see in humans with deficient executive function, but I can't accept these gaps as evidence of fundamental limits of the transformer architecture. LLMs are universal function approximators. Executive function is a function. Yes, yes, it's well known that transformers have a circuit complexity limit set by layer count and whatever. The limit disappears once you allow for autoregression. Nobody cares about the limits of AI inside a single forward pass.

I have high confidence that with the right sort of training, executive function gaps in LLMs can be addressed. I'm not convinced that the problem is the architecture per se.

fc417fc802•29m ago
> they lack an explicit architecture for the executive control of attention found in humans

Deceptive terminology strikes again! The "attention" mechanism in transformers appears (to my understanding at least) to have about as much to do with human attention as the "neurons" in a multi-layer perceptron have to do with biological neurons.

That said, the core premise of building in something that mimics executive function is an intriguing one (which I assume has been explored before but it's not something I'm familiar with).

Cybersecurity researchers aren't happy about the guardrails on Anthropic's Fable

https://techcrunch.com/2026/06/10/cybersecurity-researchers-arent-happy-about-the-guardrails-on-a...
198•speckx•8h ago•185 comments

AI agent runs amok in Fedora and elsewhere

https://lwn.net/SubscriberLink/1077035/c7e7c14fbd60fae9/
72•tanelpoder•1h ago•7 comments

πFS

https://github.com/philipl/pifs
507•helterskelter•6h ago•132 comments

Raspberry Pi 5 – 16GB RAM

https://www.adafruit.com/product/6125?src=raspberrypi
162•akman•5h ago•188 comments

Anthropic requires 30 day data retention for Fable and Mythos

https://support.claude.com/en/articles/15425996-data-retention-practices-for-mythos-class-models
162•lebovic•1d ago•65 comments

A Written Language for the Cherokee So Efficient It Was Thought to Be Magic

https://www.smithsonianmag.com/innovation/man-created-written-language-cherokee-did-efficiently-e...
82•grahambargeron•3h ago•45 comments

I'm Eric Ries, author of "The Lean Startup" and new book "Incorruptible" – AMA

519•eries•10h ago•425 comments

How JPL keeps the 13-year-old Curiosity rover doing science

https://spectrum.ieee.org/curiosity-rover-jpl-mars-science
169•pseudolus•8h ago•35 comments

PgDog is funded and coming to a database near you

https://pgdog.dev/blog/our-funding-announcement
384•levkk•11h ago•197 comments

L'Affaire Siloxane

https://mceglowski.substack.com/p/laffaire-siloxane
153•idlewords•1d ago•23 comments

What is it like to be a bat? (1974) [pdf]

https://www.sas.upenn.edu/~cavitch/pdf-library/Nagel_Bat.pdf
62•shadow28•4h ago•51 comments

GeoLibre 1.0

https://geolibre.app/
149•jonbaer•7h ago•9 comments

Show HN: Extend UI – open-source UI kit for modern document apps

https://www.extend.ai/ui
148•kbyatnal•9h ago•36 comments

World Capitals Voronoi

https://www.jasondavies.com/maps/voronoi/capitals/
36•vincnetas•2d ago•16 comments

Farmer donates land for a park, city sells it for $10M as data center land

https://www.tomshardware.com/tech-industry/farmer-donates-land-for-a-park-city-sells-it-for-data-...
402•maxloh•6h ago•197 comments

Who's the smartest corvid?

https://thetyee.ca/Culture/2026/06/05/Whos-the-Smartest-Corvid/
65•NaOH•1d ago•54 comments

Show HN: HelixDB – A graph database built on object storage

https://github.com/HelixDB/helix-db/tree/main
89•GeorgeCurtis•9h ago•30 comments

Building an HTML-first site doubled our users overnight

https://mohkohn.co.uk/writing/html-first/
996•edent•12h ago•453 comments

Unix GC Remastered

https://mohandacherir.github.io/Qdiv7/posts/unix_new_gc/
11•mananaysiempre•2h ago•1 comment

Claude Desktop spawns 1.8 GB Hyper-V VM on every launch, even for chat-only use

https://github.com/anthropics/claude-code/issues/29045
339•tonyrice•8h ago•240 comments

Apache Burr: Build reliable AI agents and applications

https://burr.apache.org/
173•anhldbk•10h ago•90 comments

Authentication issues related to API requests

https://www.githubstatus.com/incidents/fcj3088jg1wx
153•Multicomp•10h ago•30 comments

Why are there so many canines in fine art?

https://www.theatlantic.com/magazine/2026/07/the-dogs-gaze-thomas-w-laqueur/687312/
14•prismatic•3d ago•10 comments

Computer Lessons

https://technicshistory.com/2026/06/06/computer-lessons/
6•cfmcdonald•4d ago•0 comments

All 9,300 Japanese train stations, animated by the year they opened (1872–2026)

https://jivx.com/eki
195•momentmaker•13h ago•67 comments

Anthropic's model naming, extrapolated

https://samwilkinson.io/posts/2026-06-09-anthropics-model-naming-extrapolated
281•sammycdubs•6h ago•78 comments

Smudging the game disc to make speedrunning 'SpongeBob' faster

https://www.inverse.com/input/gaming/the-dirty-secret-that-makes-speedrunning-on-spongebob-a-lot-...
73•pncnmnp•23h ago•42 comments

A €0.01 bank transfer could compromise a banking AI agent

https://blue41.com/blog/how-we-helped-bunq-secure-their-financial-ai-assistant/
165•tvissers•11h ago•150 comments

Policy on the AI Exponential

https://darioamodei.com/post/policy-on-the-ai-exponential
132•yjp20•6h ago•190 comments