frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Counting Down Capabilities to AGI

https://shash42.substack.com/p/counting-down-capabilities-to-agi
1•shash42•6h ago

Comments

shash42•6h ago
This is a living document where I'll track my evolving thoughts on what remains on the path to building generally-intelligent agents. Why does this matter? Three compelling reasons:

Top-down view: AI research papers (and product releases) move bottom-up, starting from what we have right now and incrementally improving, in the hope we eventually converge to the end-goal. This is good, that’s how concrete progress happens. At the same time, to direct our efforts, it is important to have a top-down view of what we have achieved, and what are the remaining bottlenecks towards the end-goal. Besides, known unknowns are better than unknown unknowns.

Research prioritisation: I want this post to serve as a personal compass, reminding me which capabilities I believe are most critical for achieving generally intelligent agents—capabilities we haven't yet figured out. I suspect companies have internal roadmaps for this, but it’s good to also discuss this in the open.

Forecasting AI Progress: Recently, there is much debate about the pace of AI advancement, and for good measure—this question deserves deep consideration. Generally-intelligent agents will be transformative, requiring both policymakers and society to prepare accordingly. Unfortunately, I think AI progress is NOT a smooth exponential that we can extrapolate to make predictions. Instead, the field moves by shattering one (or more) wall(s) every time a new capability gets unlocked. These breakthroughs present themselves as large increases in benchmark performance in a short period of time, but the absolute performance jump on a benchmark provides little information about when the next breakthrough will occur. This is because, for any given capability, it is hard to predict when we will know how to make a model learn it. But it’s still useful to know what capabilities are important and what kinds of breakthroughs are needed to achieve them, so we can form our own views about when to expect a capability. This is why this post is structured as a countdown of capabilities, which as we build out, will get us to “AGI” as I think about it.

*Framework* To be able to work backwards from the end-goal, I think it’s important to use accurate nomenclature to intuitively define the end-goal. This is why I’m using the term generally-intelligent agents. I think it encapsulates the three qualities we want from “AGI”:

Generality: Be useful for as many tasks and fields as possible.

Intelligence: Learn new skills from as few experiences as possible

Agency: Planning and performing a long chain of actions.

Click and read the blog for:

Introduction

…. Framework

…. AI 2024 - Generality of Knowledge

Part I on The Frontier: General Agents

…. Reasoning: Algorithmic vs Bayesian

…. Information Seeking

…. Tool-use

…. Towards year-long action horizons

…. …. Long-horizon Input: The Need for Memory

…. …. Long-horizon Output

…. Multi-agent systems

Part II on The Future: Generally-Intelligent Agents [TBA]

Continuous Glucose Monitoring

https://www.imperialviolet.org/2025/06/29/cgm.html
2•zdw•14m ago•0 comments

WebGL2 Fundamentals

https://webgl2fundamentals.org/
2•beeflet•15m ago•0 comments

Use AI to build ecosystems, not just products

https://www.atelierlogos.studio/blog/2025-06-28-use-ai-to-build-ecosystems
2•jdbohrman•19m ago•1 comments

Land Values and Affordability

https://www.reillywood.com/blog/land-value/
2•luu•27m ago•0 comments

English and a Translator's Shame

https://thewire.in/culture/english-and-a-translators-shame
2•kawera•29m ago•0 comments

My home servers are not a homelab

https://blog.nradk.com/posts/homelab/
2•nradk•30m ago•1 comments

Claude-Code-Proxy

https://github.com/seifghazi/claude-code-proxy
3•handfuloflight•31m ago•0 comments

New Ensō – first public beta

https://untested.sonnet.io/notes/new-enso-first-public-beta/
2•wonger_•35m ago•0 comments

Army Field Manual FM 3-0 – Operations (October 2022) [pdf]

https://irp.fas.org/doddir/army/fm3-0.pdf
3•babelfish•36m ago•0 comments

Why Is Part of Alameda Island in San Francisco?

https://www.kqed.org/news/11702058/why-is-part-of-alameda-island-in-san-francisco
2•CalChris•36m ago•0 comments

Reflections on agentic coding: Magic or Mirage

https://www.async-let.com/blog/agentic-reflections/
2•arey_abhishek•44m ago•0 comments

What "One Big Beautiful Bill Act" Means for Your R&D Tax Credits

https://exactera.com/resources/what-one-big-beautiful-bill-act-means-for-your-rd-tax-credits/
3•antimora•46m ago•0 comments

She Got a Permit for Her Chickens. Now the City Is Fining Her $80k

https://reason.com/2025/06/28/she-got-a-permit-for-her-chickens-now-the-city-is-fining-her-80000/
30•fortran77•51m ago•29 comments

Project Farm: Digital calipers review

https://www.youtube.com/watch?v=z5KtKAee0jw
3•burnt-resistor•54m ago•0 comments

Context Engineering: A first-principles handbook with the latest research

https://github.com/davidkimai/Context-Engineering
5•davidkimai•1h ago•1 comments

Ask HN: Is the header CSS broken for you?

9•LorenDB•1h ago•1 comments

EĿlipsis, a Language Independent Preprocessor

https://gustedt.gitlabpages.inria.fr/ellipsis/index.html
4•faresahmed•1h ago•0 comments

New Zealand Approved Psychedelic Therapy. He's the Only Doctor Who Can Do It

https://www.nytimes.com/2025/06/26/world/asia/new-zealand-psilocybin-magic-mushrooms-therapy.html
4•bookofjoe•1h ago•2 comments

The Chan-Zuckerbergs stopped funding social causes

https://www.washingtonpost.com/technology/2025/06/29/mark-zuckerberg-priscilla-chan-school-closure/
12•1vuio0pswjnm7•1h ago•6 comments

On Wanting to Believe

https://www.carsengrote.com/2025/06/on-wanting-to-believe.html
3•dante44•1h ago•0 comments

Largest Digital Camera Snaps Its First Photos of the Universe

https://www.wsj.com/science/space-astronomy/worlds-largest-digital-camera-snaps-its-first-photos-of-the-universe-68099904
2•gmays•1h ago•0 comments

The Mysterious Billionaire Behind the OnlyFans Porn Empire

https://www.wsj.com/business/media/only-fans-leonid-radvinsky-profile-706c914d
3•Geekette•1h ago•2 comments

AI-Generated Psych-Rock Band Rack Up Spotify Streams

https://www.stereogum.com/2313501/ai-generated-psych-rock-band-the-velvet-sundown-rack-up-hundreds-of-thousands-of-spotify-streams/news/
3•TrackerFF•1h ago•0 comments

Show HN: Free AI Thumbnail Tester (based on real YouTube data)

https://www.aithumbnail.so/tools/thumbnail-tester
2•sachou•1h ago•0 comments

Use keyword-only arguments in Python dataclasses

https://chipx86.blog/2025/06/29/tip-use-keyword-only-arguments-in-python-dataclasses/
3•Bogdanp•1h ago•0 comments

Thousands in Norway told they won up to millions in lottery error

https://www.bbc.com/news/articles/c15wn70v7z8o
7•ednite•1h ago•1 comments

OpenAI reportedly 'recalibrating' compensation in response to Meta hires

https://techcrunch.com/2025/06/29/openai-reportedly-recalibrating-compensation-in-response-to-meta-hires/
4•ednite•1h ago•2 comments

Orange Pi Nova Teased with Loongson 2K3000 as Loongson Expands Product Line

https://linuxgizmos.com/orange-pi-nova-teased-with-loongson-2k3000-as-loongson-expands-product-line/
6•chsum•1h ago•0 comments

How to Do Autocomplete

https://bonsai.io/blog/how-to-really-do-autocomplete/
4•softwaredoug•1h ago•0 comments

Why Extreme Couponers Have Given Up on Coupons

https://www.wsj.com/personal-finance/extreme-coupon-prices-savings-e7604515
2•lxm•2h ago•0 comments