Levels of Agentic Engineering

https://www.bassimeledath.com/blog/levels-of-agentic-engineering
34•bombastic311•11h ago

Comments

sjkoelle•1h ago
Oceania has always been context engineering. It's been interesting to see this prioritized in the zeitgeist over the last 6 months, after the earlier "long context" zeitgeist.
smy20011•1h ago
I would not put it into a ladder. A ladder implies that the higher the rank, the better. However, you want to choose the best solution for your needs.
efsavage•1h ago
Yegge's list resonated a little more closely with my progression to a clumsy L8.

I think eventually 4-8 will be collapsed behind a more capable layer that can handle this stuff on its own. Maybe I tinker with MCP settings and granular control to min-max the process, but for the most part I shouldn't have to worry about it any more than I worry about how many threads my compiler is using.

lherron•1h ago
I was surprised the author didn’t mention Yegge’s list (or maybe I missed it in my skim).
ramesh31•27m ago
>"Yegge's list resonated a little more closely with my progression to a clumsy L8."

I thought level 8 was a joke until Claude Code agent teams. Now I can't even imagine being limited to working with a single agent. We will be coordinating teams of hundreds by year's end.

politelemon•1h ago
These are levels of gatekeeping. The items are barely related to each other. Lists like these will only promote toxicity; you should be using the tools and techniques that solve your problems and fit your comfort levels.
eikenberry•1h ago
In my opinion there are 2 levels: human writes the code with AI assist, or AI writes the code with human assist; centaur or reverse-centaur. But this article tries to focus on the evolution of the ideas and mistakenly terms them levels (indicating a skill ladder, as other commenters have noted) when they are more like stages that the AI ecosystem has evolved through. The article reads better if you think of it that way.
dist-epoch•38m ago
There is another level - AI writes the code with AI assist.
eikenberry•5m ago
That is just another level of reverse centaur and will eventually have a human ass attached to it.
mzg•1h ago
As a lowly level 2 who remains skeptical of these software “dark factories” described at the top of this ladder, what I don’t understand is this:

If software engineering is enough of a solved problem that you can delegate it entirely to LLM agents, what part of it remains context-specific enough that it can’t be better solved by a general-purpose software factory product? In other words, if you’re a company that is using LLMs to develop non-AI software, and you’ve built a sufficient factory to generate that software, why don’t you start selling the factory instead of whatever you were selling before? It has a much higher TAM (all of software).

pydry•1h ago
I have the same question about people who sell "get rich with real estate" seminars.
hakanderyal•42m ago
We are not there yet. While there are teams applying dark factory models to specific domains with self-reported success, it's yet to be proven, or generalizable enough to apply everywhere.
dist-epoch•39m ago
Codex and Claude Code are these (proto)factories you talk about - almost every programmer uses them now.

And when they become fully dark factories, yes, what will happen is that a LOT of software companies will just disappear; they will be disintermediated by Codex/Claude Code.

glhast•15m ago
Also a measly level 2er. I'm curious what kind of project truly needs an autonomous agent team Ralph looping out 10,000 LOCs per hour? Seems like harness-maxxing is a competitive pursuit in its own right existing outside the task of delivering software to customers.

Feels like the K8s cult: overly focused on the cleverness of _how_ something is built versus _what_ is being built.

measurablefunc•1h ago
What level is numeric patterns that evolve according to a sequence of arithmetic operations?
jjmarr•1h ago
I coded a level 8 orchestration layer in CI for code review, two months before Claude launched theirs.

It's very powerful and agents can create dynamic microbenchmarks and evaluate what data structure to use for optimal performance, among other things.

I also have validation layers that trim hallucinations with handwritten linters.

I'd love to find people to network with. Right now this is a side project at work on top of writing test coverage for a factory. I don't have anyone to talk about this stuff with so it's sad when I see blog posts talking about "hype".
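
For readers unfamiliar with the idea, the kind of dynamic microbenchmark mentioned above (time a few candidate data structures on a representative workload, then pick the fastest) might look roughly like this. It is only a minimal sketch in Python: the workload and the names `drain_list`, `drain_deque`, and `pick_fastest` are hypothetical illustrations, not taken from the commenter's system.

```python
import timeit
from collections import deque


def drain_list(n):
    """Build a list of n items and pop them all from the front."""
    items = list(range(n))
    while items:
        items.pop(0)


def drain_deque(n):
    """Build a deque of n items and pop them all from the front."""
    items = deque(range(n))
    while items:
        items.popleft()


def pick_fastest(candidates, n=2000, repeats=5):
    """Time each candidate on the same workload.

    Returns (name_of_fastest, {name: best_time_in_seconds}).
    The minimum over several repeats is used to reduce noise.
    """
    timings = {}
    for name, fn in candidates.items():
        runs = timeit.repeat(lambda fn=fn: fn(n), number=10, repeat=repeats)
        timings[name] = min(runs)
    best = min(timings, key=timings.get)
    return best, timings


if __name__ == "__main__":
    best, timings = pick_fastest({"list": drain_list, "deque": drain_deque})
    print(best, timings)
```

Timings depend on the machine and the workload size, so a result like this is only meaningful for the workload it was measured on.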

jessmartin•4m ago
I got my own level 8 factory working in the last few days and it’s been exhilarating. Mine is based on OpenAI’s Symphony[1], ported to TypeScript.

Would be happy to swap war stories.

<myhnusername>@gmail.com

ftkftk•1h ago
I prefer Dan Shapiro's 5 level analogy (based on car autonomy levels) because it makes for a cleaner maturity model when discussing with people who are not as deeply immersed in the current state of the art. But there are some good overall insights in this piece, and there are enough breadcrumbs to lead to further exploration, which I appreciate. I think levels 3 and 4 should be collapsed, and the real magic starts to happen after combining 5 and 6; maybe they should be merged as well.
jackby03•39m ago
Good taxonomy. One thing missing from most discussions at these levels is how agents discover project context — most tools still rely on vendor-specific files (CLAUDE.md, .cursorrules). Would love to see standardization at that layer too.
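
To make the discovery problem concrete, here is a minimal sketch of one way an agent could look for the vendor-specific files named above. The walk-up-the-directory-tree strategy and the function name `find_context_files` are assumptions for illustration; this is not how any particular tool is documented to behave.

```python
from pathlib import Path

# File names taken from the comment above; a real tool may use others.
CONTEXT_FILES = ("CLAUDE.md", ".cursorrules")


def find_context_files(start, names=CONTEXT_FILES):
    """Collect context files from `start` up to the filesystem root.

    Results are ordered nearest directory first, so a caller can let
    files closer to the working directory take precedence.
    """
    start = Path(start).resolve()
    found = []
    for directory in (start, *start.parents):
        for name in names:
            candidate = directory / name
            if candidate.is_file():
                found.append(candidate)
    return found
```

A standardized layer would replace the hard-coded `CONTEXT_FILES` tuple with a single agreed-upon name or lookup rule.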
nimasadri11•39m ago
I really like your post and agree with most things. The one thing I am not fully sure about:

> Look at your app, describe a sequence of changes out loud, and watch them happen in front of you.

The problem a lot of the time is that either you don't know what you want, or you can't communicate it (and usually you can't communicate it properly because you don't know exactly what you want). I think this is going to be the bottleneck very soon (for some people, it already is). I am curious what your thoughts are on this. Where do you see that going, and how do you think we can prepare for and address it? Or do you not see that as an issue?

ramesh31•32m ago
>(Re: level 8) "...I honestly don't think the models are ready for this level of autonomy for most tasks. And even if they were smart enough, they're still too slow and too token-hungry for it to be economical outside of moonshot projects like compilers and browser builds (impressive, but far from clean)."

This is increasingly untrue with Opus 4.6. Claude Max gives you enough tokens to run ~5-10 agents continuously, and I'm doing all of my work with agent teams now. Token usage is up 10x or more, but the results are infinitely better and faster. Multi-agent team orchestration will be to 2026 what agents were to 2025. Much of the OP article feels 3-6 months behind the times.

C0ldSmi1e•12m ago
One of the best articles I've read recently.
