Deep Agents

https://blog.langchain.com/deep-agents/

130•saikatsg•6mo ago

Comments

seabass•6mo ago

Is there more info on how the todo list tool is a noop? How exactly does that work?

JyB•6mo ago

Same question. I don’t understand what they mean by that. It obviously seem pretty central to how Claude Code is so effective.

kjhughes•6mo ago

I thought they meant that it's a noop as a tool in the sense that it takes no external action. It seems nonetheless effective as a means of organizing reasoning and expressing status along the way.

kobstrtr•6mo ago

just for chain of thought TodoWrite would be sufficient as a tool wouldn‘t it?

kobstrtr•6mo ago

if it was a noop, I feel like there wouldn‘t be a need to have TodoRead as a tool, since TodoWrite exists. Would love to get more info on whether this is really a noop

aabhay•6mo ago

My guess is the todo list is carried across “compress” points where the agent summarizes and restarts with fresh context + the summary

ttul•6mo ago

The context will contain a record that the tool call took place. The todo list is never actually fetched.

crawshaw•6mo ago

If you want to see it in action in some code, our agent Sketch uses a TODO list tool: https://github.com/boldsoftware/sketch/blob/main/claudetool/...

It is relatively easy to get the agent to use it, most of the work for us is surfacing it in the UI.

TrainedMonkey•6mo ago

My understanding is that it is basically a prompt about making a TODO list.

lmeyerov•6mo ago

i think he means it's 'just' a thin concat

most useful prompt stuff seems 'simple' to implement ultimately, so it's more impressive to me that such a simple idea of TODO goes so far!

(agent frameworks ARE hard in serious settings, don't get me wrong, just for other reasons. ex: getting the right mix & setup devilishly hard, as are infra layers below like multitenacy, multithreading, streaming, cancellation, etc.)

re: the TODO list, strong agree on criticality. it's flipped how we do louie.ai for stuff like speed running security log analysis competitions. super useful for preventing CoT from going off the rails after only a few turns.

a fun 'aha' for me there: nested todo's are great (A.2.i...), and easy for the LLM b/c they're linearized anyways

You can see how we replace claude code's for our own internal vibe coding usage, which helps with claude's constant compactions as a heavy user (= assuages issue of the ticking timer for a lobotomy): https://github.com/graphistry/louie-py/blob/main/ai/prompts/...

shmatt•6mo ago

At least from what I noticed - Junie from Jetbrains was the first to use a very high quality to do list, and it quickly became my favorite

I haven't used it since it became paid, but back then Junie was slow and thoughtful, while Cursor was constantly re-writing files that worked fine, and Claude was somewhere in the middle

tough•6mo ago

Cursor added a UI for todo list and encourages it's agent to use it (its great ux, but you can't really see a file of it)

kiro from amazon does both tasks (in tasks.md) and specs.

Too many tools soon, choose what works for you

manishsharan•6mo ago

I have been following along the code in this repo. https://github.com/ghuntley/claude-code-source-code-deobfusc...

The author has done a pretty good job of reverse engineering Claude Code and explaining the architecture.

update: changed the link to a better repo

cjonas•6mo ago

Can you explain what I'm looking at. Just appears to be a massive readme with a bunch of system instructions?

manishsharan•6mo ago

My apologies

This is a better repo to learn about Claude code internals

https://github.com/ghuntley/claude-code-source-code-deobfusc...

jayshah5696•6mo ago

sub agents adding isolating context is the real deal rest is just langgraph react agent

PantaloonFlames•6mo ago

This is valuable but not really a novel idea.

manx•6mo ago

I'm also in the process of creating a general purpose agent cli+library in rust: https://github.com/fdietze/alors

Still work in progress, but I'm already using it to code itself. Feedback welcome.

_andrei_•6mo ago

ah, deep agents = agents with planning + agents as tools => so regular agents.

i hate how LangChain has always tried to make things that are simple seem very complicated, and all the unnecessary new terminology and concepts they've pushed, but whatever sells LangSmith.

itsafarqueue•6mo ago

I used to consult on this type of thing. I’m not entirely convinced this is what’s happening here but it’s close enough, and is a well trod playbook - dress up the mundane in theatre and performance, create a taxonomy that’s specific to you, then sell access to the thing.

Next step is to try flood the SEO zone with your thing. It’s great if you can piggyback other key terms (deep *, agents) and.. I’m already bored writing this up it’s so [what’s the word for sheer resigned exhaustion at the capitalist corporate soul kill that is this type of work]

antoniojtorres•6mo ago

This is on the cynical side, for sure.

web-cowboy•6mo ago

As I think through this, I agree with others mentioning that "deep agents" still sounds a lot like agents+tools. I guess the takeaway for me is:

1. You need a good LLM for base knowledge.

2. You need a good system prompt to guide/focus the LLM (create an agent).

3. If you need some functionality that doesn't make any decisions, create a tool.

4. If the agent + tools flows get too wily, break it down into smaller domains by spawning sub agents with focused prompts and (less?) tools.

everforward•6mo ago

> 4. If the agent + tools flows get too wily, break it down into smaller domains by spawning sub agents with focused prompts and (less?) tools.

I think where this ultimately goes is a "coordinator" sort of model where the top-level agent primarily decides what needs to happen next and which agent is most equipped to handle that task. This could potentially happen in a recursive fashion (e.g. an agent for each product the company makes, that agent can dispatch to a "frontend" agent or a "backend" agent, etc).

That allows the agents that actually "do things" to maintain a limited context and set of tools, and the managing agents only have to maintain context on what their sub-agents can do.

storus•6mo ago

"I hacked on an open source package (deepagents) over the weekend." Thanks but no thanks.

yawnxyz•6mo ago

most of these agents are still fundamentally simple while loops; it shouldn't really take longer than a weekend to get one built

SCUSKU•6mo ago

Hacker hacks on project and gets posted to Hacker News. Commenter on Hacker News: No thanks, no hacking please.

storus•6mo ago

It's on langchain's official page, a framework that looks like it was hacked over the weekend by a fresh grad that brought a lot of pain to the agentic development, and this just feels like piling up more pain on it.

epolanski•6mo ago

Some of the biggest software in use today was hacked over few days in its first versions. Git is a famous one.

owebmaster•6mo ago

Absolutely not. Linus had git in his brain and it took a few days to write a first version but multiple years of learning

storus•6mo ago

Let's pretend BitKeeper didn't exist.

Also, that's one way to misinterpret what was said. I acknowledge there are some well-designed pieces of software that were hacked quickly. LangChain is not one of them, it's a mess, and hacking another module quickly is likely going to extend that mess, which was the reason for my comment.

hwchase17•6mo ago

Author here!

Main takeaways (which I'd love feedback on) are:

There are series of agents recently (claude code, manus, deep research) which execute tasks over longer time horizons particular well

At the core of it, it's just an LLM running in a loop calling tools... but when you try to do this naively (or at least, when I try to do it) the LLM struggles with doing long/complex tasks

So how do these other agents accomplish it?

These agents all do similar things, namely:

1. They use a planning tool

2. They use sub agents

3. They use a file system like thing to offload context

4. They have a detailed system prompt (prompting isn't dead!)

I don't think any of these things individually is novel... but I also think that they are not super common place to do when building agents. And the combination of them is (I think) an interesting insight!

Would love any feedback :)

gsmt•6mo ago

offloading context to a shared file system sounds good but at what point does it start getting messy when multiple subagents start working in parallel

noodletheworld•6mo ago

This matches my expectations.

Now that its increasingly clear that writing MCP servers isn't a winning strategy, people need a new way to jump on the band wagon as easily as possible.

Writing your own agent like geminin and claude code is the new hotness right now.

- low barrier to entry (tick)

- does something reasonably useful (tick)

- doesnt require any deep ai knowledge or skill (tick)

- easy to hype (tick)

Its like “cursor but for X” but easier to ship.

Were going to see a tonne of coding agents built this way, but my intuition is, and what Ive seen so far, is theyre not actually introducing anything novel.

Maybe having a quick start like this is good, because it drops the value of an unambitious direct claude code clone to zero.

I like it.

revskill•6mo ago

I created a simple openagen at https://github.com/revskill10/openagent-cli

sabaimran•6mo ago

Do subagents run in parallel?

revskill•6mo ago

No way because they share filesystem

revskill•6mo ago

Weird. The most interesting part is hidden totally. It is how u manage tool call from parsing to exection.

Rudolf Vrba

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

Wellness Hotels Discovery Application

NASA delays moon rocket launch by a month after fuel leaks during test

Sebastian Galiani on the Marginal Revolution

Ask HN: Are we at the point where software can improve itself?

Binance Gives Trump Family's Crypto Firm a Leg Up

Reverse engineering Chinese 'shit-program' for absolute glory: R/ClaudeCode

Indian Culture

Show HN: Maravel-Framework 10.61 prevents circular dependency

The age of a treacherous, falling dollar

Ask HN: AI Generated Diagrams

Microsoft Account bugs locked me out of Notepad – are Thin Clients ruining PCs?

Show HN: A delightful Mac app to vibe code beautiful iOS apps

Show HN: Gemini Station – A local Chrome extension to organize AI chats

Welfare states build financial markets through social policy design

Market orientation and national homicide rates

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

Matthew Shulman, co-creator of Intellisense, died 2019 March 22

Show HN: SuperLocalMemory – AI memory that stays on your machine, forever free

Show HN: Pyrig – One command to set up a production-ready Python project

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network [pdf]

C and C++ dependencies: don't dream it, be it

Show HN: Vbuckets – Infinite virtual S3 buckets

Open Molten Claw: Post-Eval as a Service

New York Budget Bill Mandates File Scans for 3D Printers

The End of Software as a Business?

Exploring 1,400 reusable skills for AI coding tools

Show HN: A unique twist on Tetris and block puzzle

The logs I never read

Rudolf Vrba

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

Wellness Hotels Discovery Application

NASA delays moon rocket launch by a month after fuel leaks during test

Sebastian Galiani on the Marginal Revolution

Ask HN: Are we at the point where software can improve itself?

Binance Gives Trump Family's Crypto Firm a Leg Up

Reverse engineering Chinese 'shit-program' for absolute glory: R/ClaudeCode

Indian Culture

Show HN: Maravel-Framework 10.61 prevents circular dependency

The age of a treacherous, falling dollar

Ask HN: AI Generated Diagrams

Microsoft Account bugs locked me out of Notepad – are Thin Clients ruining PCs?

Show HN: A delightful Mac app to vibe code beautiful iOS apps

Show HN: Gemini Station – A local Chrome extension to organize AI chats

Welfare states build financial markets through social policy design

Market orientation and national homicide rates

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

Matthew Shulman, co-creator of Intellisense, died 2019 March 22

Show HN: SuperLocalMemory – AI memory that stays on your machine, forever free

Show HN: Pyrig – One command to set up a production-ready Python project

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network [pdf]

C and C++ dependencies: don't dream it, be it

Show HN: Vbuckets – Infinite virtual S3 buckets

Open Molten Claw: Post-Eval as a Service

New York Budget Bill Mandates File Scans for 3D Printers

The End of Software as a Business?

Exploring 1,400 reusable skills for AI coding tools

Show HN: A unique twist on Tetris and block puzzle

The logs I never read

Deep Agents

Comments