Dynamic Workflows in Claude Code

https://claude.com/blog/introducing-dynamic-workflows-in-claude-code

62•mil22•1h ago

Comments

mil22•48m ago

Interesting to note, not sure if this was known publicly before today's blog post:

Rewriting Bun with dynamic workflows

An example of what dynamic workflows can unlock at scale is the recent rewrite of Bun. Jarred Sumner used dynamic workflows to port Bun from Zig to Rust with 99.8% of the existing test suite passing, roughly 750,000 lines of Rust, and eleven days from first commit to merge. One workflow mapped the right Rust lifetime for every struct field in the Zig codebase. The next wrote every .rs file as a behavior-identical port of its .zig counterpart, hundreds of agents working in parallel with two reviewers on each file. A fix loop then drove the build and test suite until both ran clean. After the port landed, an overnight workflow addressed unnecessary data copies and opened a PR for each for final review. While not yet in production, all of this was handled by dynamic workflows. Jarred will be writing about this more in the future.

SkyPuncher•34m ago

I'm extremely skeptical that dynamic workflows had anything to do with this. I've been able to refactor one of the most complicated parts of our code base with similar results.

Mechanical refactors are relatively straight forward for agents.

tra3•35m ago

I say this as someone who's found LLMs incredibly beneficial.

Is this a way to increase token burn?

I thought we covered this with Claude's C compiler. What changed?

mattas•31m ago

My initial reaction was that this is tokenmaxxing disguised as a product.

Deukhoofd•32m ago

I'm going to be honest, this very much reads like an exciting new way to burn up as many tokens as possible. Large amounts of parallel agents that all have all their work double-checked by multiple other agents, and that keeps running for a longer period of time?

I feel like there are more efficient ways to tackle the issues given.

SilverElfin•32m ago

Cloudflare just launched a feature with this same name, just this month. Why would Anthropic choose the same exact name?

https://blog.cloudflare.com/dynamic-workflows/

Also isn’t all of this already easy to do on any of the platforms (include Claude before this and OpenAI too).

CuriouslyC•31m ago

Anthropic is going to price themselves out of code, but still find a nice market providing service to senior management. Their long term play is virtual employees rather than tools for humans.

trjordan•30m ago

It feels like we're far past the point of where having AI do more faster is helpful.

It's telling that they used "rewrite Bun in Rust" as the proof point here. It's cool! But the vast majority of software engineering doesn't start with tens of thousands of tests, where making them pass is the whole job.

In my experience, AI still drifts from what I meant it to do on anything bigger than building a widget. My time is spent suspiciously reviewing output for changes the agent snuck in, or invariants it broke. I talked with a friend recently where the agent broke the test harness badly enough that none of the tests mattered for 3 weeks. They did pass, though, so CI never complained.

There's something at the intersection of context engineering, managing that sloppy pile of markdown plans, and good old fashioning system understanding that's the real bottleneck.

bcherny•28m ago

A few of us from the Claude Code team will be hanging around if anyone has questions! Very excited for this launch -- dynamic workflows have been a game changer for engineering here at Anthropic. Can't wait to hear what you think.

thallavajhula•26m ago

Hi Boris! Thanks for Claude Code.

Is there an example of how y'all use Dynamic Workflows internally that you could share with the rest of us here so that we can mimic something similar?

bcherny•22m ago

Hey, yep. A few things I personally used dynamic workflows for over the last few weeks:

1. Autonomously landed 20+ optimizations to reduce Claude Code's token usage by ~15%

2. Ported tree-sitter, color-diff, yoga-layout, and a number of other WASM and Rust native modules to TypeScript, improving CPU and memory use by 2-10x in the process

3. Made our CI faster, and repeatedly found and fixed flaky tests (with /loop)

4. Migrated from regex-based bash static analysis to tree-sitter, reducing false positive permission prompts by 45%

5. Reduced Claude Agent SDK startup time by 61%, by repeatedly profiling and optimizing the startup path, putting up a number of PRs in the process

6. Shipped 69 code simplification PRs, deleting >10k lines of code

rahkiin•10m ago

You _reduced_ its _efficiency_? Why do you make CC more inefficient?

bcherny•5m ago

Typo! Edited

piyuv•28m ago

“We realized the tech is not as addictive as we’ve hoped so we won’t be able to raise token prices enough to be profitable, so here’s a way to make you consume a lot more tokens without even realizing”

SkyPuncher•27m ago

I don't really get this. At this point, my limiting factor is not how quickly Claude can self-trudge through code. It's whether Claude is going to do the task correctly or not.

I need more mechanisms for controlling long-running sessions and dynamically injecting my thoughts, correction, and nudges rather than faster ways to burn through my tokens without knowing if the results are going to be correct.

jascha_eng•22m ago

yes I agree with this, more granular going back, letting me interrupt where it went off the rails, or even editing file reads myself etc would be lovely. Ingesting parts of other conversations would also be cool!

dude250711•20m ago

I have heard of "token-maxxing" but I have not heard of "correctness-maxxing" or "quality-maxxing".

wrs•18m ago

I think the theoretical answer here is this:

"Agents address the problem from independent angles, other agents try to refute what they found, and the run keeps iterating until the answers converge."

So you will be supplying the "ground truth" (test suite, detailed spec, whatever) and empower an agent to use it to guide the other agents. Currently a lot of people do this sequentially in the form of multiple code-review passes by fresh agent sessions looking at the work of previous sessions.

Adversarial models are a longstanding technique in ML so it makes sense they would try to go this way.

tsunamifury•13m ago

Ground truth is not consensus, it has to be graded against what actually works for the original goal. Plenty of scenarios with AI and Humans can result in consensus around incorrectness.

vld_chk•26m ago

Quite a thing to use Bun rewrite to Rust as example of dynamic workflows, while now it is considered as anti pattern which leads team to stop supporting the tool due to inability to properly understand and navigate 1m vibe coded Rust lines

buryat•23m ago

Not sure I understand how it's different from a team of sub-agents, what's the difference I'm curious?

bcherny•16m ago

There's two main differences:

1. Support for 1-2 OOMs more agents, to do more work in parallel

2. A phased, semi-structured approach where work happens in steps

vblanco•21m ago

I made my own knockoff of that for myself https://github.com/vblanco20-1/AgentLoom (not really usable, just a vibecoded prototype), based on the workflow files found in the Bun repo. Ive been using it but pointed at deepseek flash to do some really large scale stuff. Its a fun way of using agents, and highly useful for tasks like code review to apply some rules, or to find vulnerability candidates. Funny enough, i used it in the same way claude does, vibecoding the workflow scripts and prompts themselves.

I did find it uses tokens like crazy, i migrated Pixel Dungeon (java) to C# as a experiment, and it used almost 2 billion tokens. It was just 20 bucks due to deepseek flash, but i shudder thinking of how much money this uses when run on the real claude API pricing.

jorgeleo•17m ago

curios minds... why to do that port?

vblanco•13m ago

just to test the tech. No real usage other than for the fun of it.

I did port stb_image from C to Jai which i was able to fully verify and harden and that one ill give more use. Im also using the same workflow system to perform agentic translation of a game i work with from english to various other languages, the results are far better than the commercial "human" translation services we tested. And i also use it to fix OCR issues on PDF books im ocr-ing for a data pipeline. This kind of workflow/wide agent swarm system is rather useful for many things where you want to "apply" the same prompts across a whole codebase or just in parallel.

mkw5053•16m ago

Wow, almost like the good old days of /ultrathink are back. Feels simultaneously like just yesterday and a lifetime ago.

2001zhaozhao•16m ago

We really need a way to scope and implement these multi-agent orchestration features that isn't locked in to one provider.

xcskier56•13m ago

Are these “features” just hooks to get people to burn more tokens faster?

I’m at the point where deciding what we should and should not do takes a lot more time than actually doing it. More agents just means running faster in potentially the wrong direction

Claude Opus 4.8

Show HN: Continue? Y/N: A 60-second game about AI agent permission fatigue

Indoor Wi-Fi Roaming with OpenWRT

US's big bet on quantum computing may not be legal

Show HN: Ktx – Open-source executable context layer for data agents

Dynamic Workflows in Claude Code

YouTube to automatically label AI-generated videos

The Permanent Upper Crow

Trivial Pursuits

EU fines Temu €200M for allowing sale of illegal products

Boston and Bermuda

Show HN: Hallucinate – Massively Multiplayer Online Rave

News about Raspberry Pi 6 and Microcontroller Development

Show HN: Open-Source AI Racing Harness

Bttf is a command line datetime Swiss army knife

SimCity 3k in 4k (2025)

I'm Getting into Mesh Networks (Meshtastic, MeshCore, and Reticulum)

Thornton Wilder's Last Play Vanished into Thin Air. Or Did It?

Creusot helps you prove your Rust code is correct

What Apple and Google are doing to push notifications

Show HN: TapToyPia

Using Tailscale with an OrbStack VM on macOS

Disagreement among frontier LLMs on real-world fact-checks

Ruby vs. Java vs. TypeScript: my experience on building a Cowork DOCX plugin

The Ask

Seeing Around Corners Using Smartphone-Grade Lidar

RamAIn (YC W26) Is Hiring

New York passes pied-a-terre tax

Rust (and Slint) on a Jailbroken Kindle

More Whimsical OEIS Sequences

Claude Opus 4.8

Show HN: Continue? Y/N: A 60-second game about AI agent permission fatigue

Indoor Wi-Fi Roaming with OpenWRT

US's big bet on quantum computing may not be legal

Show HN: Ktx – Open-source executable context layer for data agents

Dynamic Workflows in Claude Code

YouTube to automatically label AI-generated videos

The Permanent Upper Crow

Trivial Pursuits

EU fines Temu €200M for allowing sale of illegal products

Boston and Bermuda

Show HN: Hallucinate – Massively Multiplayer Online Rave

News about Raspberry Pi 6 and Microcontroller Development

Show HN: Open-Source AI Racing Harness

Bttf is a command line datetime Swiss army knife

SimCity 3k in 4k (2025)

I'm Getting into Mesh Networks (Meshtastic, MeshCore, and Reticulum)

Thornton Wilder's Last Play Vanished into Thin Air. Or Did It?

Creusot helps you prove your Rust code is correct

What Apple and Google are doing to push notifications

Show HN: TapToyPia

Using Tailscale with an OrbStack VM on macOS

Disagreement among frontier LLMs on real-world fact-checks

Ruby vs. Java vs. TypeScript: my experience on building a Cowork DOCX plugin

The Ask

Seeing Around Corners Using Smartphone-Grade Lidar

RamAIn (YC W26) Is Hiring

New York passes pied-a-terre tax

Rust (and Slint) on a Jailbroken Kindle

More Whimsical OEIS Sequences

Dynamic Workflows in Claude Code

Comments