Show HN: Lessons learned from running Claude Code swarms at scale

7•sermakarevich•8h ago
Some time ago I built a simple app to run swarms of coding agents. I call it fleet (https://news.ycombinator.com/item?id=48256389). It's based on centralized beads with a Python orchestrator and can run any coder (Claude, agy, Codex).

Recently I added a UI to manage the whole agent lifecycle: adding new tasks, monitoring running ones, and a chat interface built on MCP with a centralized SQLite DB. From the UI I can spawn agents to run in any directory, define dependencies on other tasks, and specify which coder/model should do the job.

Today I can run 10–15 agents concurrently. At that scale you burn through limits very fast, so I spent some time investigating where those limits go and how to maximize efficiency. Here are the lessons learned after a few weeks of running the fleet:

- CLAUDE.md is a terrible abstraction. These files load unconditionally, they often contain descriptions irrelevant to the task at hand, and they stack from your working directory upward. The result is wasted tokens and confusion from injecting irrelevant instructions into the session.

- Skills are bad, but not as bad as CLAUDE.md. They use a progressive disclosure approach: only the skill description goes into the session, and Claude loads the full skill text with a tool when it's needed. That's one level better, but it still doesn't let you scale — you can't create 10K skills, as that would eat your entire usable context. Claude recently introduced a skills budget that silently drops less frequently used skills from the session entirely. You can still invoke them in an interactive session, but the model can't invoke them in a background session.

- Some plugins may be installed more than once. During cleanup I found that a few of mine were installed in multiple locations, consuming double the tokens on duplicated instructions.

- Attaching plugins to every session is a bad idea at scale. You want to be precise about which plugins are actually useful and attach them per task.

- Use a hierarchical knowledge base instead of CLAUDE.md / skills / plugins. It lets you benefit from real progressive disclosure: keep your instructions and tool descriptions in it and let Claude navigate through it quickly and cheaply.

- System tools consume ~15K tokens (7% of the session). You can't manage this — they're just attached, and disabling tools doesn't remove them from the context.

- AskUserQuestion isn't available in background sessions. You need to implement your own tool — MCP- or CLI-based — to give `claude -p` the ability to talk to you.

- You become selective about which model handles each task. Decompose work into harder and simpler subtasks so you can route the simpler ones to weaker, cheaper models and save tokens.

- Your context-switching skill improves over time.
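
To make the hierarchical knowledge base point concrete, here is a minimal sketch of how progressive disclosure over a folder tree could work. It is not fleet's actual implementation. The layout (each folder has an `index.md`, and the first non-empty line of any file is its summary) and the names `kb_list`, `kb_read`, and `build_demo_kb` are assumptions made up for illustration. The idea is that the agent gets two cheap tools: one returns one-line summaries of a node's children, the other returns the full text of a single note, so only the notes on the path it actually follows enter the context.

```python
import tempfile
from pathlib import Path


def summary(path: Path) -> str:
    """Return the first non-empty line of a note (or of a folder's index.md)."""
    target = path / "index.md" if path.is_dir() else path
    if not target.exists():
        return ""
    for line in target.read_text(encoding="utf-8").splitlines():
        if line.strip():
            # Drop a leading markdown heading marker, if any.
            return line.strip().lstrip("# ").strip()
    return ""


def _resolve(root: Path, rel: str) -> Path:
    """Resolve rel under root and refuse paths that escape the knowledge base."""
    node = (root / rel).resolve()
    if not node.is_relative_to(root.resolve()):
        raise ValueError("path escapes the knowledge base")
    return node


def kb_list(root: Path, rel: str = "") -> list[str]:
    """Cheap call: one 'name: summary' line per child of the node."""
    rows = []
    for child in sorted(_resolve(root, rel).iterdir()):
        if child.name == "index.md":
            continue  # index.md describes the folder itself, not a child
        name = child.name + ("/" if child.is_dir() else "")
        rows.append(f"{name}: {summary(child)}")
    return rows


def kb_read(root: Path, rel: str) -> str:
    """Expensive call: full text of one note, loaded only when needed."""
    return _resolve(root, rel).read_text(encoding="utf-8")


def build_demo_kb() -> Path:
    """Create a tiny hypothetical knowledge base in a temp directory."""
    root = Path(tempfile.mkdtemp())
    (root / "python").mkdir()
    (root / "python" / "index.md").write_text("# Python conventions\n")
    (root / "python" / "testing.md").write_text("# How to run tests\nUse pytest -q.\n")
    (root / "deploy.md").write_text("Deploy checklist\nStep 1: tag the release.\n")
    return root
```

With the demo tree, `kb_list(root)` returns two short lines, `kb_list(root, "python")` returns one, and only `kb_read(root, "python/testing.md")` pulls in a full note. The cost of a session then scales with the depth of the path the agent walks rather than with the total number of notes.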

Fleet repo: https://github.com/sermakarevich/fleet
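
On the AskUserQuestion point: below is a rough sketch of the kind of question queue a background agent could use to talk to a human through a shared SQLite DB. This is an illustration under my own assumptions, not the code in the repo. The table name, columns, and function names are all hypothetical, and the MCP or CLI wrapper that would expose `ask` to `claude -p` is left out.

```python
import sqlite3
from typing import Optional

# Hypothetical schema: one row per question, answer stays NULL until a human replies.
SCHEMA = """
CREATE TABLE IF NOT EXISTS questions (
    id       INTEGER PRIMARY KEY AUTOINCREMENT,
    task_id  TEXT NOT NULL,
    question TEXT NOT NULL,
    answer   TEXT
)
"""


def open_db(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) the shared database and make sure the table exists."""
    conn = sqlite3.connect(path)
    conn.execute(SCHEMA)
    return conn


def ask(conn: sqlite3.Connection, task_id: str, question: str) -> int:
    """Agent side: record a question and return its id for later polling."""
    cur = conn.execute(
        "INSERT INTO questions (task_id, question) VALUES (?, ?)",
        (task_id, question),
    )
    conn.commit()
    return cur.lastrowid


def pending(conn: sqlite3.Connection) -> list:
    """UI side: all questions that still have no answer, oldest first."""
    return conn.execute(
        "SELECT id, task_id, question FROM questions "
        "WHERE answer IS NULL ORDER BY id"
    ).fetchall()


def reply(conn: sqlite3.Connection, question_id: int, answer: str) -> None:
    """UI side: store the human's answer."""
    conn.execute(
        "UPDATE questions SET answer = ? WHERE id = ?", (answer, question_id)
    )
    conn.commit()


def get_answer(conn: sqlite3.Connection, question_id: int) -> Optional[str]:
    """Agent side: return the answer, or None if nobody has replied yet."""
    row = conn.execute(
        "SELECT answer FROM questions WHERE id = ?", (question_id,)
    ).fetchone()
    return row[0] if row else None
```

In this sketch the agent-side tool would call `ask` and then poll `get_answer` until it is no longer `None`, while the UI lists `pending` questions and calls `reply`. A real setup would use a file path instead of `:memory:` so both processes see the same database.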

Comments

lavaman131•3h ago
This is interesting. Are you thinking that you'll eventually evolve this into a full control plane for agentic swarms?
sermakarevich•2h ago
I've been using it as a full control plane from the CLI for a couple of weeks, and entirely from the UI for the last few days. The UI now lets me manage the full agent lifecycle: creating tasks, monitoring them, responding to agents' questions, unblocking jobs, etc.
