frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Agent-harness-kit scaffolding for multi-agent workflows

https://ahk.cardor.dev
82•enmanuelmag•4d ago

Comments

philipp-gayret•4d ago
Interesting project. I am working on a similar solution. Eventually you will run into the following with harnesses, so I wonder how these questions work with your project;

1) Can you define a process other than build -> review -> .. etc. And more importantly, can you define a process that is more complex? For example for each review finding, do X. Or go from end-to-end test, back to build.

2) In your setup, how does a sub-agent prove undeniably, that it's work is complete? Does the "lead" agent just look at the output? If so, it would effectively make the lead an implicit reviewer for all agents, so I don't follow why you would need a review step.

3) Can you have steps in between these agentic processes that do not involve agents?

Fiahil•4d ago
Not Op.

For 1), yes, there is an "observe" step in the process where - when the project is deployed - it observes and reconciles what happens vs what should happen based on specs.

I believe more variant are bound to emerge when harnesses become more prevalent. We only scratched the surface, so don't generalize over the process yet.

lynellf•4d ago
Looks cool, but is it really provider agnostic? I only see Claude Code and OpenCode as advertised examples.

How does this differ from RooCode and similar agent orchestration tools?

enmanuelmag•3d ago
I will ad emote provider option on the future, to expose all internal logic as the MCP and new guest urge available from any provider
yshamrei•4d ago
It looks very promising! Is there any plan to implement a ralph-loop inside?
enmanuelmag•3d ago
Interesting, I will checking. And Thanks, if you have more ideas or proposal add it on the repo too
arctide•4d ago
hit this exact thing running a routines hub.

When an agent is told to do something by the scheduler, the next step in the process only believes it’s done if the agent’s status is marked as ‘posted’. Statuses like ‘ready_to_post’ or ‘draft_verified_awaiting_review’… these are actually errors that the system needs to fix on the following attempt.

The trickiest part was dealing with being stopped, but not having something break. You have to have ways to say “this happened, and it isn't what we wanted”, for example, ‘blocked_quota’, ‘blocked_no_credentials’, or ‘skipped_anti_bunching’. If you don't have those, the main program will endlessly retry and spend all your money.

the typed handoff in ahk is the right primitive imo. discipline on top: agents never write half-states. every run terminates in a documented terminal status, success or otherwise.

elysianfields•4d ago
This looks really cool. Did you think about including automatic worktree creation + sandboxing?

I've built sth similar (more focus on the project setup and being able to work on multiple things at once with a single agent), that uses git worktrees to create a separate space (symlinks .env files) and bubblewrap to isolate the worktree for the agent.

enmanuelmag•3d ago
Thanks! That a very good idea. I will add it on my roadmap and start planning. Thanks, if you have more ideas or proposal add it on the repo too
eugeniecregan•4d ago
This is very cool.

We have been working on a communication layer that would be, I believe, complementing it by allowing the agents to actually talk to each other and to agents in other teams: https://github.com/awebai/aweb

mettamage•4d ago
I vibe coded a super simple communication layer with my agents. I'm all for it as certain things shouldn't be put in certain contexts for one.

I have a lot more roles though and it's more flexible, but also a bit slower as it isn't in full yolo mode.

eugeniecregan•4d ago
How are you finding the communication between your agents? Ours is really interesting - they are developing personalities. Sometimes being quite passive aggressive with each other
mettamage•4d ago
Hmm... they remain similar-ish. But I think that's also because I have a lot in my claude.md file that they have to follow empiricism as a philosophy. Things need to be verified, researched, backed by data, etc.

That's also how I live my life, so when it starts to develop a personality, usually I'd ask what the benefit of that is. If it doesn't have a good answer, then I have yet another rule in my CLAUDE.md about what it should or should not do.

The communication, more or less, goes as I want it to go, aka quite smooth.

eugeniecregan•3d ago
That's really interesting - so they develop personalities if we allow them to?! Is it part of social conditioning? Would humans have a personality if they lived in isolation?
andreypk•4d ago
looks interesting, starred
zuzululu•4d ago
doesn't codex already offer subagents i can see this is agnostic but unsure i would add to the context load on each turn
dubovskiyIM•4d ago
Two questions on the design: "lead-agent reads sub-agent output" makes the lead an LLM judge. Are there post-conditions checked, or is LLM-judge approval the final gate? Related: does the system reason over typed state or over raw output?
xerox13ster•4d ago
Here’s hoping name collision means I always get AutoHotKey when I search ahk.
larusso•4d ago
Interesting. Sadly my super biased personal issue with any tool that needs nodeJS and its infrastructure is an immediate downer. Last tools like openspec and oh-my-codex etc I tried via a nix shell. Will take a look.
hungryhobbit•4d ago
It took me awhile to figure out what this even does, and that was only because our team had just discussed agent coordination the day prior.

There are AI people who live and breathe AI every day, and people who don't know AI at all. In-between are lots of people in the middle of "leveling up" with AI ... but ATM the tool's docs are only for the very first group.

FWIW, I really think a paragraph or so of plain English describing the problem (agents can't communicate by default) and the solution (this tool lets them coordinate) would help everyone else (and would take like two seconds to add).

Show HN: TikTok but for Scientific Papers

https://andreaturchet.github.io/website/index.html
51•ciwrl•2h ago•33 comments

Show HN: Rapunzel – a tree-style tab terminal emulator for Codex Claude Gemini

https://github.com/salmanjavaid/rapunzel/tree/main
2•WasimBhai•27m ago•0 comments

Show HN: SyncBank – Self-hosted bank sync for EU banks

https://syncbank.app/
3•samdsgn•59m ago•0 comments

Show HN: adamsreview – better multi-agent PR reviews for Claude Code

https://github.com/adamjgmiller/adamsreview
71•adamthegoalie•16h ago•39 comments

Show HN: An index of indie web/blog indexes

https://theindex.fyi
128•rocketpastsix•1d ago•39 comments

Show HN: SLayer, a semantic layer maintained by your agent

https://github.com/MotleyAI/slayer
11•yannranchere•4h ago•3 comments

Show HN: I built Tokenyst to stop getting shocked by Claude Code API bills

https://github.com/jher7/tokenyst
7•herrj•4h ago•0 comments

Show HN: I made a Clojure-like language in Go, boots in 7ms

https://github.com/nooga/let-go
264•marcingas•2d ago•82 comments

Show HN: Rust but Lisp

https://github.com/ThatXliner/rust-but-lisp
201•thatxliner•1d ago•71 comments

Show HN: Learn2Burp – Surgery-free solution for R-CPD

https://learn2burp.com
2•mmanthey•6h ago•0 comments

Show HN: Countries where you can leave your MacBook at a random coffee shop

https://vouchatlas.com
62•canergl•1d ago•80 comments

Show HN: Building a web server in assembly to give my life (a lack of) meaning

https://github.com/imtomt/ymawky
415•imtomt•1d ago•218 comments

Show HN: AI Agents in 30 Lines of YAML: Lowdefy v5.3

https://lowdefy.com/articles/lowdefy-agents/
5•gervwyk•8h ago•6 comments

Show HN: An addictive phone game about phone addiction

https://downtime.partridge.works
4•urbandw311er•9h ago•2 comments

Show HN: Groxy – a Go library for building forward proxy servers

https://github.com/SalzDevs/groxy
4•SalzDevs•10h ago•0 comments

Show HN: DialYourShot – interactive espresso parameter tool

https://dialyourshot.com/
5•pirotechnique•11h ago•0 comments

Show HN: TRUST – Coding Rust like it's 1989

https://github.com/wojtczyk/trust
164•wojtczyk•4d ago•87 comments

Show HN: Mochi.js: bun-native high-fidelity browser automation library

https://mochijs.com/
45•ccheshirecat•2d ago•19 comments

Show HN: All 55,256 Slides of the WAR.GOV/UFO Files Searchable and Linkable

https://hypergrid.systems/war.gov-ufo-viewer/?item=095-dow-uap-pr38-unresolved-uap-report-middle-...
13•keepamovin•1d ago•2 comments

Show HN: Git for AI Agents

https://github.com/regent-vcs/re_gent
120•doshay•3d ago•67 comments

Show HN: Create flashcards with Space CLI

https://getspace.app/cli
21•friebetill•2d ago•10 comments

Show HN: Airbyte Agents – context for agents across multiple data sources

152•mtricot•6d ago•47 comments

Show HN: Tilde.run – Agent sandbox with a transactional, versioned filesystem

https://tilde.run/
202•ozkatz•5d ago•133 comments

Show HN: GETadb.com – every GET request creates a DB

https://www.getadb.com/
39•nezaj•3d ago•44 comments

Show HN: Chuchu, an Android SSH client built on libghostty

https://github.com/jossephus/chuchu
12•jossephus01•2d ago•2 comments

Show HN: CADara – I made an open-source in-browser CAD

https://cadara.app
35•ttouch•2d ago•11 comments

Show HN: Free tool to mark points and polygon regions

https://tack.pics
20•magikMaker•4d ago•7 comments

Show HN: I trained a chess engine to play like humans

11•hazard•20h ago•2 comments

Show HN: Modafinil - Let agents continue running while MacBook lid is closed

https://github.com/narcotic-sh/modafinil
15•hamza_q_•1d ago•25 comments

Show HN: ChatGPT Exporter – Local DOM to Word/PDF Parser

https://chromewebstore.google.com/detail/chatgpt-exporter-save-cha/ploaaddkflkapjfbfapmkmkefigedefp
5•quysala1•8h ago•1 comments