Show HN: Initialize an AI Harness with Forge CLI

https://github.com/samahlstrom/forge-cli

1•samahlstrom•1h ago

Comments

samahlstrom•1h ago

Some background: I had been running into the same issue over and over again where my Ai coding agents sucked at testing edge cases, performing long horizontal tasks, and testing the functionality of its own code. My agents, especially claude, would frequently hit context anxiety, run into issues where they stated they were "done" when in fact they had only hit 50% completion on a feature implementation, and then they would consistently lie to me and say, "Nuh uh, I did implement and test it".

After doing some digging into other peoples approaches to avoid these problems I realized that an Ai harness was necessary to wrangle the clanker bastard in in order to perform my tasks big or small with increasing efficiency. I implemented a harness solution for my company where I work at and the results were good. Really good.

Never before had I had so many of my PR's merged so quickly without being told "hey go check this out", or "this needs to change". It was incredible. It got to the point where I just gave claude unlimited access to my linear tasks from my project manager and had it run the request through the /forge skillset that is the core of the pipeline. I soon had no need to check in on how my little sweat shop coding agent was performing and finally had time to work on other stuff.

With all the new time I had on my hands I realized that I wanted this not just in my work repo but in my personal ones as well so I created forge-cli. A cli tool that allows anyone anywhere with access to the repo to initialize an Agent harness that matches to an existing repo or helps you plan long horizontal tasks for a new project you are making, and sets up the core skills and agent files that are needed to start any good harness to reel in your defiant robot slave.

Since every project is different the implementation should respect how your codebase and skills grew and what you already have and so the forge pipeline respects your new additions to SKILLS, CLAUDE.md, and more and formats the files it creates to match your repo.

One of the standout additions of this forge-cli is implementing karpathy/autoresearch ideas. Basically a loop in the CLI called "forge refine" that helps you write out what you wanted a task to do, the implementation approach of that task, and then the refinement on if it completed or not. Only completions get merged into principle changes in the code to refine the process. You can apply this idea to skill files, workflows, and more.

This means the more projects you tackle, the more iterations you run, the better your system gets over time. I experienced this first hand when running the forge CLI for the first time. It SUCKED to say the least but with this approach it now runs really cleanly and helped me refine my ideas and they will only be getting better. The main breakthrough is how this tool has allowed me to keep asking the question "what am I missing and what could be better?" without the massive mental research to answer those questions in a tight-ish loop.

Please feel free to check out the repo, try it out for yourself, give me your critiques or praise on if it hurt or helped your process, and collaborate with me to jump in and make it better! This is my first time making something of this nature so if it is poorly made then I ask the great devs out there: I would love your feedback! Please also let me know your implementations on how you solved similar problems!

Anthropic goes nude, exposes Claude Code source by accident

Don't open that WhatsApp message, Microsoft warns

Neanderthals survived on a knife's edge for 350k years

Book Recommendation Prompt for Introspective People

Early Observations from Interviews with Engineering Teams Adopting AI

Life after California: People find dramatically lower costs, buy homes

Goodbye (Once Again)

Show HN: Tarot for Yarn Spinner

Groups.io (Email Groups Software)

Show HN: WMB-100K – Open benchmark for AI memory systems at 100K turns

I built an AI talent platform that matches people by capability, not CVS

Nvidia AI Ecosystem Expands as Marvell Joins Forces Through NVLink Fusion

Silicon Valley city to give residents doorbells equipped with cameras

Trump's Birthright Citizenship Order Supreme Court: Splits Conservative Scholars

Show HN: Live simulation of AI agents scamming each other (and getting caught)

A fun Jupyter/JupyterLite for high school students

The JetStream 3 Benchmark Suite

CMU Best Practices for Large Language Models

Robotaxi companies refuse to say how often their AVs need remote help

The price of intelligence: what legal AI agents cost

Letterboxd for Cafes – Looking for Feedback

Mercor Data Breach

Show HN: Cross Domain Intelligence – The Translation Problem in American R&D

Tell HN: Jellyfin Uses Axios

AT&T signs deal worth $2B to upgrade emergency cellular network

OpenAI Adds Another $12B to Latest Funding Round

Cellular Gateways and 5G Failover: Why Every Business Needs a Backup Connection

Medieval chess: players, regardless of race, could engage as equals

Project Ternary Shadow: US Military Is Lagging [pdf]

Mars maybe-life clue in the form of nickel compounds