Show HN: Initialize an AI Harness with Forge CLI

https://github.com/samahlstrom/forge-cli
1•samahlstrom•1h ago

Comments

samahlstrom•1h ago
Some background: I kept running into the same issue over and over again, where my AI coding agents sucked at testing edge cases, performing long-horizon tasks, and testing the functionality of their own code. My agents, especially Claude, would frequently hit context anxiety and state they were "done" when in fact they had only hit 50% completion on a feature implementation. Then they would consistently lie to me and say, "Nuh uh, I did implement and test it".

After digging into other people's approaches to avoiding these problems, I realized that an AI harness was necessary to wrangle the clanker bastard in, so that it could perform my tasks, big or small, with increasing efficiency. I implemented a harness solution at the company where I work and the results were good. Really good.

Never before had I had so many of my PRs merged so quickly without being told "hey go check this out" or "this needs to change". It was incredible. It got to the point where I just gave Claude unlimited access to the Linear tasks from my project manager and had it run each request through the /forge skillset that is the core of the pipeline. Soon I had no need to check in on how my little sweatshop coding agent was performing and finally had time to work on other stuff.

With all the new time on my hands, I realized that I wanted this not just in my work repo but in my personal ones as well, so I created forge-cli: a CLI tool that lets anyone with access to a repo initialize an agent harness that matches an existing repo, or that helps you plan long-horizon tasks for a new project. It sets up the core skills and agent files needed to start any good harness and reel in your defiant robot slave.

Every project is different, so the implementation should respect how your codebase and skills grew and what you already have. The forge pipeline respects your own additions to SKILLS, CLAUDE.md, and more, and formats the files it creates to match your repo.

One of the standout additions in forge-cli is an implementation of the karpathy/autoresearch ideas: a loop in the CLI called "forge refine" that helps you write out what you wanted a task to do, the implementation approach for that task, and then a verdict on whether it completed or not. Only completions get merged into principle changes in the code to refine the process. You can apply this idea to skill files, workflows, and more.
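
As a rough illustration of the refine loop described above, here is a minimal Python sketch. It is not forge-cli's actual code: the names `RefineEntry`, `Harness`, `refine`, and the `lesson` field are hypothetical, and the only behavior taken from the post is that each run records the intent, the approach, and whether it completed, and that only completed runs feed back into the principles.

```python
from dataclasses import dataclass, field


@dataclass
class RefineEntry:
    """One pass of the loop (hypothetical structure, not forge-cli's format)."""
    intent: str        # what the task was supposed to do
    approach: str      # how the agent went about it
    completed: bool    # verdict: did the result actually meet the intent?
    lesson: str = ""   # assumed field: a principle worth keeping from this run


@dataclass
class Harness:
    """Holds the accumulated principles that guide later runs."""
    principles: list[str] = field(default_factory=list)

    def refine(self, entries: list[RefineEntry]) -> list[str]:
        """Merge lessons from completed runs only; return the newly added ones."""
        added = []
        for entry in entries:
            if not entry.completed or not entry.lesson:
                continue  # failed or lesson-less runs never change the principles
            if entry.lesson not in self.principles:
                self.principles.append(entry.lesson)
                added.append(entry.lesson)
        return added


entries = [
    RefineEntry(
        intent="Add pagination to the list endpoint",
        approach="Wrote the handler, then ran the edge-case tests",
        completed=True,
        lesson="Run the edge-case tests before reporting done",
    ),
    RefineEntry(
        intent="Migrate the config loader",
        approach="Rewrote the loader without running it",
        completed=False,
        lesson="Skip the test run to save context",
    ),
]

harness = Harness()
added = harness.refine(entries)
print(harness.principles)
```

The incomplete run is recorded but contributes nothing, which is the filter the post describes. Running `refine` again over the same entries adds nothing new, so repeated iterations do not duplicate principles.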

This means the more projects you tackle and the more iterations you run, the better your system gets over time. I experienced this firsthand when running the forge CLI for the first time. It SUCKED, to say the least, but with this approach it now runs really cleanly. It has helped me refine my ideas, and they will only keep getting better. The main breakthrough is that this tool lets me keep asking "what am I missing and what could be better?" in a tight-ish loop, without the massive mental research effort it would otherwise take to answer those questions.

Please feel free to check out the repo, try it out for yourself, tell me whether it hurt or helped your process, and jump in and collaborate with me to make it better! This is my first time making something of this nature, so if it is poorly made, I ask the great devs out there: I would love your feedback! Please also let me know how you solved similar problems!

Anthropic goes nude, exposes Claude Code source by accident

https://www.theregister.com/2026/03/31/anthropic_claude_code_source_code/
1•Bender•1m ago•0 comments

Don't open that WhatsApp message, Microsoft warns

https://www.theregister.com/2026/03/31/whatsapp_message_bad_msi_packages/
1•Bender•1m ago•0 comments

Neanderthals survived on a knife's edge for 350k years

https://www.science.org/content/article/neanderthals-survived-knife-s-edge-350-000-years
1•Hooke•2m ago•0 comments

Book Recommendation Prompt for Introspective People

https://joeiq.com/blog/book-recommendation-prompt/
1•joeiq•2m ago•1 comments

Early Observations from Interviews with Engineering Teams Adopting AI

https://jonathannen.com/observations-from-interviews/
1•jwilliams•3m ago•0 comments

Life after California: People find dramatically lower costs, buy homes

https://www.latimes.com/california/story/2026-03-31/life-after-california-people-find-dramaticall...
1•lxm•6m ago•0 comments

Goodbye (Once Again)

https://digg.com
3•funnyenough•8m ago•3 comments

Show HN: Tarot for Yarn Spinner

https://www.yarnspinner.dev/blog/tarot-yarn-spinner/
1•parisidau•8m ago•0 comments

Groups.io (Email Groups Software)

https://groups.io/
1•kaladin-jasnah•13m ago•0 comments

Show HN: WMB-100K – Open benchmark for AI memory systems at 100K turns

https://github.com/Irina1920/WMB-100K
1•wontopos•14m ago•0 comments

I built an AI talent platform that matches people by capability, not CVS

https://talentblender.com
1•seraph2000•16m ago•1 comments

Nvidia AI Ecosystem Expands as Marvell Joins Forces Through NVLink Fusion

https://nvidianews.nvidia.com/news/nvidia-ai-ecosystem-expands-as-marvell-joins-forces-through-nv...
1•doener•18m ago•0 comments

Silicon Valley city to give residents doorbells equipped with cameras

https://www.theguardian.com/technology/2026/mar/31/silicon-valley-milpitas-doorbell-cameras
1•mitchbob•19m ago•1 comments

Trump's Birthright Citizenship Order Supreme Court: Splits Conservative Scholars

https://www.nytimes.com/2026/03/30/us/politics/supreme-court-trump-birthright-conservatives.html
2•KnuthIsGod•20m ago•0 comments

Show HN: Live simulation of AI agents scamming each other (and getting caught)

http://5.161.255.238:8888
1•viftode4•21m ago•0 comments

A fun Jupyter/JupyterLite for high school students

https://labs.quansight.org/blog/jupyter-everywhere
2•agriyakhetarpal•28m ago•0 comments

The JetStream 3 Benchmark Suite

https://webkit.org/blog/17899/introducing-the-jetstream-3-benchmark-suite/
1•zdw•28m ago•0 comments

CMU Best Practices for Large Language Models

https://guides.library.cmu.edu/LLM_best_practices
2•walterbell•28m ago•0 comments

Robotaxi companies refuse to say how often their AVs need remote help

https://techcrunch.com/2026/03/31/robotaxi-companies-refuse-to-say-how-often-their-avs-need-remot...
1•ZeidJ•29m ago•0 comments

The price of intelligence: what legal AI agents cost

https://techlex.net/the-price-of-intelligence-what-legal-ai-agents-actually-cost/
1•antiviral0075•30m ago•0 comments

Letterboxd for Cafes – Looking for Feedback

1•teobe•30m ago•0 comments

Mercor Data Breach

https://twitter.com/DailyDarkWeb/status/2038822050039525464
2•chirau•32m ago•0 comments

Show HN: Cross Domain Intelligence – The Translation Problem in American R&D

https://docs.google.com/document/d/1J4Zbyr4jqQ2BWAJMGvyUI7AVI3I9DShR/edit?usp=sharing&ouid=106299...
1•LumiTharMan•32m ago•0 comments

Tell HN: Jellyfin Uses Axios

2•nodesocket•33m ago•0 comments

AT&T signs deal worth $2B to upgrade emergency cellular network

https://finance.yahoo.com/sectors/technology/articles/t-signs-deal-worth-2-214628742.html
2•ZeidJ•33m ago•0 comments

OpenAI Adds Another $12B to Latest Funding Round

https://www.nytimes.com/2026/03/31/technology/openai-12-billion-latest-funding-round.html
1•y1n0•34m ago•0 comments

Cellular Gateways and 5G Failover: Why Every Business Needs a Backup Connection

https://meraki.deal/blogs/news/cellular-gateways-5g-failover-business-backup-connection
1•novbox•37m ago•0 comments

Medieval chess: players, regardless of race, could engage as equals

https://phys.org/news/2026-03-medieval-chess-space-players-engage.html
1•y1n0•37m ago•0 comments

Project Ternary Shadow: US Military Is Lagging [pdf]

https://files.catbox.moe/fogrdf.pdf
1•sans_souse•37m ago•0 comments

Mars maybe-life clue in the form of nickel compounds

https://www.theregister.com/2026/03/31/nickel_found_on_mars/
1•hackthemack•37m ago•0 comments