frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Gambit, an open-source agent harness for building reliable AI agents

https://github.com/bolt-foundry/gambit
22•randall•1h ago
Hey HN!

Wanted to show our open source agent harness called Gambit.

If you’re not familiar, agent harnesses are sort of like an operating system for an agent... they handle tool calling, planning, context window management, and don’t require as much developer orchestration.

Normally you might see an agent orchestration framework pipeline like:

compute -> compute -> compute -> LLM -> compute -> compute -> LLM

we invert this so with an agent harness, it’s more like:

LLM -> LLM -> LLM -> compute -> LLM -> LLM -> compute -> LLM

Essentially you describe each agent in either a self contained markdown file, or as a typescript program. Your root agent can bring in other agents as needed, and we create a typesafe way for you to define the interfaces between those agents. We call these decks.

Agents can call agents, and each agent can be designed with whatever model params make sense for your task.

Additionally, each step of the chain gets automatic evals, we call graders. A grader is another deck type… but it’s designed to evaluate and score conversations (or individual conversation turns).

We also have test agents you can define on a deck-by-deck basis, that are designed to mimic scenarios your agent would face and generate synthetic data for either humans or graders to grade.

Prior to Gambit, we had built an LLM based video editor, and we weren’t happy with the results, which is what brought us down this path of improving inference time LLM quality.

We know it’s missing some obvious parts, but we wanted to get this out there to see how it could help people or start conversations. We’re really happy with how it’s working with some of our early design partners, and we think it’s a way to implement a lot of interesting applications:

- Truly open source agents and assistants, where logic, code, and prompts can be easily shared with the community.

- Rubric based grading to guarantee you (for instance) don’t leak PII accidentally

- Spin up a usable bot in minutes and have Codex or Claude Code use our command line runner / graders to build a first version that is pretty good w/ very little human intervention.

We’ll be around if ya’ll have any questions or thoughts. Thanks for checking us out!

Walkthrough video: https://youtu.be/J_hQ2L_yy60

Comments

franciscomello•56m ago
This looks quite interesting in terms of the architecture. Seems like a fresh take on stuff like Langchain, which at least last time I checked sucks.
randall•8m ago
thx!
sofdao•10m ago
this is awesome

are things like file system baked in?

fan of the design of the system. looks great architecturally

randall•8m ago
omg thank you so much. We're working on the file system stuff, that's an easier lift for us than the initial work, so we wanted to start with the big stuff and work backward. Claude Code and Codex are obviously really great at that stuff, and we'd like to be able to support a lot of that out of the box.

How to Melt ICE

https://www.wintersmiths.com/blogs/all-things-ice/how-does-ice-melt
1•marysminefnuf•3m ago•0 comments

Connect multiple Claude Code agents into one collaborative team

https://openagents.org/showcase
3•snasan•8m ago•1 comments

Wikipedia Inks AI Deals with Microsoft, Meta and Perplexity

https://apnews.com/article/wikipedia-internet-jimmy-wales-50e796d70152d79a2e0708846f84f6d7
1•amiga386•8m ago•1 comments

Show HN: I built a text-based business simulator to replace video courses

https://www.core-mba.pro/
1•Core_Dev•8m ago•0 comments

Can tinkering with plant pores protect crops against drought?

https://knowablemagazine.org/content/article/food-environment/2025/manipulating-stomata-could-hel...
1•PaulHoule•10m ago•0 comments

Kutt.ai – Free AI Video Generator, Text and Image to Video

https://kutt.ai/
1•zuoning•11m ago•1 comments

Hyperfiddle: An automatic front end for any back end function or object

https://github.com/hyperfiddle/hfql
2•filoeleven•11m ago•0 comments

Fast Client-Side Search with Rust and WebAssembly

https://code.visualstudio.com/blogs/2026/01/15/docfind
2•azhenley•12m ago•0 comments

Signal, the secure messaging app: A guide for beginners

https://freedom.press/digisec/blog/signal-beginners/
1•doener•16m ago•0 comments

The future of AI is voice

https://realizeai.substack.com/p/the-future-of-ai-is-voice
1•rafaelmdec•22m ago•0 comments

Profile a Parser Implementation in Rust

https://blog.wybxc.cc/blog/profile-cgrammar/
1•todsacerdoti•23m ago•0 comments

Show HN: Playn a privacy first and fast blog platform

https://playn.blog/
1•bairess•25m ago•0 comments

Show HN: Using Qwen3:1.7B to call itself recursively

https://seanneilan.com/posts/tiny-llm-calls-itself/
1•sneilan1•28m ago•0 comments

Gatekeeping: A Partial History of Cold Fusion

https://philsci-archive.pitt.edu/27902/
1•mathgenius•30m ago•0 comments

Sustainability frameworks: Past, present, and future

https://illuminem.com/illuminemvoices/sustainability-frameworks-past-present-and-future
2•R3G1R•31m ago•0 comments

AI chatbot with Vision AI camera

https://www.seeedstudio.com/SenseCAP-Watcher-XIAOZHI-EN-p-6532.html
1•meilily•32m ago•0 comments

Towards a Science of Scaling Agent Systems

https://arxiv.org/abs/2512.08296
1•handfuloflight•37m ago•0 comments

Show HN: Cursor For Data – Make LLMs and Agents have row-level intelligence

https://github.com/vitalops/datatune
1•abhijithneil•39m ago•0 comments

IAMF Binaural Web Demo

https://aomediacodec.github.io/iamf-tools/web_demo/
1•goodburb•45m ago•0 comments

AI is great for scientists, but perhaps not for science

https://www.programmablemutter.com/p/ai-is-great-for-scientists-perhaps
2•anigbrowl•49m ago•0 comments

Multi-Agent Coding Pipeline: Claude Code and Codex[Open Source]

https://github.com/Z-M-Huang/claude-codex
1•zh_code•52m ago•0 comments

Show HN: Your AI can't lie about what it did anymore – live demo

https://neurop-forge.onrender.com/demo/microsoft
1•LBWasserman•53m ago•2 comments

Show HN: BunKill – npkill alternative built with Bun.js

https://github.com/codingstark-dev/bunkill
1•codingstark•54m ago•1 comments

More Americans are living alone than ever before

https://sherwood.news/personal-finance/more-americans-are-living-alone-than-ever-before/
3•avonmach•56m ago•0 comments

BGP Network Browser

1•hivedc•58m ago•0 comments

Why college students can't do basic math [video]

https://www.youtube.com/watch?v=lFXFZs5Ha40
2•drivebyhooting•58m ago•2 comments

A Datomic entity browser for prod [video]

https://www.youtube.com/watch?v=dmucZkE5gd0
3•codemonkey-zeta•1h ago•0 comments

Ask HN: AI Music Covers in 2026?

2•sexy_seedbox•1h ago•3 comments

Build Cursor Tab in less than 300 lines of Lua

https://jda.bearblog.dev/build-cursor-tab-in-less-than-300-lines-of-lua/
2•jalford14•1h ago•0 comments

The Changing Face of Koreans

https://www.geekinheels.com/the-changing-face-of-koreans/
2•enceladus06•1h ago•0 comments