Show HN: Open Bias – proxy that enforces agent behavior at runtime

https://github.com/open-bias/open-bias/
6•algomaniac•2h ago

Comments

algomaniac•1h ago
Hey HN,

We spent the past year working on evals for teams running AI agents in production. We kept seeing rules that worked fine in evals stop working after a while (or get missed inconsistently). And as teams added more rules, the agent started missing more of them overall.

Evals and observability help, but the long tail finds you in prod anyway. Guardrails are like the side rails on the highway: useful, but you don't want to be hitting them often. We wanted the lane-keeping system that steers the agent as it deviates.

So we built an open-source proxy that helps steer agents, catching and fixing violations before they reach users. Rules live in a RULES.md file (single source of truth for all policies). The thing we care most about is that the engines doing the checking are pluggable:

- Some checks are best as regex or deterministic code

- Some are LLM-as-judge

- Some are existing guardrail systems like Nvidia's NeMo

- Some are state classifiers for workflows

Results from all of them get combined to steer the agent (intervene, block, or shadow). No single evaluator is going to be right for every rule, and we didn't want to pretend otherwise. We're still calibrating a bunch of things: per-engine thresholds, voting across judges, how to aggregate signals across engines, and the classifier that routes rules to the right engine.
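To make the "pluggable engines, combined result" idea concrete, here is a minimal Python sketch. None of these names come from the Open Bias repo: `Action`, `Verdict`, `regex_engine`, `keyword_judge`, and `combine` are hypothetical, the keyword matcher is only a stand-in for an LLM judge, "shadow" is assumed to mean log without acting, and "take the most severe action" is just one possible aggregation rule.

```python
import re
from dataclasses import dataclass
from enum import IntEnum
from typing import Callable, List


class Action(IntEnum):
    # Ordered by severity so that max() picks the strongest response.
    ALLOW = 0
    SHADOW = 1     # assumed meaning: record the violation, do nothing else
    INTERVENE = 2  # steer the agent with a correction
    BLOCK = 3      # stop the response from going out


@dataclass
class Verdict:
    rule: str
    action: Action


# Hypothetical engine interface: any callable from response text to a Verdict.
Engine = Callable[[str], Verdict]


def regex_engine(rule: str, pattern: str, on_match: Action) -> Engine:
    """Deterministic check: fires when the pattern appears in the text."""
    compiled = re.compile(pattern)

    def check(text: str) -> Verdict:
        return Verdict(rule, on_match if compiled.search(text) else Action.ALLOW)

    return check


def keyword_judge(rule: str, banned: List[str], on_match: Action) -> Engine:
    """Stand-in for an LLM-as-judge engine; a real one would call a model."""

    def check(text: str) -> Verdict:
        lowered = text.lower()
        hit = any(phrase in lowered for phrase in banned)
        return Verdict(rule, on_match if hit else Action.ALLOW)

    return check


def combine(verdicts: List[Verdict]) -> Action:
    """Assumed aggregation rule: the most severe action wins."""
    return max((v.action for v in verdicts), default=Action.ALLOW)


ENGINES: List[Engine] = [
    regex_engine("no-card-numbers", r"\b\d{16}\b", Action.BLOCK),
    keyword_judge("no-legal-advice", ["you should sue"], Action.INTERVENE),
]


def evaluate(text: str) -> Action:
    """Run every engine on the text and fold the verdicts into one action."""
    return combine([engine(text) for engine in ENGINES])
```

Because every engine shares one signature, swapping the keyword stand-in for a model call or an external guardrail system would not change `evaluate`. Thresholds and voting across judges would live inside `combine`.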

Checkers for critical violations run sync and block before the response goes out. Non-critical ones run async and the correction lands on the next turn (low latency was essential for teams running voice agents). We're still building the classifier that decides which is which; right now it's specified in the config.
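The sync/async split could look roughly like the sketch below, assuming an `asyncio`-based proxy. This is not how Open Bias is implemented; `Proxy`, `no_refund_promises`, and `tone_check` are hypothetical names, and the checkers are toy string matches.

```python
import asyncio
from typing import Awaitable, Callable, List, Optional

# Hypothetical checker type: returns a correction string, or None if the reply is fine.
Checker = Callable[[str], Awaitable[Optional[str]]]


async def no_refund_promises(reply: str) -> Optional[str]:
    # Treated as critical: must finish before the reply is sent.
    return "Do not promise refunds." if "refund guaranteed" in reply.lower() else None


async def tone_check(reply: str) -> Optional[str]:
    # Treated as non-critical: the sleep simulates a slower judge.
    await asyncio.sleep(0.01)
    return "Keep a polite tone." if "!!!" in reply else None


class Proxy:
    def __init__(self, critical: List[Checker], background: List[Checker]):
        self.critical = critical
        self.background = background
        self.pending: List[asyncio.Task] = []

    async def handle(self, reply: str) -> str:
        # Critical checkers run concurrently, but we wait for all of them.
        results = await asyncio.gather(*(check(reply) for check in self.critical))
        if any(result is not None for result in results):
            return "[blocked]"
        # Non-critical checkers are scheduled and not awaited here,
        # so they add no latency to this turn.
        for check in self.background:
            self.pending.append(asyncio.create_task(check(reply)))
        return reply

    async def corrections_for_next_turn(self) -> List[str]:
        # Collect whatever the background checkers found since the last turn.
        done = await asyncio.gather(*self.pending)
        self.pending.clear()
        return [note for note in done if note is not None]


async def demo():
    proxy = Proxy([no_refund_promises], [tone_check])
    first = await proxy.handle("Sure thing!!!")
    notes = await proxy.corrections_for_next_turn()
    second = await proxy.handle("Refund guaranteed, no questions asked")
    return first, notes, second
```

In `demo`, the first reply goes out unchanged and the tone correction only shows up when the next turn asks for it, while the second reply is stopped before it is sent.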

Our instinct is that it's easier to detect an agent's mistake than to get the agent not to make it in the first place. The main agent carries the full context (system prompt, tools, conversation history, business logic). But the checkers can take a narrow slice, run in parallel, and perform simpler computations (or answer simpler questions). Cheaper, faster, and you can stack them.
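A small sketch of the "narrow slice, run in parallel" point, under the assumption that each check declares which part of the context it needs. The context layout, the selector functions, and the two example rules are all made up for illustration.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, List, Tuple

Context = Dict[str, Any]


def last_reply(ctx: Context) -> str:
    """Slice: only the most recent agent message."""
    return ctx["history"][-1]


def tool_names(ctx: Context) -> List[str]:
    """Slice: only the names of the tools the agent called."""
    return [call["name"] for call in ctx["tool_calls"]]


def mentions_competitor(reply: str) -> bool:
    return "acme" in reply.lower()


def used_forbidden_tool(names: List[str]) -> bool:
    return "delete_account" in names


# Each entry: (rule name, slice selector, check that sees only that slice).
CHECKS: List[Tuple[str, Callable[[Context], Any], Callable[[Any], bool]]] = [
    ("no-competitors", last_reply, mentions_competitor),
    ("no-destructive-tools", tool_names, used_forbidden_tool),
]


def run_checks(ctx: Context) -> Dict[str, bool]:
    """Run every check in parallel; True means the rule was violated."""
    with ThreadPoolExecutor() as pool:
        futures = {
            name: pool.submit(check, select(ctx))
            for name, select, check in CHECKS
        }
        return {name: future.result() for name, future in futures.items()}
```

Neither check ever sees the system prompt or the full history, which is what keeps each one cheap. Adding a rule means adding one more tuple to `CHECKS`.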

Beta, rough in places. Would love feedback, especially from anyone running agents in prod and feeling this. Happy to go deep on the architecture, engines, whatever.

Repo: https://github.com/open-bias/open-bias

JotatD2•34m ago
i love the steering concept! but wouldn't this 2x my token spend? cost is already the bottleneck on the agent workflows we run
algomaniac•32m ago
Thanks! In practice it can be a fraction of that. The judge sees a much smaller slice than the original call, and it's usually a smaller model, which makes it 10-20x cheaper than the model running your agent. In some cases we've also seen folks who were using a frontier model mainly for reliability end up comfortable downgrading their agent model to a cheaper one. We'll add a proper cost breakdown in the docs!
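As a back-of-the-envelope version of that argument: the extra spend scales with the fraction of tokens the judge sees times the judge's relative price. The function name and the 20% slice and three-checker figures below are illustrative assumptions, not measurements; only the 10x price gap comes from the range quoted above.

```python
def relative_overhead(slice_fraction: float, price_ratio: float, num_checkers: int = 1) -> float:
    """Extra cost as a fraction of the agent call's cost.

    slice_fraction: judge input tokens / agent call tokens (assumed)
    price_ratio:    judge price per token / agent price per token
    num_checkers:   how many judges run on each call (assumed)
    """
    return slice_fraction * price_ratio * num_checkers


# Assumed: each judge sees 20% of the tokens, costs 1/10 as much, 3 judges per call.
overhead = relative_overhead(0.2, 1 / 10, num_checkers=3)
print(f"extra spend: {overhead:.0%} of the agent call")
```

With those assumed numbers the checkers add roughly 6% on top of the agent call rather than doubling it. The estimate grows linearly with the slice size and the number of judges.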
