frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open Bias – proxy that enforces agent behavior at runtime

https://github.com/open-bias/open-bias/
6•algomaniac•1h ago

Comments

algomaniac•1h ago
Hey HN,

We spent the past year working on evals for teams running AI agents in production. We kept seeing rules that worked fine in evals stop working after a while (or miss inconsistently). And as teams added more rules, the agent started missing more of them overall.

Evals and observability help, but the long tail finds you in prod anyway. Guardrails are like the side rails on the highway, useful, but you don't want to be hitting them often. We wanted the lane-keeping system that steers the agent as it deviates.

So we built an open-source proxy that helps steer agents, catching and fixing violations before they reach users. Rules live in a RULES.md file (single source of truth for all policies). The thing we care most about is that the engines doing the checking are pluggable:

- Some checks are best as regex or deterministic code

- Some are LLM-as-judge

- Some are existing guardrail systems like Nvidia's NeMo

- Some are state classifiers for workflows

Results from all of them get combined to steer the agent (intervene, block, or shadow). No single evaluator is going to be right for every rule, and we didn't want to pretend otherwise. Still working on a bunch of things, calibrating: per-engine thresholds, voting across judges, how to aggregate signals across engines, and the classifier that routes rules to the right engine.

Checkers for critical violations run sync and block before the response goes out. Non-critical ones run async and the correction lands on the next turn (latency was essential for teams running voice agents). We're still building the classifier for this; right now it's specified in the config.

Our instinct is that it's easier to detect an agent's mistake than to get the agent not to make it in the first place. The main agent carries the full context (system prompt, tools, conversation history, business logic). But the checkers can take a narrow slice, run in parallel, and perform simpler computations (or answer simpler questions). Cheaper, faster, and you can stack them.

Beta, rough in places. Would love feedback, especially from anyone running agents in prod and feeling this. Happy to go deep on the architecture, engines, whatever.

Repo: https://github.com/open-bias/open-bias

JotatD2•5m ago
i love the steering concept! but wouldn't this 2x my token spend? cost is already the bottleneck on the agent workflows we run

'It took nine seconds': Claude AI agent deletes company's database

https://www.the-independent.com/tech/claude-ai-agent-deletes-startup-anthropic-b2966176.html
1•voxadam•1m ago•0 comments

Quality as a Curious System [video]

https://www.ministryoftesting.com/podcasts/into-the-motaverse?wchannelid=b2j0jiwz2n&wmediaid=h9oj...
2•mooreds•2m ago•0 comments

Show HN: A TUI for Markdown view an editing

https://mdee.bkh.dev
1•cloked•3m ago•0 comments

US State Department upgrades AI theft accusations to target China AI companies

https://www.chinapulse.com/data-news/2026/04/27/extraction-and-distillation-us-state-department-u...
3•devonnull•3m ago•0 comments

What is up with UK bridge height signs? [video]

https://www.youtube.com/watch?v=kH_bSvf7EVA
1•fortran77•3m ago•0 comments

Stop Wasting Brain Power

https://yusufaytas.com/stop-wasting-brainpower
11•london_safari•5m ago•0 comments

An AI-generated image is a finalist in Hasselblad Masters 2026 [video]

https://www.youtube.com/shorts/4MeO2wW5nOs
1•xuf•6m ago•1 comments

200 Journalists Applaud the Internet Archive's Role in Preserving Public Record

https://www.savethearchive.com/journalists/
3•doener•6m ago•0 comments

MSF: Israel uses water as a weapon of collective punishment against Palestinians

https://msf.org.uk/article/gaza-israel-uses-water-weapon-collective-punishment-against-palestinians
2•mef51•6m ago•1 comments

PowerDNS Security Updates – Vulnerabilities Explained CVE-2026-33257

https://tux.re/forum/viewtopic.php?t=179
1•Neteam•8m ago•0 comments

Fastembed – Lightweight Python Embedding Library

https://github.com/qdrant/fastembed
1•firasd•9m ago•0 comments

Escaping SaaS Trap: How Global Retailer Solved Data Sovereignty at 85% Less Cost

https://www.mydecisive.ai/case-study/luxury-retailer-data-sovereignty
1•jratkevic•11m ago•0 comments

Microsoft open sources DOS 1.0 – and it's more than the code

https://www.zdnet.com/article/microsoft-open-sources-dos-1-0-much-more-than-the-code/
2•CrankyBear•11m ago•1 comments

OpenAI Hits Back at Growth Fears, Says 'Firing on All Cylinders'

https://www.bloomberg.com/news/articles/2026-04-28/openai-hits-back-at-growth-fears-says-firing-o...
1•SilverElfin•12m ago•1 comments

One Man Broke the Marathon's Holy Grail

https://www.thefp.com/p/how-one-man-broke-the-marathons-holy
1•sorenKaram•14m ago•1 comments

Super fly: can an electric aircraft spark a quiet revolution in New York travel?

https://www.theguardian.com/us-news/2026/apr/28/electric-aircraft-new-york-air-travel
3•billybuckwheat•14m ago•0 comments

Ghostty is leaving GitHub

https://mitchellh.com/writing/ghostty-leaving-github
203•WadeGrimridge•14m ago•26 comments

Computing quantum waves from classical action

https://royalsocietypublishing.org/rspa/article/482/2336/20250413/481461/On-computing-quantum-wav...
1•orbifold•15m ago•1 comments

Lenovo acquires Phoenix Technologies' BIOS business

https://videocardz.com/newz/lenovo-acquires-phoenix-technologies-bios-business
2•LorenDB•17m ago•0 comments

Open-sourced the skills behind 353K views and 5,300 signups for a YC company

https://github.com/ScaleBrick/founder-marketing-skills
3•devgod•18m ago•1 comments

Lidl stirs up mobile market with discount phone plans

https://www.ft.com/content/52b9ba96-e86c-4ddb-ac1f-726051420fd2
1•doener•19m ago•0 comments

Why Everyone Is Obsessed with Claude Code

https://www.youtube.com/watch?v=gzt52Trk9w0
1•pythonic_hell•20m ago•0 comments

Two years without cookies on the site, here's where we ended up

https://blog.sentry.io/two-years-without-cookies-on-the-site/
3•annjose•21m ago•0 comments

Why Multi Window Multi Burn Rate SLO Alerting Sucks

https://eavan.blog/posts/multi-window-burn-rate-sucks.html
1•eavan0•26m ago•1 comments

You Take Notes but Never Get Smarter

https://meda.io/why-you-take-notes-but-never-get-smarter/
4•speckx•27m ago•0 comments

Show HN: CLI to for working with complex Google Docs using Markdown

https://github.com/think41/extrasuite
1•ksri•31m ago•1 comments

Why don't AI coding tools like REST?

3•sshadmand•32m ago•0 comments

It's dumb to study computer science thinking you can make a 'boatload' at Google

https://www.businessinsider.com/replit-ceo-dumb-to-study-computer-science-money-google-2026-4
1•austin-cheney•33m ago•0 comments

OpenAI models coming to Amazon Bedrock: Interview with OpenAI and AWS CEOs

https://stratechery.com/2026/an-interview-with-openai-ceo-sam-altman-and-aws-ceo-matt-garman-abou...
21•translocator•34m ago•2 comments

No one can force me to have a secure website [pdf]

https://tom7.org/httpv/httpv.pdf
2•susam•37m ago•1 comments