I built FailWatch because I couldn't trust my financial AI agent with a production wallet. No matter how much I optimized the system prompt (e.g., "Do not refund > $500"), the LLM would occasionally hallucinate or drift logically.
The scariest part wasn't the hallucination itself, but the failure mode: if my external validation service crashed or timed out, the default behavior in many frameworks was to "fail-open" and execute the tool anyway.
FailWatch is a Python middleware that sits between the agent and the tool execution to enforce fail-closed safety:
Math > Prompts: It uses deterministic Python logic (Pydantic/regex validators) for hard constraints; see the first sketch below.
Fail-Closed Architecture: If the guard server is unreachable or times out, the action is blocked by default.
Logic Drift Detection: It can optionally inspect the agent's "chain of thought" steps to detect intent mismatch before execution; see the second sketch below.
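To make the first two points concrete, here's a minimal sketch of the pattern (hypothetical names and endpoint, not FailWatch's actual API): the $500 limit lives in a Pydantic model rather than the prompt, and any error reaching the guard server blocks the call.

    import requests
    from pydantic import BaseModel, Field, ValidationError

    class RefundArgs(BaseModel):
        amount: float = Field(gt=0, le=500)  # the $500 hard limit lives in code
        account_id: str

    GUARD_URL = "http://localhost:8000/validate"  # hypothetical guard endpoint

    def execute_refund(args: RefundArgs) -> None:
        # Stand-in for the real tool call.
        print(f"refunded ${args.amount:.2f} to {args.account_id}")

    def guarded_refund(raw_args: dict) -> bool:
        # 1. Deterministic constraint: anything Pydantic rejects never runs.
        try:
            args = RefundArgs(**raw_args)
        except ValidationError:
            return False

        # 2. Remote guard check: any network error or timeout fails CLOSED.
        try:
            resp = requests.post(GUARD_URL, json=args.model_dump(), timeout=2)
            resp.raise_for_status()
            approved = bool(resp.json().get("approved", False))
        except requests.RequestException:
            approved = False  # guard down or unreachable -> block by default

        if approved:
            execute_refund(args)
        return approved

    # A $9,000 refund is rejected by the Pydantic model; a $120 refund is
    # still blocked if the guard server can't be reached.
    guarded_refund({"amount": 9000, "account_id": "acct_42"})  # -> False
    guarded_refund({"amount": 120, "account_id": "acct_42"})

The key design choice is that "approved" starts from False on every error path, so a crashed or slow guard can never silently let the tool run.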
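The second sketch is a purely illustrative drift check, not FailWatch's actual logic: compare the agent's last reasoning step against the call it is about to make. A real implementation would need something sturdier (a small classifier or embedding similarity), but even a crude check catches the classic mismatch.

    def intent_matches(reasoning_steps: list[str], tool_name: str, args: dict) -> bool:
        # Naive heuristic: the final reasoning step should mention both the
        # tool being called and the amount it is about to move.
        last = reasoning_steps[-1].lower() if reasoning_steps else ""
        mentions_tool = tool_name.lower() in last
        mentions_amount = str(args.get("amount", "")) in last
        return mentions_tool and mentions_amount

    # The agent reasoned about a $120 refund but is calling with amount=9000:
    steps = ["Customer is owed $120.", "I will call refund for 120 dollars."]
    print(intent_matches(steps, "refund", {"amount": 9000}))  # False -> block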
It's open source (MIT). I'd love to hear feedback on the architecture or how you handle "safety-critical" tool calls in your agents.