Show HN: Kybernis – Prevent AI agents from executing the same action twice

https://kybernis.io
2•wingrammer•1h ago
AI agents increasingly execute real system actions: issuing refunds, modifying databases, deploying infrastructure, calling external APIs.

Because agents retry steps, re-plan tasks, and run asynchronously, the same action can sometimes execute more than once.

In production systems this can cause duplicate payouts, repeated mutations, or inconsistent state.
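
To make the failure mode concrete, here is a minimal sketch of one way a retry can double-execute: the action commits, the response is lost, and the caller retries. `FlakyRefundAPI` and `call_with_retry` are hypothetical names invented for illustration. They are not part of Kybernis or of any agent framework.

```python
class FlakyRefundAPI:
    """Hypothetical payment API: the refund commits, but the first response is lost."""

    def __init__(self):
        self.refunds = []  # side effects that actually committed
        self._calls = 0

    def refund(self, order_id, amount):
        self._calls += 1
        self.refunds.append((order_id, amount))  # the mutation commits...
        if self._calls == 1:
            raise TimeoutError("response lost")  # ...but the caller never sees the ack
        return "ok"


def call_with_retry(fn, *args, attempts=3):
    """Naive retry wrapper (illustrative): retries on timeout with no deduplication."""
    for attempt in range(attempts):
        try:
            return fn(*args)
        except TimeoutError:
            if attempt == attempts - 1:
                raise


api = FlakyRefundAPI()
result = call_with_retry(api.refund, "order-42", 25.0)
print(result, len(api.refunds))  # the caller sees one success, but two refunds committed
```

From the caller's side the call simply succeeded on the second attempt. The duplicate is only visible in the downstream system.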

Kybernis is a reliability layer that sits at the execution boundary of agent systems.

When an agent calls a tool:

1. execution intent is captured
2. the action is recorded in an execution ledger
3. idempotency guarantees are attached
4. the mutation commits exactly once

Retries become safe.
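
The general pattern behind those steps can be sketched in a few lines. This is not Kybernis's actual API or implementation: `ExecutionLedger`, `execute_once`, the `step_id` scoping, and the in-memory dict are all assumptions made for illustration. A production version would presumably need a durable store and would have to handle a crash between committing the action and recording its result.

```python
import hashlib
import json
import threading


class ExecutionLedger:
    """In-memory stand-in for a durable execution ledger (hypothetical)."""

    def __init__(self):
        self._lock = threading.Lock()
        self._results = {}  # idempotency key -> recorded result

    @staticmethod
    def key_for(step_id, tool_name, args):
        # Capture the intent: same step + same tool + same arguments -> same key.
        payload = json.dumps(
            {"step": step_id, "tool": tool_name, "args": args}, sort_keys=True
        )
        return hashlib.sha256(payload.encode()).hexdigest()

    def execute_once(self, step_id, tool_name, args, action):
        key = self.key_for(step_id, tool_name, args)
        with self._lock:
            if key in self._results:
                # Retry of an already-committed action: replay the recorded result.
                return self._results[key]
            result = action(**args)      # the mutation runs here
            self._results[key] = result  # record it so later retries are no-ops
            return result


committed = []  # stands in for the external system's state


def issue_refund(order_id, amount):
    committed.append((order_id, amount))
    return {"status": "refunded", "order_id": order_id}


ledger = ExecutionLedger()
args = {"order_id": "order-42", "amount": 25.0}
first = ledger.execute_once("run-1/step-3", "issue_refund", args, issue_refund)
retry = ledger.execute_once("run-1/step-3", "issue_refund", args, issue_refund)
print(len(committed), first == retry)  # 1 True
```

Scoping the key to a step identifier is a design choice in this sketch: it lets two intentionally separate refunds with identical arguments both run, while a retry of the same step is deduplicated. If the action raises, nothing is recorded, so a later retry runs it again.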

Kybernis is framework-neutral and works with agent frameworks like LangGraph, AutoGen, CrewAI, or custom systems.

I built this after repeatedly seeing reliability failures when AI agents interacted with production APIs.

Would love feedback from anyone building agent systems.

Comments

wingrammer•1h ago
Hi HN – I'm the founder of Kybernis.

The core problem we’re exploring is that AI agents are non-deterministic systems operating inside deterministic infrastructure.

Traditional systems assume actions run once.

Agents retry steps, re-plan tasks, and execute asynchronously.

That combination makes duplicate execution surprisingly easy.

Kybernis focuses on the execution boundary where agents trigger real mutations (payments, infrastructure changes, APIs).

Curious if others deploying agents have run into similar reliability issues.
