news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Building reproducible LLM agents with strict determinism guarantees

https://github.com/crasofuentes-hub/deterministic-agent-system

1•Agente2026•2h ago

Comments

Agente2026•2h ago

I’ve been building a deterministic agent system in TypeScript with the explicit goal of making agent behavior auditable, reproducible and bounded — something that most agent frameworks currently ignore or treat as optional.

Key properties: • Bounded tool loops with fixpoint convergence or max_iterations • Deterministic planners (det-tools, mock, llm-live stub/real-path) • Reproducibility via plan/execution/finalTrace hashes + replay bundle v2 • Surfaced violations instead of silent failures • Capability-driven tool selection (latest addition) • HTTP API (/agent/run, /tools, /capabilities) with negative test coverage • One-command demos (npm run demo:agent:llm-live:real, etc.)

It’s still early (v0.2.0, 165 commits in ~12 days), no LLM live full yet, no npm package, but the bounded/replay/determinism core is already verifiable via tests and demos.

Repo: https://github.com/crasofuentes-hub/deterministic-agent-syst...

Would love feedback from people working on reproducible agents, AI safety evals, or enterprise workflows where non-determinism is a real problem.

Thanks!

Show HN: How many working days in 2026? And your income in pizzas

https://gettti.me/tools/working-days

1•v_b•3m ago•0 comments

Show HN: Free market intelligence tool, analyze HN, find users pain points

https://whatstechin.com/

1•losalah•4m ago•0 comments

How I Use Claude Code as a Designer at Shopify [video]

https://www.youtube.com/watch?v=aVDAhJ3PtLg

1•benr•5m ago•1 comments

Show HN: Engram – open-source persistent memory for AI agents (Bun and SQLite)

https://github.com/zanfiel/engram

1•zanfiel•6m ago•1 comments

The Complete Guide to Building Skills for Claude [pdf]

https://resources.anthropic.com/hubfs/The-Complete-Guide-to-Building-Skill-for-Claude.pdf

1•Terretta•7m ago•0 comments

Agentic coding doesn't = technical debt

https://inmydata.ai/blog/agentic-coding-discipline/

1•nfinch•8m ago•0 comments

A 10% traffic spike took down a stable system in 3 minutes and 47 seconds

https://www.orchenginex.com/publications/queue-collapse-traffic-spike

1•Mlondy•10m ago•0 comments

Show HN: This is what I Want from the Internet

https://jetzt.cx/about

1•krickelkrackel•14m ago•1 comments

Nvidia backs AI data center startup Nscale as it hits $14.6B valuation

https://www.cnbc.com/2026/03/09/nscale-ai-data-center-nvidia-raise.html

2•voxadam•16m ago•0 comments

Building a system to track market narratives and behavioral signals

1•mindquantai•16m ago•0 comments

Tiny Terminal

https://meimakes.com/tiny-terminal/

1•the-mitr•19m ago•0 comments

Production query plans without production data

https://boringsql.com/posts/portable-stats/

1•todsacerdoti•21m ago•0 comments

Russia targets Signal and WhatsApp accounts in cyber campaign

https://english.aivd.nl/latest/news/2026/03/09/russia-targets-signal-and-whatsapp-accounts-in-cyb...

1•HelloUsername•21m ago•0 comments

Mahjong Mentor

https://mj-mentor.lovable.app/

1•dpzl•23m ago•1 comments

Fear Is Destroying Your Org

https://yanivpreiss.com/2026/03/08/fear-is-destroying-your-org/

2•PretzelFisch•24m ago•0 comments

Show HN: Yawn – Yet Another Worktree Navigator (CLI, Pipes into Fzf)

https://github.com/ComeBertrand/yawn

1•ComeBertrand•26m ago•0 comments

Show HN: Unpinched – open-source PinchTab and CDP bridge detector

https://github.com/Helixar-AI/Unpinched

1•Siri_D•27m ago•0 comments

API Traffic Analyzer for Kubernetes

https://kubeshark.com/

1•l1am0•30m ago•0 comments

Hermes Agent

https://github.com/NousResearch/hermes-agent

1•tosh•30m ago•0 comments

Show HN: commitgen-cc – Generate Conventional Commit message locally with Ollama

https://github.com/Eaglemann/commitgen-cc

1•eagleman•31m ago•1 comments

Ask HN: What is your oldest living presence on the World Wide Web?

2•dhruv3006•33m ago•0 comments

The Economy of Loneliness

https://app.dateseriously.com/the-economy-of-loneliness.html

1•skanderbm•33m ago•0 comments

Show HN: Think Better – Inject Decision Frameworks into Claude and Copilot

https://github.com/HoangTheQuyen/think-better

1•hoangthequyen01•38m ago•0 comments

35 days in. 28 training cycles across 3 base models

https://forgeintelligence.substack.com/p/forge-intelligence-edition-5

1•beakmull•38m ago•0 comments

Returning to Rails in 2026

https://www.markround.com/blog/2026/03/05/returning-to-rails-in-2026/

4•todsacerdoti•39m ago•3 comments

Russian hackers breach Signal and WhatsApp accounts officials, Netherlands warns

https://www.reuters.com/world/europe/russia-backed-hackers-breach-signal-whatsapp-accounts-offici...

2•repelsteeltje•39m ago•0 comments

Gyro-Claw – Secure execution runtime for AI agents

4•gyroscape•40m ago•0 comments

Building GREMLIN's Lair

https://peebs.org/building-gremlins-lair/

2•nemesisj•40m ago•0 comments

Every language should have a UUID type

https://nafees.bearblog.dev/every-language-should-have-a-uuid-type/

2•mnafees•45m ago•0 comments

Building a Stripe dashboard with an ESP32 desktop clock and Rust

https://duggan.ie/posts/building-a-stripe-dashboard-with-an-esp32-desktop-clock-and-rust

2•duggan•46m ago•1 comments