Continuum – CI drift guard for LLM workflows

https://github.com/Mofa1245/Continuum
1•Mofa1245•1h ago

Comments

Mofa1245•1h ago
AI outputs change.

Models update. Prompts evolve. Small output shifts can silently break production logic.

If you're extracting structured data (invoices, tickets, reports) from LLMs, a tiny change in model output can cascade into incorrect downstream behavior.

Continuum records a multi-step LLM workflow once, then deterministically replays and verifies it later.

If anything changes — raw model output, parsed JSON, or derived memory — your CI fails.
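A minimal sketch of what strict diffing across those three layers could look like. This is not Continuum's actual implementation: the `diff_layers` function and the snapshot keys (`raw_output`, `parsed_json`, `derived_memory`) are hypothetical names chosen for illustration, and the sample values are made up.

```python
import json

# Hypothetical snapshot layout; Continuum's real storage format may differ.
LAYERS = ("raw_output", "parsed_json", "derived_memory")


def diff_layers(recorded: dict, replayed: dict) -> list[str]:
    """Return the names of the layers whose values differ between two snapshots."""
    changed = []
    for layer in LAYERS:
        # Canonical JSON (sorted keys) so dict ordering never counts as drift.
        a = json.dumps(recorded.get(layer), sort_keys=True)
        b = json.dumps(replayed.get(layer), sort_keys=True)
        if a != b:
            changed.append(layer)
    return changed


# Made-up example: the model starts emitting 72.0 instead of 72.
recorded = {
    "raw_output": '{"total": 72}',
    "parsed_json": {"total": 72},
    "derived_memory": {"last_invoice_total": 72},
}
replayed = {
    "raw_output": '{"total": 72.0}',
    "parsed_json": {"total": 72.0},
    "derived_memory": {"last_invoice_total": 72},
}

drift = diff_layers(recorded, replayed)
# A CI wrapper would pass this value to sys.exit() so the job fails on drift.
exit_code = 1 if drift else 0
print("drift in:", drift, "exit code:", exit_code)
```

Comparing serialized forms rather than Python values is a deliberate choice here: `72 == 72.0` is true in Python, so a value-level comparison would miss exactly the kind of small shift that strict mode is meant to catch.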

Example:

1. Run `continuum invoice-demo`
2. It extracts structured fields from an invoice
3. Run `continuum verify-all --strict` → PASS
4. Modify a stored value (e.g., 72 → 99)
5. Run verify again → FAIL
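The steps above can be mimicked in a few lines of plain Python. This is only a sketch of the record, verify, tamper, verify cycle, not how the `continuum` CLI works internally; `record`, `verify`, the `run.json` file name, and the invoice fields are all hypothetical.

```python
import json
import tempfile
from pathlib import Path


def record(path: Path, fields: dict) -> None:
    """Store the extracted fields once as the reference run."""
    path.write_text(json.dumps(fields, sort_keys=True, indent=2))


def verify(path: Path, fields: dict) -> str:
    """Compare freshly replayed fields against the stored reference."""
    stored = json.loads(path.read_text())
    return "PASS" if stored == fields else "FAIL"


# Stand-in for the fields an invoice extraction step might produce (made-up values).
extracted = {"invoice_id": "INV-001", "total": 72}

with tempfile.TemporaryDirectory() as tmp:
    store = Path(tmp) / "run.json"
    record(store, extracted)
    first = verify(store, extracted)

    # Tamper with the stored value: 72 -> 99.
    data = json.loads(store.read_text())
    data["total"] = 99
    store.write_text(json.dumps(data))
    second = verify(store, extracted)

print(first, second)  # prints "PASS FAIL"
```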

It’s a simple drift guard for LLM pipelines.

No hosted service. No external storage. Just deterministic replay + strict diffing.

Repository: https://github.com/Mofa1245/Continuum

Feedback welcome.

Mofa1245•1h ago
A few clarifications:

- This isn’t trying to make LLMs deterministic.
- It records the full workflow output once, then replays and diffs it later.
- The goal is CI drift detection, not runtime enforcement.

Curious how others are currently guarding against silent output drift in production.
