frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Continuum – Unit tests for LLM workflows

https://github.com/Mofa1245/Continuum
2•Mofa1245•1h ago

Comments

Mofa1245•1h ago
LLM non-determinism is a silent killer for production data.

A prompt tweak or model update can shift an extraction from:

amount: 72

to:

amount: "72.00"

Nothing crashes. Your pipeline just sends incorrect numbers downstream.

I built Continuum to treat AI workflows like unit tests.

It records a known-good run and replays it deterministically in CI. If any step output drifts, verification fails.

Local-first CLI. No SaaS.

Example included: invoice extraction pipeline with deterministic replay.

Repo: https://github.com/Mofa1245/Continuum

Mofa1245•1h ago
I built this after running into a recurring issue with LLM pipelines.

Small prompt changes would fix one extraction but silently break another. Nothing threw an error – downstream systems just started receiving slightly different JSON.

Traditional unit tests didn't help because the raw LLM output itself was changing.

Continuum records the full workflow run (LLM call, parsed JSON, etc.) and replays it deterministically so CI can catch drift before production.

Curious how others are handling this problem in real systems. Are people snapshot-testing LLM outputs today?

Find the perfect icon for your design

https://iconsroom.com/
2•mdanassaif•1m ago•0 comments

DuckDB Kernel for Jupyter

https://medium.com/@gribanov.vladimir/building-a-full-featured-duckdb-kernel-for-jupyter-with-a-d...
1•jonbaer•1m ago•0 comments

The Pyramid and the Tomb

https://minutes.substack.com/p/the-pyramid-and-the-tomb
1•jger15•2m ago•0 comments

Diablo 2 in First Person with Unreal Engine [video]

https://www.youtube.com/watch?v=0NPl7ZGg14E
1•metadat•8m ago•0 comments

Lp(a) testing is now recommended to prevent heart disease

https://www.npr.org/2026/03/13/nx-s1-5747111/cholesterol-guidelines-lipoproteina-test
1•brandonb•8m ago•0 comments

Simulations on Xbox 360: Cardiac arrhythmias, re-entry and the Halting problem

https://www.sciencedirect.com/science/article/abs/pii/S1476927109000486
1•rbanffy•10m ago•0 comments

Pug 3.0.4

https://github.com/pugjs/pug/releases
1•guzik•11m ago•0 comments

AutoHarness: Improving LLM agents by automatically synthesizing a code harness

https://arxiv.org/abs/2603.03329
1•simonpure•12m ago•0 comments

LightSwarm – A script making ClaudeCodeMax into a free light easy swarm

https://github.com/craftfortress/lightswarm
1•ionwake•12m ago•0 comments

People using Tesla Autopilot to drive while drunk

https://bsky.app/profile/niedermeyer.online/post/3mgxbqti3yc22
3•doener•13m ago•0 comments

Show HN: A single CLI to manage llama.cpp/vLLM/Ollama models

https://github.com/av/harbor/releases/tag/v0.4.4
1•everlier•15m ago•0 comments

Systems Thinking Is Brain Rot for Analysts

https://blundercheck.timberschroff.com/p/systems-thinking-is-brain-rot-for
1•anarbadalov•15m ago•0 comments

Way to Run OpenClaw Locally on AMD Ryzen

https://www.amd.com/en/resources/articles/run-openclaw-locally-on-amd-ryzen-ai-max-and-radeon-gpu...
1•mirzap•15m ago•1 comments

Create a 5s 1080p Video in 4.5s with FastVideo on a Single GPU

https://1080p.fastvideo.org/
7•zhisbug•15m ago•1 comments

The Long Telegram

https://en.wikipedia.org/wiki/X_Article
1•simonebrunozzi•16m ago•0 comments

Coding Is Dead, Long Live Programming

https://ian-cooper.writeas.com/coding-is-dead-long-live-programming
2•tbayramov•16m ago•0 comments

Show HN: URLert Guard – Real-time phishing forensics for Chrome

https://www.urlert.com/blog/announcing-urlert-guard
1•tomerhe•17m ago•1 comments

Define once, use everywhere: a metrics layer for ClickHouse with MooseStack

https://clickhouse.com/blog/metrics-layer-with-fiveonefour
1•oatsandsugar•18m ago•0 comments

`mcpx` – MCP in a CLI

https://github.com/evantahler/mcpx
5•evantahler•18m ago•3 comments

Social Media, Reset

https://meetzeta.com
1•paicom•19m ago•0 comments

Show HN: KayZeer – Vimium-style keyboard navigation for macOS

https://github.com/serjster/KayZeer
1•serjts•20m ago•2 comments

More than 135 open hardware devices flashable with your own firmware

https://openhardware.directory
2•iosifnicolae2•21m ago•0 comments

MacBook Neo Is the Most Repairable MacBook in 14 Years

https://www.ifixit.com/News/116152/macbook-neo-is-the-most-repairable-macbook-in-14-years
2•mrzool•22m ago•1 comments

Privacy-first tools you buy once and own forever. No subscription.Productivity

https://visualtools.cc/
2•rutkute•24m ago•1 comments

Linkding: A self-hosted bookmark manager

https://linkding.link
1•cdrnsf•29m ago•1 comments

Re//verse 2026: Hacking the Xbox One [video]

https://www.youtube.com/watch?v=FTFn4UZsA5U
2•zap_rpisec•32m ago•0 comments

Code Is a Liquid Now

https://micro.inessential.com/2026/03/13/code-is-a-liquid-now.html
1•sonicrocketman•32m ago•0 comments

Show HN: orb-ui – Voice AI UI Components for React (Vapi, ElevenLabs, etc.)

https://orb-ui.com/
2•alexanderqchen•32m ago•0 comments

Regolith Simulants Desorb and Weather After Exposure to Life Support Effluent

https://pubs.acs.org/doi/10.1021/acsearthspacechem.5c00267
1•PaulHoule•35m ago•0 comments

Show HN: Loop your agents like a dandy little b*tch

https://github.com/geekforbrains/loopsie
2•geekforbrains•36m ago•0 comments