Show HN: Pre-execution verification for LLM-generated agentic workflows

https://github.com/le0li0n/workflow-verify
2•jaredwaxman•1h ago

Comments

jaredwaxman•1h ago
I was building agentic workflows for my CRM — Otter.ai recordings → Clay enrichment → CRM updates — and got tired of LLM-generated pipelines silently doing the wrong thing. A pipeline that "worked" was pushing contacts without validating email format, making API calls I didn't authorize, and failing silently when field names didn't match between steps.

The problem isn't that LLMs write bad code. It's that there's no contract between what you asked for and what runs. Structured outputs solve format. Guardrails AI solves content safety. Temporal solves execution. Nobody checks whether the workflow itself makes sense as a pipeline.

So I built a verification layer. The LLM outputs a workflow AST via structured outputs. Before anything executes, the engine type-checks data flow across steps, validates schemas at boundaries, and requires every side effect (API calls, DB writes, webhooks) to be explicitly declared. You get a manifest — "this workflow READs from Salesforce and WRITEs to HubSpot" — that a compliance system can review without reading code.
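
To make the data-flow check concrete, here is a minimal sketch of the idea, not the project's actual code. It assumes a strictly linear pipeline in which each step reads only the previous step's outputs. The `Step` class, the `check_data_flow` function, and the field names are hypothetical, and plain dataclasses stand in for the Pydantic models the real project presumably uses.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class Step:
    """One node of a (hypothetical) linear workflow AST."""
    name: str
    inputs: Dict[str, type]   # field name -> type this step expects
    outputs: Dict[str, type]  # field name -> type this step produces


def check_data_flow(steps: List[Step]) -> List[str]:
    """Return human-readable errors; an empty list means the pipeline type-checks."""
    errors = []
    # Compare each step's declared inputs with the previous step's outputs.
    for prev, cur in zip(steps, steps[1:]):
        for field_name, expected in cur.inputs.items():
            if field_name not in prev.outputs:
                errors.append(
                    f"{cur.name}: input '{field_name}' is not produced by {prev.name}"
                )
            elif prev.outputs[field_name] is not expected:
                errors.append(
                    f"{cur.name}: input '{field_name}' expects {expected.__name__}, "
                    f"but {prev.name} produces {prev.outputs[field_name].__name__}"
                )
    return errors


# Example: the last step asks for 'email_address', but upstream emits 'email'.
steps = [
    Step("transcribe", {}, {"email": str, "name": str}),
    Step("enrich", {"email": str, "name": str}, {"email": str, "company": str}),
    Step("crm_update", {"email_address": str, "company": str}, {}),
]
errors = check_data_flow(steps)
for line in errors:
    print(line)
```

Run before execution, a check like this would surface the field-name mismatch as an explicit error instead of a silent failure at runtime.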

~800 lines of Python, zero deps beyond Pydantic, MIT licensed. Would especially love feedback from folks building agentic systems in production — the schema library for domain-specific patterns is the most obvious area for contributions.

ConvertlyAI•1h ago
I love this approach to verification. I literally just launched my own AI formatting engine yesterday, and the hardest part wasn't the generation—it was building strict system-level guardrails to stop the model from outputting generic fluff words and breaking my slide formatting. Are you doing this pre-execution verification purely through secondary prompt checks, or are you running it through a separate smaller model first?

jaredwaxman•1h ago
Thanks! We're doing pre-execution verification through static analysis of the workflow AST — no secondary model involved. The verifier runs deterministically against declared effects and type constraints, so it catches issues before anything executes. Curious about your approach — are your guardrails rule-based or are you using a classifier?
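
As a rough illustration of what a deterministic check against declared effects could look like, the sketch below compares each step's declarations with a fixed registry and then builds a manifest. Everything named here is an assumption made for the example: the `ACTION_EFFECTS` registry, the action names, `DeclaredStep`, `verify_effects`, and `build_manifest` are hypothetical and not taken from the workflow-verify repository.

```python
from dataclasses import dataclass
from typing import Dict, FrozenSet, List, Tuple

Effect = Tuple[str, str]  # (verb, system), e.g. ("READ", "Salesforce")

# Hypothetical registry mapping each known action to the effects it performs.
ACTION_EFFECTS: Dict[str, FrozenSet[Effect]] = {
    "salesforce.fetch_contacts": frozenset({("READ", "Salesforce")}),
    "hubspot.upsert_contact": frozenset({("WRITE", "HubSpot")}),
}


@dataclass(frozen=True)
class DeclaredStep:
    action: str
    declared: FrozenSet[Effect]  # effects the workflow author (or LLM) declared


def verify_effects(steps: List[DeclaredStep]) -> List[str]:
    """Reject unknown actions and any effect a step performs without declaring it."""
    errors = []
    for index, step in enumerate(steps):
        actual = ACTION_EFFECTS.get(step.action)
        if actual is None:
            errors.append(f"step {index}: unknown action '{step.action}'")
            continue
        for verb, system in sorted(actual - step.declared):
            errors.append(
                f"step {index}: '{step.action}' performs undeclared effect {verb} {system}"
            )
    return errors


def build_manifest(steps: List[DeclaredStep]) -> List[str]:
    """Summarise declared effects as sorted 'VERB System' lines for review."""
    effects = {effect for step in steps for effect in step.declared}
    return [f"{verb} {system}" for verb, system in sorted(effects)]


workflow = [
    DeclaredStep("salesforce.fetch_contacts", frozenset({("READ", "Salesforce")})),
    DeclaredStep("hubspot.upsert_contact", frozenset()),  # WRITE not declared
]
problems = verify_effects(workflow)

fixed = [
    workflow[0],
    DeclaredStep("hubspot.upsert_contact", frozenset({("WRITE", "HubSpot")})),
]
manifest = build_manifest(fixed)
```

The check uses only set operations on static data, so the same workflow always yields the same verdict, and the manifest can be reviewed without reading any step's code.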
