I noticed a pattern: every LLM framework today lets the AI manage state and do math. Then we wonder why pipelines hallucinate numbers and break at 3 AM.

I took a different approach and built Aura-State, an open-source Python framework that compiles LLM workflows into formally verified state machines.

Instead of hoping the AI figures it out, I brought in real algorithms from hardware verification and statistical learning:

CTL Model Checking: the same technique used to verify flight control systems, now applied to LLM workflow graphs. Proves safety properties before execution.
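
To make the idea concrete, here is a minimal sketch of explicit-state checking on a toy workflow graph. It is not Aura-State's API: the `WORKFLOW` graph, its state names, and the helper functions are all hypothetical, and it covers only simple reachability-style properties (AG, EF, and "never reach X without passing Y"), not full CTL.

```python
from collections import deque

# Hypothetical workflow graph: state -> list of successor states.
WORKFLOW = {
    "intake": ["extract"],
    "extract": ["validate"],
    "validate": ["extract", "done"],  # retry extraction or finish
    "done": [],
}

def reachable(graph, start):
    """Return every state reachable from start (breadth-first search)."""
    seen = {start}
    queue = deque([start])
    while queue:
        state = queue.popleft()
        for nxt in graph[state]:
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

def holds_ag(graph, start, prop):
    """AG prop: prop is true in every state reachable from start."""
    return all(prop(s) for s in reachable(graph, start))

def holds_ef(graph, start, prop):
    """EF prop: at least one reachable state satisfies prop."""
    return any(prop(s) for s in reachable(graph, start))

def reaches_without(graph, start, target, avoid):
    """True if some path from start reaches target while never visiting avoid."""
    pruned = {
        s: [n for n in succ if n != avoid]
        for s, succ in graph.items()
        if s != avoid
    }
    return target in reachable(pruned, start)

# Liveness-style check: the workflow can finish.
can_finish = holds_ef(WORKFLOW, "intake", lambda s: s == "done")
# Safety check: "done" is never reached without going through "validate".
skips_validation = reaches_without(WORKFLOW, "intake", "done", "validate")
```

Because the graph is finite, these checks run over every reachable state before any LLM call is made, which is the sense in which a property is established ahead of execution.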

Z3 Theorem Prover: every LLM extraction gets formally proven against business constraints. If the total ≠ price × quantity, Z3 catches it with a counterexample.

Conformal Prediction: distribution-free 95% prediction intervals on every extracted field. Not just "the LLM said $450k" but "95% interval: [$448k, $452k]."
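
For readers unfamiliar with the technique, a rough sketch of split conformal prediction with absolute-error scores follows. This is an assumption about the general method, not a description of how Aura-State implements it; `conformal_interval` and the calibration numbers are made up for illustration.

```python
import math

def conformal_interval(calib_preds, calib_truth, new_pred, alpha=0.05):
    """Split conformal interval around new_pred using absolute residuals.

    calib_preds / calib_truth are held-out predictions and their true values.
    The interval covers the true value with probability >= 1 - alpha,
    assuming calibration and new examples are exchangeable.
    """
    scores = sorted(abs(p - t) for p, t in zip(calib_preds, calib_truth))
    n = len(scores)
    # Finite-sample corrected rank of the (1 - alpha) quantile.
    rank = math.ceil((n + 1) * (1 - alpha))
    if rank > n:
        raise ValueError("not enough calibration examples for this alpha")
    q = scores[rank - 1]
    return new_pred - q, new_pred + q

# Illustrative calibration set (values in $k): errors of 1, 2, ..., 20.
calib_truth = [100] * 20
calib_preds = [100 + i for i in range(1, 21)]
low, high = conformal_interval(calib_preds, calib_truth, new_pred=450)
```

The width of the interval comes entirely from errors observed on the calibration set, so a tight interval is only as trustworthy as that set is representative.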

MCTS Routing: Monte Carlo Tree Search (the algorithm behind AlphaGo) scores ambiguous state transitions mathematically.
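
As a rough illustration of what "scoring a transition" can mean, the sketch below shows only the UCB1 selection step that MCTS commonly uses, not a full tree search and not Aura-State's actual router. The transition names and the reward/visit statistics are hypothetical.

```python
import math

def ucb1(total_reward, visits, parent_visits, c=math.sqrt(2)):
    """UCB1 score: average reward plus an exploration bonus."""
    if visits == 0:
        return math.inf  # always try an unvisited transition first
    exploit = total_reward / visits
    explore = c * math.sqrt(math.log(parent_visits) / visits)
    return exploit + explore

def pick_transition(stats):
    """stats maps next_state -> (total_reward, visits); return the best-scoring state."""
    parent_visits = sum(visits for _, visits in stats.values())
    return max(stats, key=lambda s: ucb1(*stats[s], parent_visits))

# Hypothetical statistics gathered from earlier simulated rollouts.
stats = {"negotiate": (6.0, 10), "close": (3.0, 4)}
choice = pick_transition(stats)
```

The exploration term favors transitions that have been tried less often, so a rarely visited branch with a decent average can outrank a heavily visited one.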

Sandboxed Math: English math rules compile to a Python AST, and the arithmetic runs in Python rather than in the model. Zero-hallucination calculations.
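
One way to picture the sandboxing half of this is a whitelist-based AST evaluator, sketched below. It assumes the rule has already been turned into an arithmetic expression string; the English-to-expression step is not shown, and `safe_eval` is a hypothetical helper, not part of Aura-State.

```python
import ast
import operator

# Only these binary operators are allowed.
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

def safe_eval(expr, variables):
    """Evaluate an arithmetic expression over named variables, nothing else."""
    def walk(node):
        if isinstance(node, ast.Expression):
            return walk(node.body)
        if (isinstance(node, ast.Constant)
                and isinstance(node.value, (int, float))
                and not isinstance(node.value, bool)):
            return node.value
        if isinstance(node, ast.Name) and node.id in variables:
            return variables[node.id]
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](walk(node.left), walk(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -walk(node.operand)
        # Calls, attribute access, unknown names, etc. are all rejected.
        raise ValueError(f"disallowed expression element: {type(node).__name__}")

    return walk(ast.parse(expr, mode="eval"))

total = safe_eval("price * quantity", {"price": 3, "quantity": 4})
```

Anything outside the whitelist raises an error instead of executing, so an expression like `__import__('os')` never runs.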

I ran a live benchmark against 10 real-estate sales transcripts using GPT-4o-mini:

→ 100% budget extraction accuracy ($0 mean error)

→ 20/20 Z3 proof obligations passed

→ 3/3 temporal safety properties proven

→ 65 automated tests passing

The gap between "it usually works" and "it provably works" is smaller than people think.

Would love feedback from anyone building production LLM systems: what would you want formally verified?

https://github.com/munshi007/Aura-State


Aura-State: Formally Verified LLM State Machine Compiler

