frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Replacing $50k manual forensic audits with a deterministic .py engine

2•cd_mkdir•1h ago
I’m a software architect, and I recently built Exit Protocol (https://exitprotocols.com), an automated forensic accounting engine for high-conflict litigation.

Problem: If you get divorced and need to prove that a specific $250k in a heavily commingled joint bank account is your "separate property" (e.g., from a pre-marital startup exit), the burden of proof is strictly mathematical. Historically, this meant paying a forensic CPA $500/hour to dump years of blurry bank PDFs into Excel and manually trace every dollar. It takes weeks and routinely costs over $50,000.

I looked at the legal standard courts use for this—the Lowest Intermediate Balance Rule (LIBR)—and realized it wasn’t an accounting problem. It is a Distributed Systems state-machine problem.

Why we didn't just "Throw AI at it"?

There are a hundred legal-tech startups right now trying to use LLMs to summarize bank data. In a courtroom, GenAI is a fatal liability. If an LLM hallucinates a single transaction, the entire ledger is inadmissible under the Daubert standard.

To make this court-ready, we had to build a strictly deterministic pipeline:

1. Vision-Native Ingestion (Beating Tesseract) Bank statements are the final boss of OCR (merged cells, overlapping debit/credit columns). Standard linear OCR fails catastrophically. We built a spatial-grid OCR pipeline (using Azure Document Intelligence with a local Surya OCR fallback) that maps the geometric structure of the page. It reconstructs tabular ledgers perfectly, even from multi-generational "PDFs from hell."

2. The Deterministic Engine (LIBR) The LIBR algorithm acts as a one-way ratchet. If an account balance drops below your separate property claim amount, your claim is permanently capped at that new floor. Subsequent marital deposits do not refill it (the "replenishment fallacy"). The engine replays thousands of transactions chronologically, continuously evaluating S_t = min(S_t-1, B_t).

3. Resolving Timestamp Ambiguity Bank PDFs give you dates, not timestamps. If a $10k deposit and $10k withdrawal happen on the same day, order matters. We built a simulation toggle that forces "Worst Case" (withdrawals process first) vs "Best Case" sorting, establishing a mathematically irrefutable "Zone of Truth" for settlement negotiations.

4. Cryptographic Chain of Custody & Sovereign Mode Lawyers are terrified of cloud SaaS breaches. We containerized the entire monolith (Django 5.0/Postgres/Celery) via Docker so enterprise firms can run it air-gapped on their own hardware (Sovereign Mode). Furthermore, every generated PDF dossier is sealed with a SHA-256 hash of the underlying data snapshot, proving to a judge that the output hasn't been tampered with since generation.

If you want to see the math in action, we set up a "Demo Sandbox" populated with a synthetic, highly complex 3-year commingled ledger. You can run the engine yourself here (Desktop recommended): https://exitprotocols.com/simulation/uplink/

Here is the exact "Attorney Work Product" it generates from raw PDF or Forensic Audit Dossier our system generates- https://exitprotocols.com/static/documents/Forensic_Audit_Sa...

I'd love feedback from the HN crowd on the architecture—specifically handling edge-case data ingestion and maintaining cryptographic integrity in B2B enterprise deployments.

Cheers!

Comments

cd_mkdir•1h ago
Not a lawyer, so the Go-To-Market side in the legal space has been a steep learning curve. If anyone here has experience selling/deploying air-gapped, on-prem solutions to highly risk-averse, non-technical clients (like law firms), I would love to hear your battle stories.

Happy to answer any questions about the math, the OCR pipeline, or the architecture!

Sandbox link again: https://exitprotocols.com/simulation/uplink/

Major investor 'shocked and sad' that the games industry is 'demonizing' gen AI

https://www.pcgamer.com/software/ai/major-investor-is-shocked-and-sad-that-the-games-industry-is-...
1•stalfosknight•1m ago•0 comments

Quantum Teleportation Breakthrough Brings the Quantum Internet Closer

https://scitechdaily.com/quantum-teleportation-breakthrough-brings-the-quantum-internet-closer/
1•HardwareLust•4m ago•0 comments

Show HN: NumenText, a non-modal editing terminal IDE with LSP/DAP

https://github.com/numentech-co/numentext
1•rlogman•5m ago•0 comments

Airbus is preparing two uncrewed combat aircraft

https://www.airbus.com/en/newsroom/press-releases/2026-03-airbus-is-preparing-two-uncrewed-combat...
2•phasnox•6m ago•0 comments

RFC 4180 – CSV (2005)

https://www.rfc-editor.org/rfc/rfc4180
1•basilikum•11m ago•0 comments

Laws of Nature and Chances: What Breathes Fire into the Equations

https://ndpr.nd.edu/reviews/laws-of-nature-and-chances-what-breathes-fire-into-the-equations/
1•hhs•13m ago•0 comments

The slow death of the English boarding school

https://www.ft.com/content/cc7eb665-b689-4e2f-9e35-c6ab5dbf3980
1•bookofjoe•13m ago•1 comments

Show HN: I let the internet control my iPad with AI

https://play.thomaskidane.com/
1•meneliksecond•14m ago•1 comments

The Sound of Contamination: Headphones Contain Ing Hormone-Disrupting Chemicals

https://arnika.org/en/news/the-sound-of-contamination-all-analysed-headphones-on-the-central-euro...
2•microflash•16m ago•1 comments

'In Search of Now’ review: Blurring forever and a day

https://www.wsj.com/arts-culture/books/in-search-of-now-review-time-out-of-mind-2e33a184
1•hhs•18m ago•0 comments

Tokenizing Arithmetic Expressions

https://xnacly.me/posts/2023/calculator-lexer/
1•ibobev•19m ago•0 comments

Show HN: Nia CLI, an OSS CLI for agents to index, search, and research anything

https://github.com/nozomio-labs/nia-cli
1•jellyotsiro•19m ago•0 comments

Mojo's Not (Yet) Python

https://theconsensus.dev/p/2026/03/12/mojos-not-yet-python.html
1•ibobev•20m ago•0 comments

Introduction to SQLAlchemy 2 in Practice

https://blog.miguelgrinberg.com/post/introduction-to-sqlalchemy-2-in-practice
2•ibobev•20m ago•0 comments

What can change the nature of an AI?

https://onatm.dev/2026/03/14/what-can-change-the-nature-of-an-ai/
1•onatm•23m ago•0 comments

Siepr Economic Summit 2026 – California Wealth Tax [video]

https://www.youtube.com/watch?v=H54P6j7ER28
1•skmurphy•24m ago•1 comments

Does anyone else struggle to search inside large YouTube playlists?

https://chromewebstore.google.com/detail/todij-playlist-manager/fboiimochokolojefdohahhiapkkpccg
1•seyfigo•27m ago•1 comments

Plan 9's Acme: The Un-Terminal and Text-Based GUIs

https://www.danielmoch.com/posts/2025/01/acme/
2•birdculture•29m ago•1 comments

Tools Collection

https://theultimatewebtools.com/
1•Plaz•30m ago•0 comments

Verge Electric TS Pro Is a Revolutionary Motorcycle

https://www.autoweek.com/news/a60944655/verge-electric-ts-pro-motorcycle-review/
1•thunderbong•31m ago•0 comments

The datacenter where the day starts with topping up cerebrospinal fluid

https://www.theregister.com/2026/03/14/cortical_labs_biological_cloud/
2•spzb•32m ago•0 comments

GrobPaint: Somewhere Between MS Paint and Paint.net. Multiplatform by Default

https://github.com/groverburger/grobpaint
2•__grob•35m ago•0 comments

Show HN: GLinksWWW – A browser for power users tired of repetitive copy-pasting

https://github.com/rio719/gLinksWWW-browser
2•glinkswww•39m ago•1 comments

Show HN: A 2D football SIM focused on real-time coaching

https://www.touchlineshouting.com
1•pipnonsense•40m ago•0 comments

Off-Grid Electricity and Hot Water from Scrap Wood

https://hackaday.com/2026/03/14/off-grid-electricity-and-hot-water-from-scrap-wood/
4•toomuchtodo•43m ago•1 comments

Detexify

https://detexify.kirelabs.org/classify.html
1•jruohonen•43m ago•0 comments

Show HN: The best way to manage your skills – Better-Skills

3•trapani•45m ago•2 comments

Dependency cooldowns would be a good idea for Go

https://utcc.utoronto.ca/~cks/space/blog/programming/GoDependencyCooldownsGood
2•ingve•46m ago•1 comments

Trump Adviser Warns of Possible Israel Nuclear Escalation in Iran Conflict

https://www.newsweek.com/david-sacks-trump-administration-israel-nuclear-escalation-iran-war-1167...
4•jacquesm•47m ago•0 comments

Invariant Risk Minimization (2020)

https://arxiv.org/abs/1907.02893
1•gone35•48m ago•0 comments