Ask HN: Is anyone using LLM based document processing in production?

7•asdev•1mo ago

I'm wondering if anyone is actually using LLMs to process documents reliably in production. One hallucination could lead to a host of issues. For example, if someone is using LLMs to process documents and enter data into an ERP, if even one number is off it could cause accounting issues, inventory issues etc. Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation.

Comments

cranberryturkey•1mo ago

we're using it at SummaryForge

asdev•1mo ago

in what context?

cranberryturkey•1mo ago

Summarizing pdfs

whinvik•1mo ago

We are. But our use case is more tolerant of failures, so it's probably not as much of an issue.

asdev•1mo ago

How do you remediate failures?

muzani•1mo ago

I have a project with them, processing auto insurance claims. Mostly extracting details from police reports like license plate numbers, extracting details of the incident.

"Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation."

They're doing it manually without it. Semi-auto beats manual readily. There are still checks, like submitting the plate number to grab the details of the individuals involved; if the names, vehicle type, etc. don't match, that automatically flags that something's off.
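A minimal sketch of that kind of cross-check, assuming a hypothetical in-memory `registry` keyed by plate number stands in for whatever lookup service is actually queried. The `ExtractedReport` schema and the field names are illustrative, not taken from the project described above.

```python
from dataclasses import dataclass


@dataclass
class ExtractedReport:
    # Fields an LLM might pull from a police report (hypothetical schema).
    plate: str
    owner_name: str
    vehicle_type: str


def _norm(text: str) -> str:
    # Lowercase and collapse whitespace so trivial differences don't flag.
    return " ".join(text.lower().split())


def _norm_plate(plate: str) -> str:
    # Keep only letters and digits, uppercased: "abc-1234" -> "ABC1234".
    return "".join(ch for ch in plate.upper() if ch.isalnum())


def cross_check(report: ExtractedReport, registry: dict) -> list:
    """Return a list of flags; an empty list means nothing looked off."""
    record = registry.get(_norm_plate(report.plate))
    if record is None:
        return ["plate not found"]
    flags = []
    for field in ("owner_name", "vehicle_type"):
        if _norm(getattr(report, field)) != _norm(record[field]):
            flags.append(f"{field} mismatch")
    return flags
```

Only flagged reports would need a human to open the source document; exact string comparison is a simplification, and a real check would likely need fuzzy name matching.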

f_k•1mo ago

I'm working on this exact problem with https://citellm.com .

Every extracted field comes with a precise citation back to the source document (page + snippet + bounding box + confidence score) so reviewers can verify where each value came from.

Hallucinations get flagged automatically because there's no supporting text in the source.

The goal is to make HITL fast and not have reviewers read through the whole document.
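The grounding idea can be sketched roughly as follows, assuming each extracted field arrives with a quoted snippet and the plain text of the cited page. The function name `is_grounded` and its parameters are hypothetical and are not citellm's API; bounding boxes and confidence scores are left out.

```python
import re


def _norm(text: str) -> str:
    # Collapse whitespace and lowercase so line breaks in the PDF text
    # don't cause false mismatches.
    return re.sub(r"\s+", " ", text).strip().lower()


def is_grounded(value: str, snippet: str, page_text: str) -> bool:
    """True only if the snippet occurs on the page and contains the value."""
    n_value, n_snippet = _norm(value), _norm(snippet)
    if not n_value or not n_snippet:
        return False
    return n_snippet in _norm(page_text) and n_value in n_snippet
```

A field that fails this check has no supporting text on the cited page, so it can be routed to a reviewer instead of being written downstream. Substring matching is the simplest possible test and would miss values that are reformatted during extraction (for example, a date rewritten in another format).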

ensemblehq•1mo ago

Have you tried using non-LLM based methods? Like starting with something rules-based and working through a layered multi-model setup?

That’s what we’ve been using for document extraction where precision matters (capital markets documents, medical assessments). We had a go at pure LLM with medical documents, but the output was poor and it felt like it would take substantial investment to create something more robust.
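One way to picture a rules-first layered setup, as a sketch only: a deterministic rule runs first, and a model is consulted only when the rule finds nothing, with that result marked for review. The date field, the `fallback` callable, and the result keys are all assumptions for illustration, not the pipeline described above.

```python
import re
from typing import Callable, Optional

# Rule layer: ISO-style dates such as 2024-03-01.
DATE_RE = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


def extract_date_rules(text: str) -> Optional[str]:
    match = DATE_RE.search(text)
    return match.group(0) if match else None


def extract_date(text: str, fallback: Callable[[str], Optional[str]]) -> dict:
    """Try the rule first; only call the (hypothetical) model fallback on a miss."""
    value = extract_date_rules(text)
    if value is not None:
        return {"value": value, "source": "rules", "needs_review": False}
    # Anything the model layer produces is treated as unverified.
    return {"value": fallback(text), "source": "model", "needs_review": True}
```

Here `fallback` could wrap an LLM call or a second classical model; keeping it behind a plain callable means the rule layer can be tested without any model at all.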
