frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Is anyone using LLM based document processing in production?

4•asdev•8h ago
I'm wondering if anyone is actually using LLMs to process documents reliably in production. One hallucination could lead to a host of issues. For example, if someone is using LLMs to process documents and enter data into an ERP, if even one number is off it could cause accounting issues, inventory issues etc. Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation.

Comments

cranberryturkey•8h ago
we're using it at SummaryForge
asdev•8h ago
in what context?
whinvik•7h ago
We are. But our usecase is more tolerant of failures so it's probably not as much of an issue.
asdev•6h ago
How do you remediate failures?
muzani•6h ago
I have a project with them, processing auto insurance claims. Mostly extracting details from police reports like license plate numbers, extracting details of the incident.

"Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation."

They're doing it manually without it. Semi-auto beats manual readily. There's still checks like submission of the number to grab the details of the individuals involved, and if the names, vehicle type, etc don't match, that automatically flags that something's off.

f_k•2h ago
I'm working on this exact problem with https://citellm.com .

Every extracted field comes with a precise citation back to the source document (page + snippet + bounding box + confidence score) so reviewers can verify where each value came from.

Hallucinations get flagged automatically because there's no supporting text in the source.

The goal is to make HITL fast and not have reviewers read through the whole document.

A Codebase by an Agent for an Agent

https://ampcode.com/by-an-agent-for-an-agent
1•emersonmacro•1m ago•0 comments

Making Google Sans Flex

https://design.google/library/google-sans-flex-font
1•meetpateltech•4m ago•0 comments

GitHub 95

https://github95.vercel.app
2•keepamovin•6m ago•0 comments

What the hyperproduction of AI slop is doing to science

https://theconversation.com/what-the-hyperproduction-of-ai-slop-is-doing-to-science-272250
4•billybuckwheat•11m ago•0 comments

Firefox UI revamp sparks complaints, searches for alternatives (2014)

https://www.computerworld.com/article/1514198/firefox-ui-revamp-sparks-complaints-searches-for-al...
1•1gn15•13m ago•0 comments

Why and How China Will Win AI: A Systems Understanding of China's AI Playbook

https://www.zackaryia.com/blog/2025-12-11/why-and-how-china-will-win-ai/
1•Zackaryia•19m ago•0 comments

RFC1087 Ethics and the Internet (1989)

https://www.ietf.org/rfc/rfc1087.txt
3•1vuio0pswjnm7•20m ago•1 comments

Worst Technology Flops of 2025

https://www.technologyreview.com/2025/12/18/1130106/the-8-worst-technology-flops-of-2025/
3•devonnull•24m ago•0 comments

Microsoft Updates Windows 'To Stop Users from Downloading Google Chrome'

https://www.forbes.com/sites/zakdoffman/2025/12/18/microsoft-updates-windows-to-stop-users-downlo...
1•72f988bf•24m ago•2 comments

Recent discoveries on the acquisition of the highest levels of human performance

https://www.science.org/doi/10.1126/science.adt7790
1•tchalla•24m ago•1 comments

Evaluating Chain-of-Thought Monitorability

https://openai.com/index/evaluating-chain-of-thought-monitorability/
2•mfiguiere•25m ago•0 comments

Reimplementing Unix Correct: The Lost Bayesian Spelling Corrector

https://learningloom.substack.com/p/reimplementing-unix-correct-the-lost
1•atomicnature•25m ago•0 comments

Code Coverage

https://keploy.io/blog/community/understanding-code-coverage-in-software-testing
1•sophielane•30m ago•0 comments

A quantum mystery that stumped scientists for decades is solved

https://www.sciencedaily.com/releases/2025/12/251217082509.htm
2•croes•36m ago•0 comments

2026 Apple introducing more ads to increase opportunity in search results

https://ads.apple.com/app-store/help/ad-placements/0082-search-results
10•punnerud•36m ago•5 comments

Gut microbe Turicibacter prevents weight gain

https://newatlas.com/diet-nutrition/weight-gain-gut-microbe/
2•thunderbong•37m ago•0 comments

Getting bitten by Intel's poor naming scenes

https://lorendb.dev/posts/getting-bitten-by-poor-naming-schemes/
13•LorenDB•39m ago•3 comments

Caro – a local offline shell companion for when you forget commands (alpha)

https://www.caro.sh/
1•kobi_kadosh•39m ago•1 comments

Linux Foundation Annual Report 2025

https://www.linuxfoundation.org/resources/publications/linux-foundation-annual-report-2025
2•lawrencejgd•42m ago•0 comments

Brown University Shooting Suspect Found Dead in New Hampshire

https://www.vanityfair.com/news/story/person-of-interest-arrested-in-brown-shooting
1•sampo•43m ago•1 comments

Why OpenAI’s Move to Skills Matters If You’re Shipping AI Agents

https://medium.com/@ohansemmanuel/why-openais-move-to-skills-matters-if-you-re-shipping-ai-agents...
2•ohans•45m ago•1 comments

Nob – Turn any terminal AI-powered (Open Source)

https://github.com/hetpatel-11/nob
1•hkpatel•45m ago•0 comments

Building ScrapeForge in public starting tomorrow

1•Vishwas-Batra•47m ago•0 comments

Meta Is Developing a New AI Image and Video Model Code-Named 'Mango'

https://www.wsj.com/tech/ai/meta-developing-new-ai-image-and-video-model-code-named-mango-16e785c7
1•fortran77•51m ago•1 comments

Blackbox open source agentic coding tool that lives in your terminal

https://github.com/blackboxaicode/cli
1•rizkrob•52m ago•1 comments

Administration Plans to Break Up Premier Weather and Climate Research Center

https://www.nytimes.com/2025/12/17/climate/national-center-for-atmospheric-research-trump.html
1•zekrioca•52m ago•0 comments

Mistakes marred Australian telco firewall upgrade, contributing to deaths

https://www.theregister.com/2025/12/19/optus_emergency_outages_cause_report/
3•defrost•1h ago•0 comments

CDC Spelling Error Dictionary

https://ftp.cdc.gov/pub/health_Statistics/nchs/Software/mmds/2009/spell/mmds_spell.txt
1•gregsadetsky•1h ago•0 comments

I Made UTM Triggered Popups

1•matanblay•1h ago•0 comments

Man suspected in shooting at Brown and nuclear physicist has been found dead

https://www.theguardian.com/us-news/2025/dec/18/suspect-brown-university-shooting
3•pogue•1h ago•0 comments