
Ask HN: Is anyone using LLM based document processing in production?

7•asdev•1mo ago
I'm wondering if anyone is actually using LLMs to process documents reliably in production. One hallucination could lead to a host of issues. For example, if someone is using LLMs to process documents and enter data into an ERP, if even one number is off it could cause accounting issues, inventory issues etc. Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation.

Comments

cranberryturkey•1mo ago
we're using it at SummaryForge
asdev•1mo ago
in what context?
cranberryturkey•1mo ago
Summarizing PDFs.
whinvik•1mo ago
We are. But our use case is more tolerant of failures, so it's probably not as much of an issue.
asdev•1mo ago
How do you remediate failures?
muzani•1mo ago
I have a project that uses them to process auto insurance claims, mostly extracting details from police reports, such as license plate numbers and details of the incident.

"Human in the loop doesn't help because the human would just have to read the document themselves to ensure accuracy, defeating the point of the automation."

Without it, they're doing the work manually, and semi-auto readily beats manual. There are still checks, like submitting the number to look up the details of the individuals involved; if the names, vehicle type, etc. don't match, that automatically flags that something's off.
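
A minimal sketch of that kind of cross-check, in Python. Everything here is an assumption for illustration: the `ExtractedClaim` schema, the in-memory `REGISTRY` standing in for the real lookup service, and the field names. It only shows the idea of comparing extracted values against an independent record and flagging mismatches.

```python
from dataclasses import dataclass


@dataclass
class ExtractedClaim:
    """Fields an LLM pulled out of a police report (hypothetical schema)."""
    plate: str
    owner_name: str
    vehicle_type: str


# Stand-in for the real plate lookup (assumption, not from the thread).
REGISTRY = {
    "ABC1234": {"owner_name": "Jane Doe", "vehicle_type": "sedan"},
}


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial differences don't flag."""
    return " ".join(value.lower().split())


def cross_check(claim: ExtractedClaim) -> list[str]:
    """Return the reasons to flag the claim; an empty list means it passed."""
    record = REGISTRY.get(claim.plate.replace(" ", "").upper())
    if record is None:
        return [f"plate {claim.plate!r} not found in registry"]
    flags = []
    for field in ("owner_name", "vehicle_type"):
        extracted = normalize(getattr(claim, field))
        expected = normalize(record[field])
        if extracted != expected:
            flags.append(f"{field}: extracted {extracted!r}, registry has {expected!r}")
    return flags
```

A claim with an empty flag list can skip manual reading; anything else goes to a person along with the specific mismatch, so the reviewer checks one field rather than the whole report.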

f_k•1mo ago
I'm working on this exact problem with https://citellm.com .

Every extracted field comes with a precise citation back to the source document (page + snippet + bounding box + confidence score) so reviewers can verify where each value came from.

Hallucinations get flagged automatically because there's no supporting text in the source.

The goal is to make HITL fast and not have reviewers read through the whole document.
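
A rough Python sketch of the citation idea, not the actual implementation behind the product. The `Citation` and `ExtractedField` types and the exact-substring matching rule are assumptions, and bounding boxes and confidence scores are left out. A field counts as grounded only if its quoted snippet appears on the cited page and contains the extracted value.

```python
import re
from dataclasses import dataclass


@dataclass
class Citation:
    """Hypothetical citation record: the page and the quoted snippet."""
    page: int
    snippet: str


@dataclass
class ExtractedField:
    name: str
    value: str
    citation: Citation


def _norm(text: str) -> str:
    """Lowercase and collapse whitespace so line wraps don't break matching."""
    return re.sub(r"\s+", " ", text).strip().lower()


def is_grounded(field: ExtractedField, pages: dict[int, str]) -> bool:
    """True if the snippet occurs on the cited page and contains the value."""
    page_text = _norm(pages.get(field.citation.page, ""))
    snippet = _norm(field.citation.snippet)
    return bool(snippet) and snippet in page_text and _norm(field.value) in snippet


def needs_review(fields: list[ExtractedField], pages: dict[int, str]) -> list[str]:
    """Names of fields whose citation cannot be found in the source text."""
    return [f.name for f in fields if not is_grounded(f, pages)]
```

Exact substring matching is deliberately strict. Real documents with OCR noise would probably need fuzzy matching, which trades some false flags for missed ones.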

ensemblehq•1mo ago
Have you tried using non-LLM based methods? Like starting with something rules-based and working through a layered multi-model setup?

That’s what we’ve been using for document extraction where precision is essential (capital markets documents, medical assessments). We had a go at a pure LLM approach with medical documents, but the output was poor and it felt like it would take substantial investment to create something more robust.
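
One way to picture a rules-first, layered setup, sketched in Python. The "Settlement date" rule, the `model_layer` callable and the source labels are all made up for illustration; the comment above does not describe the actual layers used.

```python
import re
from typing import Callable, Optional

# Hypothetical rule: an ISO-style date following a "Settlement date" label.
SETTLEMENT_DATE = re.compile(
    r"settlement date[:\s]+(\d{4}-\d{2}-\d{2})", re.IGNORECASE
)


def rule_layer(text: str) -> Optional[str]:
    """Deterministic first layer: return the date if the pattern matches."""
    match = SETTLEMENT_DATE.search(text)
    return match.group(1) if match else None


def extract(
    text: str, model_layer: Callable[[str], Optional[str]]
) -> tuple[Optional[str], str]:
    """Try the rule first and call the model only if the rule finds nothing.

    Returns (value, source) so that review can treat model output with
    more suspicion than rule output.
    """
    value = rule_layer(text)
    if value is not None:
        return value, "rule"
    value = model_layer(text)
    if value is not None:
        return value, "model"
    return None, "unresolved"
```

For example, `extract("Settlement Date: 2026-03-01", lambda t: None)` returns `("2026-03-01", "rule")` without ever calling the model layer.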