frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Python Only Has One Real Competitor

https://mccue.dev/pages/2-6-26-python-competitor
1•dragandj•1m ago•0 comments

Tmux to Zellij (and Back)

https://www.mauriciopoppe.com/notes/tmux-to-zellij/
1•maurizzzio•2m ago•1 comments

Ask HN: How are you using specialized agents to accelerate your work?

1•otterley•3m ago•0 comments

Passing user_id through 6 services? OTel Baggage fixes this

https://signoz.io/blog/otel-baggage/
1•pranay01•4m ago•0 comments

DavMail Pop/IMAP/SMTP/Caldav/Carddav/LDAP Exchange Gateway

https://davmail.sourceforge.net/
1•todsacerdoti•4m ago•0 comments

Visual data modelling in the browser (open source)

https://github.com/sqlmodel/sqlmodel
1•Sean766•6m ago•0 comments

Show HN: Tharos – CLI to find and autofix security bugs using local LLMs

https://github.com/chinonsochikelue/tharos
1•fluantix•7m ago•0 comments

Oddly Simple GUI Programs

https://simonsafar.com/2024/win32_lights/
1•MaximilianEmel•7m ago•0 comments

The New Playbook for Leaders [pdf]

https://www.ibli.com/IBLI%20OnePagers%20The%20Plays%20Summarized.pdf
1•mooreds•8m ago•0 comments

Interactive Unboxing of J Dilla's Donuts

https://donuts20.vercel.app
1•sngahane•9m ago•0 comments

OneCourt helps blind and low-vision fans to track Super Bowl live

https://www.dezeen.com/2026/02/06/onecourt-tactile-device-super-bowl-blind-low-vision-fans/
1•gaws•11m ago•0 comments

Rudolf Vrba

https://en.wikipedia.org/wiki/Rudolf_Vrba
1•mooreds•11m ago•0 comments

Autism Incidence in Girls and Boys May Be Nearly Equal, Study Suggests

https://www.medpagetoday.com/neurology/autism/119747
1•paulpauper•12m ago•0 comments

Wellness Hotels Discovery Application

https://aurio.place/
1•cherrylinedev•13m ago•1 comments

NASA delays moon rocket launch by a month after fuel leaks during test

https://www.theguardian.com/science/2026/feb/03/nasa-delays-moon-rocket-launch-month-fuel-leaks-a...
1•mooreds•13m ago•0 comments

Sebastian Galiani on the Marginal Revolution

https://marginalrevolution.com/marginalrevolution/2026/02/sebastian-galiani-on-the-marginal-revol...
2•paulpauper•17m ago•0 comments

Ask HN: Are we at the point where software can improve itself?

1•ManuelKiessling•17m ago•0 comments

Binance Gives Trump Family's Crypto Firm a Leg Up

https://www.nytimes.com/2026/02/07/business/binance-trump-crypto.html
1•paulpauper•17m ago•0 comments

Reverse engineering Chinese 'shit-program' for absolute glory: R/ClaudeCode

https://old.reddit.com/r/ClaudeCode/comments/1qy5l0n/reverse_engineering_chinese_shitprogram_for/
1•edward•17m ago•0 comments

Indian Culture

https://indianculture.gov.in/
1•saikatsg•20m ago•0 comments

Show HN: Maravel-Framework 10.61 prevents circular dependency

https://marius-ciclistu.medium.com/maravel-framework-10-61-0-prevents-circular-dependency-cdb5d25...
1•marius-ciclistu•20m ago•0 comments

The age of a treacherous, falling dollar

https://www.economist.com/leaders/2026/02/05/the-age-of-a-treacherous-falling-dollar
2•stopbulying•20m ago•0 comments

Ask HN: AI Generated Diagrams

1•voidhorse•23m ago•0 comments

Microsoft Account bugs locked me out of Notepad – are Thin Clients ruining PCs?

https://www.windowscentral.com/microsoft/windows-11/windows-locked-me-out-of-notepad-is-the-thin-...
5•josephcsible•23m ago•1 comments

Show HN: A delightful Mac app to vibe code beautiful iOS apps

https://milq.ai/hacker-news
6•jdjuwadi•26m ago•1 comments

Show HN: Gemini Station – A local Chrome extension to organize AI chats

https://github.com/rajeshkumarblr/gemini_station
1•rajeshkumar_dev•27m ago•0 comments

Welfare states build financial markets through social policy design

https://theloop.ecpr.eu/its-not-finance-its-your-pensions/
2•kome•30m ago•0 comments

Market orientation and national homicide rates

https://onlinelibrary.wiley.com/doi/10.1111/1745-9125.70023
4•PaulHoule•31m ago•0 comments

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

https://www.cbsnews.com/news/california-death-cap-mushrooms-poisonings-liver-transplants/
1•rolph•31m ago•0 comments

Matthew Shulman, co-creator of Intellisense, died 2019 March 22

https://www.capenews.net/falmouth/obituaries/matthew-a-shulman/article_33af6330-4f52-5f69-a9ff-58...
3•canucker2016•32m ago•1 comments
Open in hackernews

Show HN: ClearDoc – Extract fields from any document using OCR and LLM

http://cleardoc.v5ent.com/
1•Mignet•6mo ago
Hi HN!

I recently launched a prototype of *ClearDoc*, an AI-powered tool to extract structured data from unstructured documents like invoices, bills of lading, certificates, etc.

It uses *OCR (PaddleOCR)* and *LLMs* to detect and align key fields — even for complex documents with tables, nested fields, or in different languages.

It doesn't require templates and can be *self-hosted* (demo runs on my own GPU).

Live demo (no sign-up): http://cleardoc.v5ent.com/ Demo video: https://www.youtube.com/watch?v=u83T6iewfNs

Right now: - Fields are auto-aligned visually on the document - Works with PDFs, images, scans - No custom field design/editing in the demo yet

Would love feedback on: - Which use cases matter most to you? - What would make this valuable enough to adopt?

Thanks!

Comments

Mignet•6mo ago
pls feel free to report any issue
Mignet•6mo ago
*Building an AI-Powered Document Understanding Tool – Feedback Welcome*

Hi HN!

I'm working on a tool called *ClearDoc*, which uses AI to extract structured data from unstructured documents like invoices, bills of lading, and certificates. The biggest challenge we've faced so far is accurately extracting data from complex documents, especially those with tables and nested fields.

### What I’m Looking to Discuss: - How do you approach extracting data from complex documents like invoices or contracts? - If you’ve worked with OCR or document processing tools, what have been your biggest challenges?

We’ve built a demo that uses PaddleOCR and LLMs to extract and align data. I’d love to get your thoughts on how we could improve the accuracy of data extraction, or whether you think a no-template approach is valuable.

If you’re interested, feel free to try out the demo (no sign-up required) and let me know your thoughts!

[ClearDoc Demo](https://cleardoc.v5ent.com/)

Looking forward to your feedback!

#AI #OCR #MachineLearning #DocumentProcessing

Mignet•6mo ago
hi HN! — just pushed a new update to *ClearDoc*, my AI tool to extract *structured data from unstructured documents* (like invoices, logistics forms, certificates, etc.)

---

### What’s New:

*HTTPS Enabled:* The live demo is now secure at [https://cleardoc.v5ent.com](https://cleardoc.v5ent.com), so no more browser warnings.

*Improved Homepage Messaging:* Based on user feedback, the homepage now has a much clearer value proposition and simplified CTA. For example, “Reasoning Output” is now simply “View Extracted Data.”

*Performance Tweaks:* Faster processing, better alignment, and cleaner output.

*Coming soon: Confidence Scores + Feedback Loop* So users will see which extracted fields the AI is “most sure” about — and be able to correct any errors to improve future results.

---

### What is ClearDoc?

ClearDoc helps you *turn messy PDFs/images into clean JSON* — without templates, without fine-tuning, and fully self-hostable.

It combines: - OCR (PaddleOCR) - LLM (OpenAI-compatible) - Field alignment + visual overlays - JSON Schema output (customizable)

Demo: https://cleardoc.v5ent.com Video: https://www.youtube.com/watch?v=u83T6iewfNs

---

### Who is this for?

- Developers building document-based tools - Finance / accounting teams who copy-paste data - Logistics / trade teams processing paperwork - Anyone who hates manually parsing PDFs

---

### I'm looking for:

1. Early users with real docs they want to process 2. Edge cases you'd like to see it handle 3. Feedback on the extraction quality / experience

I’d love to hear what you think — or help if you're facing similar problems.

Thanks — Charles