frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I built an API to stop manual data entry from invoices and resumes

2•scannyai•2h ago
Hi HN,

I’m the founder of Scanny AI (https://scanny-ai.com/).

I built this because I noticed that despite all the advancements in AI, businesses are still hiring people to manually copy-paste data from PDFs to Excel. Standard OCR tools often just give you a "blob of text" that still requires manual cleanup.

What it does: Scanny AI takes unstructured documents (Invoices, Resumes, IDs, Receipts) and extracts specific data points into structured formats (JSON, CSV, Excel).

How it works: Unlike regex-based parsers or standard OCR, we use context-aware models to understand the document layout. This means it can identify a "Total Amount" on an invoice even if the layout changes, or extract "Implied Skills" from a CV that aren't explicitly listed as keywords.

Current Use Cases:

Invoices: Extracting line items, tax, and vendor details.

Resumes: Parsing experience and skills for HR.

IDs: extracting PII for KYC checks.

We are currently in Early Access and I’m looking for feedback on the extraction accuracy and the API usability.

I’ve enabled Free Credits for new sign-ups so you can test it on your own documents without paying.

I’d love to hear your thoughts on the edge cases (messy handwriting, weird layouts, etc.) and what features you’d like to see next.

Link: https://scanny-ai.com/

Thanks!

Comments

fuzzy_lumpkins•54m ago
definitely going to pass this on to a couple friends who were just talking about vendor/sales data issues this past week.
jaredsohn•51m ago
Why not just use a standard LLM prompt?

Playing with Turmites: better than crypto/rand?

https://blog.vrypan.net/2025/12/28/playing-with-turmites-better-than-crypto-rand/
2•vrypan•3m ago•1 comments

Apple retires 25 products, ends iconic iPhone SE era

https://news.az/news/apple-retires-25-products-ends-iconic-iphone-se-era
2•doener•3m ago•0 comments

Curator 'shocked' by Melbourne pitch performance in Ashes

https://www.rnz.co.nz/news/sport/582828/curator-shocked-by-melbourne-pitch-performance-in-ashes
2•tigerlily•10m ago•0 comments

ThinkTank, an "idea processor" that launched a religion (of outliners)

https://stonetools.ghost.io/thinktank-dos/
2•ChristopherDrum•17m ago•0 comments

Shields.io Uses the GitHub API

https://shields.io/blog/token-pool
2•angristan•18m ago•0 comments

Show HN: DeviceGPT – AI-powered Android device monitor with real data

1•teamzlab•18m ago•0 comments

The brain decides what to remember with sequential molecular timers

https://medicalxpress.com/news/2025-11-brain-reveals-sequentially-molecular-timers.html
1•PaulHoule•23m ago•0 comments

The Question Nobody Asks

https://aliveness.kunnas.com/articles/the-question-nobody-asks
1•ekns•23m ago•0 comments

Multi-Tenant SaaS's Wildcard TLS: An Overview of DNS-01 Challenges

https://www.skeptrune.com/posts/wildcard-tls-for-multi-tenant-systems/
1•skeptrune•25m ago•0 comments

Fast Cvvdp Implementation in C

https://github.com/halidecx/fcvvdp
2•todsacerdoti•27m ago•0 comments

The Sociology of the Crease

https://www.sebs.website/blog/the-sociology-of-the-crease
2•Incerto•31m ago•0 comments

Show HN: SecureNow – Security Fixes You Can Apply Today

https://www.securenow.dev
1•pelmenibenni•34m ago•0 comments

How to Complain

https://outerproduct.net/trivial/2024-03-25_complain.html
3•ysangkok•34m ago•1 comments

How to never make a bad decision

https://docs.google.com/document/d/1Ni9EOFM4-rADFT-cJHKUJUod8408zPk4zE-6IIHB4kU/edit?usp=sharing
1•PhilosophyForAI•38m ago•1 comments

Release age v1.3.0: post-quantum (and more)

https://github.com/FiloSottile/age/releases/tag/v1.3.0
1•birdculture•38m ago•0 comments

CEOs are hugely expensive. Why not automate them?

https://www.newstatesman.com/business/companies/2023/05/ceos-salaries-expensive-automate-robots
71•nis0s•40m ago•39 comments

Rethinking Tools in MCP

https://cra.mr/rethinking-the-definition-of-tools-in-mcp/
1•jshchnz•41m ago•0 comments

Spherical Cow

https://lib.rs/crates/spherical-cow
8•Natfan•46m ago•1 comments

Show HN: Golazo – Live soccer updates in your terminal

https://github.com/0xjuanma/golazo
2•rocajuanma•47m ago•0 comments

Slaughtering Competition Problems with Quantifier Elimination

https://grossack.site/2021/12/22/qe-competition.html
5•todsacerdoti•47m ago•0 comments

Airlines call in psychologists to stop passengers risking their lives for bags

https://www.telegraph.co.uk/business/2025/12/27/airlines-call-psychologists-passengers-risking-li...
2•elsewhen•48m ago•1 comments

62 years in the making: NYC's newest water tunnel nears the finish line

https://ny1.com/nyc/all-boroughs/news/2025/11/09/water--dep--tunnels-
22•eatonphil•52m ago•3 comments

Show HN: Upload a song and get a finished music video (no editing, no prompts)

https://musicvideogenerator.app/
2•hexadecimal•56m ago•1 comments

Halifax video game workers form first Ubisoft union in North America

https://www.cbc.ca/news/canada/nova-scotia/ubisoft-forms-first-union-north-america-halifax-9.7028674
3•cf100clunk•58m ago•0 comments

Two strangers. A terrorist bomb. An extraordinary tale of courage

https://bungalow-magazine.com/p/the-bench-2f5e
1•rmason•1h ago•0 comments

Show HN: Thingo

https://thingoboard.com
1•jryan49•1h ago•0 comments

As AI gobbles up chips, prices for devices may rise

https://www.npr.org/2025/12/28/nx-s1-5656190/ai-chips-memory-prices-ram
20•geox•1h ago•8 comments

Boost.MultiIndex Refactored

http://bannalia.blogspot.com/2025/12/boostmultiindex-refactored.html
1•ibobev•1h ago•0 comments

Mercury: The planet that shouldn't exist

https://www.bbc.com/future/article/20251223-mercury-the-planet-that-shouldnt-exist
1•1659447091•1h ago•0 comments

Why Your AI Characters Turn To Mush (and how I fixed it)

https://ghostintheweights.substack.com/p/why-your-ai-characters-turn-to-mush
2•llamataboot•1h ago•1 comments