
I built an API to stop manual data entry from invoices and resumes

2•scannyai•1mo ago
Hi HN,

I’m the founder of Scanny AI (https://scanny-ai.com/).

I built this because I noticed that despite all the advancements in AI, businesses are still hiring people to manually copy-paste data from PDFs to Excel. Standard OCR tools often just give you a "blob of text" that still requires manual cleanup.

What it does: Scanny AI takes unstructured documents (Invoices, Resumes, IDs, Receipts) and extracts specific data points into structured formats (JSON, CSV, Excel).

How it works: Unlike regex-based parsers or standard OCR, we use context-aware models to understand the document layout. This means it can identify a "Total Amount" on an invoice even if the layout changes, or extract "Implied Skills" from a CV that aren't explicitly listed as keywords.
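To make the layout problem concrete, here is a minimal sketch of how a regex-based parser breaks when the layout changes. The pattern and both invoice snippets are made up for illustration and are not taken from Scanny AI.

```python
import re
from typing import Optional

# Hypothetical pattern written against one vendor's layout:
# the label and the value sit on the same line.
TOTAL_PATTERN = re.compile(r"Total Amount:\s*\$([\d,]+\.\d{2})")


def regex_total(text: str) -> Optional[str]:
    """Return the total as a string, or None if the layout does not match."""
    match = TOTAL_PATTERN.search(text)
    return match.group(1) if match else None


# Made-up OCR output from two layouts that carry the same information.
layout_a = "Subtotal: $90.00\nTax: $10.00\nTotal Amount: $100.00"
layout_b = "Subtotal 90.00\nTax 10.00\nAMOUNT DUE (USD)\n100.00"

print(regex_total(layout_a))  # 100.00
print(regex_total(layout_b))  # None
```

The second layout holds the same total, but the label wording and line break differ, so the pattern finds nothing. Each new vendor layout would need its own pattern.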

Current Use Cases:

Invoices: Extracting line items, tax, and vendor details.

Resumes: Parsing experience and skills for HR.

IDs: Extracting PII for KYC checks.
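As a rough illustration of what "structured" means for the invoice case, here is a sketch of one possible record shape and its JSON and CSV forms. The field names and sample values are assumptions made up for this example; they are not Scanny AI's actual output schema.

```python
import csv
import io
import json
from dataclasses import asdict, dataclass, field
from typing import List


@dataclass
class LineItem:
    description: str
    quantity: int
    unit_price: float


@dataclass
class Invoice:
    # Hypothetical fields covering vendor details, tax, and line items.
    vendor: str
    tax: float
    total: float
    line_items: List[LineItem] = field(default_factory=list)


def to_json(invoice: Invoice) -> str:
    """Serialize the whole invoice, nested line items included."""
    return json.dumps(asdict(invoice), indent=2)


def line_items_to_csv(invoice: Invoice) -> str:
    """Flatten line items to one CSV row each, repeating the vendor."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    writer.writerow(["vendor", "description", "quantity", "unit_price"])
    for item in invoice.line_items:
        writer.writerow(
            [invoice.vendor, item.description, item.quantity, item.unit_price]
        )
    return buf.getvalue()


invoice = Invoice(
    vendor="Acme Supplies",
    tax=10.0,
    total=110.0,
    line_items=[LineItem("Paper, A4", 4, 25.0)],
)
print(to_json(invoice))
print(line_items_to_csv(invoice))
```

JSON keeps the nesting, while CSV needs a flattening decision (here, one row per line item), which is why the two formats are handled separately.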

We are currently in Early Access and I’m looking for feedback on the extraction accuracy and the API usability.

I’ve enabled Free Credits for new sign-ups so you can test it on your own documents without paying.

I’d love to hear your thoughts on the edge cases (messy handwriting, weird layouts, etc.) and what features you’d like to see next.

Link: https://scanny-ai.com/

Thanks!

Comments

fuzzy_lumpkins•1mo ago
Definitely going to pass this on to a couple of friends who were just talking about vendor/sales data issues this past week.
scannyai•4w ago
Thanks a lot for the support. I'd be happy to help them and offer some free credits to try it.
jaredsohn•1mo ago
Why not just use a standard LLM prompt?
scannyai•4w ago
You absolutely can for prototypes, but at production scale, you'll hit major issues with cost, latency, and random JSON formatting errors. We handle the heavy lifting—optimizing the vision pipeline and enforcing strict schemas—so you don't have to build and maintain the glue code around the model yourself.
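The kind of glue code meant here can be sketched roughly as follows: validate the model's raw output against a strict schema and retry on failure. This is a generic illustration, not Scanny AI's implementation; `INVOICE_SCHEMA`, `validate`, `extract_with_retry`, and the stubbed model responses are all hypothetical.

```python
import json
from typing import Any, Callable, Dict

# Hypothetical schema: required field name -> expected Python type.
INVOICE_SCHEMA: Dict[str, type] = {"vendor": str, "total": float, "tax": float}


def validate(raw: str, schema: Dict[str, type]) -> Dict[str, Any]:
    """Parse raw model output; raise ValueError on any schema violation."""
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        raise ValueError(f"not valid JSON: {exc}") from exc
    if not isinstance(data, dict):
        raise ValueError("expected a JSON object")
    if set(data) != set(schema):
        raise ValueError(f"expected keys {sorted(schema)}, got {sorted(data)}")
    for key, expected in schema.items():
        value = data[key]
        # Accept ints where floats are expected, but never bools.
        if expected is float and isinstance(value, int) and not isinstance(value, bool):
            data[key] = float(value)
        elif not isinstance(value, expected):
            raise ValueError(f"{key!r} should be {expected.__name__}")
    return data


def extract_with_retry(
    call_model: Callable[[], str], schema: Dict[str, type], attempts: int = 3
) -> Dict[str, Any]:
    """Call the model until its output passes validation or attempts run out."""
    last_error = None
    for _ in range(attempts):
        try:
            return validate(call_model(), schema)
        except ValueError as exc:
            last_error = exc
    raise RuntimeError(f"no valid output after {attempts} attempts") from last_error


# Stub standing in for a real model call: first reply is malformed, second is valid.
responses = iter([
    'Sure! Here is the JSON: {"vendor": "Acme"',
    '{"vendor": "Acme Supplies", "total": 110, "tax": 10.0}',
])
result = extract_with_retry(lambda: next(responses), INVOICE_SCHEMA)
print(result)
```

Every retry is another model call, which is where the cost and latency concerns come from at production scale.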
