frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I built an API to stop manual data entry from invoices and resumes

2•scannyai•1mo ago
Hi HN,

I’m the founder of Scanny AI (https://scanny-ai.com/).

I built this because I noticed that despite all the advancements in AI, businesses are still hiring people to manually copy-paste data from PDFs to Excel. Standard OCR tools often just give you a "blob of text" that still requires manual cleanup.

What it does: Scanny AI takes unstructured documents (Invoices, Resumes, IDs, Receipts) and extracts specific data points into structured formats (JSON, CSV, Excel).

How it works: Unlike regex-based parsers or standard OCR, we use context-aware models to understand the document layout. This means it can identify a "Total Amount" on an invoice even if the layout changes, or extract "Implied Skills" from a CV that aren't explicitly listed as keywords.

Current Use Cases:

Invoices: Extracting line items, tax, and vendor details.

Resumes: Parsing experience and skills for HR.

IDs: extracting PII for KYC checks.

We are currently in Early Access and I’m looking for feedback on the extraction accuracy and the API usability.

I’ve enabled Free Credits for new sign-ups so you can test it on your own documents without paying.

I’d love to hear your thoughts on the edge cases (messy handwriting, weird layouts, etc.) and what features you’d like to see next.

Link: https://scanny-ai.com/

Thanks!

Comments

fuzzy_lumpkins•1mo ago
definitely going to pass this on to a couple friends who were just talking about vendor/sales data issues this past week.
scannyai•4w ago
Thanks a lot for the support, I'd be happy to support them and offer some free credits to try it.
jaredsohn•1mo ago
Why not just use a standard LLM prompt?
scannyai•4w ago
You absolutely can for prototypes, but at production scale, you'll hit major issues with cost, latency, and random JSON formatting errors. We handle the heavy lifting—optimizing the vision pipeline and enforcing strict schemas—so you don't have to build and maintain the glue code around the model yourself.

A delightful Mac app to vibe code beautiful iOS apps

https://milq.ai/hacker-news
1•jdjuwadi•2m ago•1 comments

Show HN: Gemini Station – A local Chrome extension to organize AI chats

https://github.com/rajeshkumarblr/gemini_station
1•rajeshkumar_dev•2m ago•0 comments

Welfare states build financial markets through social policy design

https://theloop.ecpr.eu/its-not-finance-its-your-pensions/
2•kome•6m ago•0 comments

Market orientation and national homicide rates

https://onlinelibrary.wiley.com/doi/10.1111/1745-9125.70023
3•PaulHoule•6m ago•0 comments

California urges people avoid wild mushrooms after 4 deaths, 3 liver transplants

https://www.cbsnews.com/news/california-death-cap-mushrooms-poisonings-liver-transplants/
1•rolph•7m ago•0 comments

Matthew Shulman, co-creator of Intellisense, died 2019 March 22

https://www.capenews.net/falmouth/obituaries/matthew-a-shulman/article_33af6330-4f52-5f69-a9ff-58...
3•canucker2016•8m ago•1 comments

Show HN: SuperLocalMemory – AI memory that stays on your machine, forever free

https://github.com/varun369/SuperLocalMemoryV2
1•varunpratap369•9m ago•0 comments

Show HN: Pyrig – One command to set up a production-ready Python project

https://github.com/Winipedia/pyrig
1•Winipedia•11m ago•0 comments

Fast Response or Silence: Conversation Persistence in an AI-Agent Social Network [pdf]

https://github.com/AysajanE/moltbook-persistence/blob/main/paper/main.pdf
1•EagleEdge•11m ago•0 comments

C and C++ dependencies: don't dream it, be it

https://nibblestew.blogspot.com/2026/02/c-and-c-dependencies-dont-dream-it-be-it.html
1•ingve•11m ago•0 comments

Show HN: Vbuckets – Infinite virtual S3 buckets

https://github.com/danthegoodman1/vbuckets
1•dangoodmanUT•12m ago•0 comments

Open Molten Claw: Post-Eval as a Service

https://idiallo.com/blog/open-molten-claw
1•watchful_moose•12m ago•0 comments

New York Budget Bill Mandates File Scans for 3D Printers

https://reclaimthenet.org/new-york-3d-printer-law-mandates-firearm-file-blocking
2•bilsbie•13m ago•1 comments

The End of Software as a Business?

https://www.thatwastheweek.com/p/ai-is-growing-up-its-ceos-arent
1•kteare•14m ago•0 comments

Exploring 1,400 reusable skills for AI coding tools

https://ai-devkit.com/skills/
1•hoangnnguyen•15m ago•0 comments

Show HN: A unique twist on Tetris and block puzzle

https://playdropstack.com/
1•lastodyssey•18m ago•0 comments

The logs I never read

https://pydantic.dev/articles/the-logs-i-never-read
1•nojito•19m ago•0 comments

How to use AI with expressive writing without generating AI slop

https://idratherbewriting.com/blog/bakhtin-collapse-ai-expressive-writing
1•cnunciato•21m ago•0 comments

Show HN: LinkScope – Real-Time UART Analyzer Using ESP32-S3 and PC GUI

https://github.com/choihimchan/linkscope-bpu-uart-analyzer
1•octablock•21m ago•0 comments

Cppsp v1.4.5–custom pattern-driven, nested, namespace-scoped templates

https://github.com/user19870/cppsp
1•user19870•22m ago•1 comments

The next frontier in weight-loss drugs: one-time gene therapy

https://www.washingtonpost.com/health/2026/01/24/fractyl-glp1-gene-therapy/
2•bookofjoe•25m ago•1 comments

At Age 25, Wikipedia Refuses to Evolve

https://spectrum.ieee.org/wikipedia-at-25
2•asdefghyk•28m ago•4 comments

Show HN: ReviewReact – AI review responses inside Google Maps ($19/mo)

https://reviewreact.com
2•sara_builds•28m ago•1 comments

Why AlphaTensor Failed at 3x3 Matrix Multiplication: The Anchor Barrier

https://zenodo.org/records/18514533
1•DarenWatson•29m ago•0 comments

Ask HN: How much of your token use is fixing the bugs Claude Code causes?

1•laurex•33m ago•0 comments

Show HN: Agents – Sync MCP Configs Across Claude, Cursor, Codex Automatically

https://github.com/amtiYo/agents
1•amtiyo•34m ago•0 comments

Hello

2•otrebladih•35m ago•1 comments

FSD helped save my father's life during a heart attack

https://twitter.com/JJackBrandt/status/2019852423980875794
3•blacktulip•38m ago•0 comments

Show HN: Writtte – Draft and publish articles without reformatting, anywhere

https://writtte.xyz
1•lasgawe•40m ago•0 comments

Portuguese icon (FROM A CAN) makes a simple meal (Canned Fish Files) [video]

https://www.youtube.com/watch?v=e9FUdOfp8ME
1•zeristor•41m ago•0 comments