frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Launch HN: Pulse (YC S24) – Production-grade unstructured document extraction

20•sidmanchkanti21•1h ago
Hi HN, we’re Sid and Ritvik, co-founders of Pulse. Pulse is a document extraction system to create LLM-ready text. We built Pulse as we realized that although modern vision language models are very good at producing plausible text, that makes them risky for OCR and data ingestion at scale.

When we started working on document extraction, we assumed the same thing many teams do today: foundation models were improving quickly, multi modal systems appeared to read documents well, and for small or clean inputs that assumption often held. The limitations showed up once we began processing real documents in volume. Long PDFs, dense tables, mixed layouts, low-fidelity scans, and financial or operational data exposed errors that were subtle, hard to detect, and expensive to correct. Outputs often looked reasonable while containing small but meaningful mistakes, especially in tables and numeric fields.

A lot of our work since then has been applied research. We run controlled evaluations on complex documents, fine tune vision models, and build labeled datasets where ground truth actually matters. There have been many nights where our team stayed up hand annotating pages, drawing bounding boxes around tables, labeling charts point by point, or debating whether a number was unreadable or simply poorly scanned. That process shaped our intuition far more than benchmarks alone.

One thing became clear quickly. The core challenge was not extraction itself, but confidence. Vision language models embed document images into high-dimensional representations optimized for semantic understanding rather than precise transcription. That process is inherently lossy. When uncertainty appears, models tend to resolve it using learned priors instead of surfacing ambiguity. This behavior can be helpful in consumer settings. In production pipelines, it creates verification problems that do not scale well.

Pulse grew out of trying to address this gap through system design rather than prompting alone. Instead of treating document understanding as a single generative step, the system separates layout analysis from language modeling. Documents are normalized into structured representations that preserve hierarchy and tables before schema mapping occurs. Extraction is constrained by schemas defined ahead of time, and extracted values are tied back to source locations so uncertainty can be inspected rather than guessed away. In practice, this results in a hybrid approach that combines traditional computer vision techniques, layout models, and vision language models, because no single approach handled these cases reliably on its own.

We are intentionally sharing a few documents that reflect the types of inputs that motivated this work. These are representative of cases where we saw generic OCR or VLM-based pipelines struggle.

Here is a financial 10K: https://platform.runpulse.com/dashboard/examples/example1

Here is a newspaper: https://platform.runpulse.com/dashboard/examples/example2

Here is a rent roll: https://platform.runpulse.com/dashboard/examples/example3

Pulse is not perfect, particularly on highly degraded scans or uncommon handwriting, and there is still room for improvement. The goal is not to eliminate errors entirely, but to make them visible, auditable, and easier to reason about.

Pulse is available via usage-based access to the API and platform You can try it here and access the API docs here.

Demo link here: https://video.runpulse.com/video/pulse-platform-walkthrough-...

We’re interested in hearing how others here evaluate correctness for document extraction, which failure modes you have seen in practice, and what signals you rely on to decide whether an output can be trusted. We will be around to answer questions and are happy to run additional documents if people want to share examples.

Comments

sidcool•1h ago
Congrats on launching. Seems very interesting.
asdev•39m ago
How is this different from Extend(Also YC)?
aryan1silver•38m ago
looks really cool, congrats on the launch! are you guys using something similar to docling[https://github.com/docling-project/docling]?
throw03172019•30m ago
Congrats on launch! We have been using this for a new feature we are building in our SaaS app. It’s results were better than Datalab from our tests, especially in the handwriting category.
mikert89•21m ago
AI models will do all this natively

Conductor: Context-driven development for Gemini CLI

https://developers.googleblog.com/conductor-introducing-context-driven-development-for-gemini-cli/
1•keithba•27s ago•0 comments

Chatbots inform young voters but don't change their vote choices

https://www.pnas.org/doi/10.1073/pnas.2515516122
1•zerolatitude•48s ago•0 comments

Making agentic government work: 7 principles for safer, smarter AI adoption

https://www.nextgov.com/ideas/2025/12/making-agentic-government-work-7-principles-safer-smarter-a...
1•WaitWaitWha•2m ago•0 comments

Toys with the highest play-time and lowest clean-up-time

https://joannabregan.substack.com/p/toys-with-the-highest-play-time-and
1•surprisetalk•3m ago•0 comments

There's no such thing as a fake feather [video]

https://www.youtube.com/watch?v=N5yV1Q9O6r4
1•surprisetalk•3m ago•0 comments

Remove Black Color with Shaders

https://yuanchuan.dev/remove-black-color-with-shaders
1•surprisetalk•3m ago•0 comments

I figured out how to stop making engines and start making games [video]

https://www.youtube.com/watch?v=3kzNiMMnT4U
2•surprisetalk•3m ago•0 comments

Asahi Linux Progress Linux 6.18

https://asahilinux.org/2025/12/progress-report-6-18/
1•birdculture•3m ago•0 comments

The Year in Physics

https://www.quantamagazine.org/the-year-in-physics-20251217/
1•ibobev•3m ago•0 comments

The Open Evaluation Standard: Benchmarking Nvidia Nemotron 3 Nano

https://huggingface.co/blog/nvidia/nemotron-3-nano-evaluation-recipe
1•ibobev•4m ago•0 comments

Military Standard on Software Control Levels

https://entropicthoughts.com/mil-std-882e-software-control
3•ibobev•5m ago•0 comments

Don't Worry About College Majors

https://thinkhuman.com/dont-worry-about-college-majors/
1•jamesgill•5m ago•0 comments

"Mother of All Demos" (1968)

https://www.youtube.com/watch?v=B6rKUf9DWRI
1•nothrowaways•5m ago•0 comments

Show HN: Turn your startup logo into a holiday Google doodle

https://doodle.logic.inc/
1•sgk284•6m ago•0 comments

Yann LeCun raising €500M at €3B valuation for new AI startup

https://sifted.eu/articles/yann-lecun-ami-labs-3bn-valuation
2•harscoat•7m ago•0 comments

Sadly, Fortnite will not return to iOS in Japan in 2025 as promised

https://twitter.com/TimSweeneyEpic/status/2001494517996732598
1•ChrisArchitect•8m ago•1 comments

Ask HN: How to fight back against Lovable, Replit, etc. in enterprise products

2•bears123•8m ago•0 comments

You can now verify Google AI-generated videos in the Gemini app

https://blog.google/technology/ai/verify-google-ai-videos-gemini-app/
1•meetpateltech•10m ago•1 comments

GitHub Actions Down

2•1qaboutecs•10m ago•0 comments

Agent Skills is now an open standard

https://claude.com/blog/organization-skills-and-directory
2•adocomplete•13m ago•0 comments

Good if make prior after data instead of before

https://dynomight.substack.com/p/prior
1•crescit_eundo•14m ago•0 comments

The View from Inside the AI Bubble

https://www.theatlantic.com/technology/2025/12/neurips-ai-bubble-agi/685250/
1•CharlesW•14m ago•0 comments

Ruby Weekly #780: What's New in Ruby 4.0

https://rubyweekly.com/issues/780
2•brandrick•15m ago•0 comments

Show HN: A better interface for base model LLMs

https://github.com/transkatgirl/Tapestry-Loom
2•transkatgirl•17m ago•1 comments

Show HN: Peachka – Protecting Videos from Stealing

https://peachka.net/
1•superdario•18m ago•0 comments

The Simple Essence of Monomorphization (Oopsla 2025) [video]

https://www.youtube.com/watch?v=xqoTzQ9b9S0
1•matt_d•19m ago•0 comments

Show HN: Git rewind – your Git year in review

https://gitrewind.dev/
3•thijser•21m ago•0 comments

Updates on the Weir Language

https://elijahpotter.dev/articles/updates-on-the-weir-language
1•chilipepperhott•22m ago•0 comments

We Found the Craziest Wave on Earth [video]

https://www.youtube.com/watch?v=UnnkjzxlNIw
2•blindriver•22m ago•0 comments

Paper2Video: Automatic Video Generation from Scientific Papers

https://showlab.github.io/Paper2Video/
1•bilsbie•22m ago•0 comments