Show HN: Unsiloed AI – #1 on olmOCR-Bench

5•adnan9999•1h ago
Most document parsers fail on real-world challenges such as complex tables, handwritten documents, historical scans, equations, multi-column layouts, and complex reading order. We built Unsiloed Parser to handle exactly these cases.

Our latest parser, v3.1, ranked #1 on olmOCR-Bench with an 88.0 strict pass rate. We ran the evaluation across 1,403 PDFs and 8,413 unit tests using the unmodified upstream Allen AI scorer (olmocr==0.4.27) and found that Unsiloed beats 18 other OCR services, including GPT-5.5, Claude Opus 4.7, LlamaParse, Reducto, Azure Document Intelligence, AWS Textract, and Unstructured.

When we dug into the failure cases, we found that many were not OCR errors but differences such as \frac vs \dfrac, whitespace, or equivalent LaTeX renderings. We ran a secondary LLM-as-Judge evaluation to separate real misses from semantic equivalents, which lifts the corrected score to 94.8 (explained in detail in the blog post).
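
To illustrate the kind of cosmetic mismatch described above, here is a minimal sketch of a rule-based check that treats \dfrac/\tfrac as \frac and ignores whitespace differences before comparing two LaTeX strings. This is not the LLM-as-Judge evaluation from the post, and the function names `normalize_latex` and `is_cosmetic_mismatch` are hypothetical; it only shows why a strict string comparison can flag output that renders the same.

```python
import re


def normalize_latex(expr: str) -> str:
    """Collapse a few purely cosmetic LaTeX differences (illustrative only)."""
    # Map the display/text-style fraction variants onto plain \frac.
    expr = re.sub(r"\\[dt]frac(?![A-Za-z])", r"\\frac", expr)
    # Collapse runs of whitespace and trim the ends.
    expr = re.sub(r"\s+", " ", expr).strip()
    # Whitespace next to a brace does not change the rendered output.
    expr = re.sub(r"\s*([{}])\s*", r"\1", expr)
    return expr


def is_cosmetic_mismatch(expected: str, predicted: str) -> bool:
    """True when the strings differ only by the normalized details."""
    return expected != predicted and (
        normalize_latex(expected) == normalize_latex(predicted)
    )


if __name__ == "__main__":
    print(is_cosmetic_mismatch(r"\frac{a}{b}", r"\dfrac{a} {b}"))
    print(is_cosmetic_mismatch(r"\frac{a}{b}", r"\frac{a}{c}"))
```

A check like this only covers the simplest cases; equivalent renderings that use different macros or structure would still need a judge step of some kind.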

Blog with full methodology and examples: https://www.unsiloed.ai/blog/unsiloed-ai-achieves-1-rank-on-...

Evaluation Code for reproducibility: https://github.com/Unsiloed-AI/unsiloed-olmocr-benchmark

Feel free to post your messiest PDFs in the comments, and we'll run them through Unsiloed Parser and share the output here.

Comments

adnan9999•1h ago
Founder here. If you've got a notorious PDF you'd like us to try, please feel free to drop it in the comments. We'll run it and share the output here.
warthog•1h ago
The website has no self-serve sign-up.
adnan9999•56m ago
Yeah, we're not fully self-serve yet. Shoot me an email at adnan@unsiloed.ai with some info on the use case and I'll get you set up on the platform.
pshishodia•17m ago
Damn unreal that OCR was so unsolved until now

Show HN: Write your BPF programs in Go, not C

https://github.com/boratanrikulu/gobee
41•boratanrikulu•4d ago•21 comments

Show HN: OpenBrief – Local-first video downloader/summarizer

https://github.com/tantara/openbrief
7•tantara•1h ago•0 comments

Show HN: Nerve – self hosted runtime for AI agents

https://github.com/ClickHouse/nerve
2•animetyan•36m ago•2 comments

Show HN: Audiomass – a free, open-source multitrack audio editor for the web

https://audiomass.co/?multitrack=1
507•pantelisk•1d ago•110 comments

Show HN: Fungible – A local personal finance app in the terminal

https://github.com/tomfunk/fungible
2•tomfunk•1h ago•0 comments

Show HN: Unsiloed AI – #1 on olmOCR-Bench

5•adnan9999•1h ago•4 comments

Show HN: Geomatic – A command-driven geometry studio enabled with autodiff

https://www.tinyvolt.com/geomatic
63•nivter•14h ago•14 comments

Show HN: Built a tool to create brand-consistent images using AI

https://inktag.io
4•gsharma1•2h ago•0 comments

Show HN: Bae – AI companion built around persistent memory architecture

https://bae.ppl.studio
2•zeshutmax•2h ago•0 comments

Show HN: TryPost – open-source Social Media Scheduler

https://trypost.it/en
2•paulocastellano•2h ago•2 comments

Show HN: I made Pokémon but with real animals in the real world

https://apps.apple.com/gb/app/animalis-game/id6762081213
2•robert-whiteley•3h ago•0 comments

Show HN: Volt – front end tooling for Phoenix that runs inside the BEAM

https://github.com/elixir-volt/volt
15•dannote•9h ago•1 comments

Show HN: Cursed Browser – a VLM reads the HTML and hallucinates the page

https://github.com/scosman/cursed_browser
5•scosman•5h ago•1 comments

Show HN: Linear Chess – Normal Chess, on a 1D board

https://youbee.cloud/chess/chess.html
3•MarcellusDrum•5h ago•0 comments

Show HN: Anyone interested in a tool helps to explore C++ ASTs

https://uvic-aurora.github.io/acav-manual/index.html
47•leomicv•4d ago•3 comments

Show HN: Freenet, a peer-to-peer platform for decentralized apps

https://freenet.org/
383•sanity•4d ago•268 comments

Show HN: ShadowCat – file transfer through QR Codes in a Browser

https://github.com/unprovable/ShadowCat
164•unprovable•3d ago•62 comments

Show HN: Local note engine uses LLM to organize notes into a knowledge graph

https://github.com/AlexWasHeree/NoteCast
8•AlexWasHeree•1d ago•4 comments

Show HN: My homelab is outperforming the stock market

https://stocks.sjer.red
9•shepherdjerred•1d ago•1 comments

Show HN: The Front Page – Newspaper-style front page for Hacker News

https://thefrontpage.dev/
20•stagas•1d ago•8 comments

Show HN: Kanban CLI (A local-first, agent-first task manager for the terminal)

https://codeberg.org/hydrafog/kanban
15•hydra-f•1d ago•8 comments

Show HN: Rmux – A programmable terminal multiplexer with a Playwright-style SDK

https://github.com/helvesec/rmux
191•shideneyu•4d ago•94 comments

Show HN: Agent.email – sign up via curl, claim with a human OTP

96•adisingh13•4d ago•106 comments

Show HN: I Dedicated 4 Years to Mastering Offline Password Cracking

267•bojta-lepenye•4d ago•60 comments

Show HN: NanoApps: Run custom homebrew apps on iPod nano 7th generation

https://twitter.com/freemyipod/status/2058920520708468974
3•user890104•8h ago•0 comments

Show HN: Open-source .docx editor library for building document apps

https://github.com/eigenpal/docx-editor
106•thisisjedr•4d ago•16 comments

Show HN: Forge – Guardrails take an 8B model from 53% to 99% on agentic tasks

https://github.com/antoinezambelli/forge
685•zambelli•6d ago•251 comments

Show HN: I reverse engineered Apple's video wallpapers

https://github.com/kageroumado/phosphene
426•kageroumado•4d ago•106 comments

Show HN: SaveNeighbor – food delivery through your own personal network

https://www.saveneighbor.com
2•JJonesRatio•22h ago•5 comments

Show HN: MarketChacha – Reddit for traders with verified track records

https://marketchacha.com
4•rsingh867•10h ago•0 comments