frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast360 – A web tool to benchmark open-source OCR models side-by-side

https://fast360.xyz
4•yanaimngvov•5mo ago
Hey HN,

Like many of you, I've been building RAG pipelines recently, and constantly hit a wall at the very first step: getting clean, structured Markdown from PDFs.

I found myself in a loop of "environment hell"—spinning up different Conda environments to test Marker, then PP-StructureV3, then MinerU, just to see which one worked best for a specific paper or financial report. It was a massive time sink. Static leaderboards weren't much help, because they don't tell you how a model will perform on your specific, messy document.

So, I built the tool I wished I had. It's a simple web utility that I call an "OCR Arena."

You can try it here: https://fast360.xyz

The idea is simple: upload a document, select from a lineup of 7 leading open-source models, and it runs them all in parallel, showing you the results side-by-side. The goal is to get you from "which parser should I use?" to having the best possible Markdown in under a minute.

It's completely free, and I made sure there's no login/signup required so you can try it with zero friction. Here’s a quick GIF of the workflow:

https://github.com/shijincai/fast360/blob/main/nologin.gif

The tech stack is a pretty standard setup: Next.js/React on the frontend, a Node.js/Express backend acting as a BFF, and a Python service that manages the model execution via a Redis/BullMQ queue.

This is a web service, not an open-source project, but I've set up a public GitHub repo to act as an information hub, a place to track community feedback, and to share more about the tech. You can find that here:

GitHub: https://github.com/shijincai/fast360

I built this to solve my own problem, but I'm hoping it might be useful to some of you as well. I'll be here all day to answer any questions and listen to your thoughts.

Comments

yanaimngvov•5mo ago
One of the most fascinating (and challenging) parts of building this was seeing just how wildly different the "best" model can be depending on the document type.

For example, during testing, I found that Marker is an absolute champion for clean, single-column layouts like blog posts. But throw a dense, multi-column academic paper at it, and MinerU often produces a far superior, structured output with proper LaTeX. Then, for a complex invoice table, PP-StructureV3 frequently beats both of them.

This really solidified my belief that a "one-size-fits-all" parser is a myth. The future seems to be less about finding a single perfect model and more about building a quick, effective workflow for selecting the right specialist for the job. It's a classic "routing" problem, and this tool is my attempt at solving the first step of that puzzle.

Show HN: SafeClaw – a way to manage multiple Claude Code instances in containers

https://github.com/ykdojo/safeclaw
1•ykdojo•43s ago•0 comments

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

https://huggingface.co/blog/huggingface/one-year-since-the-deepseek-moment-blog-3
1•gmays•1m ago•0 comments

The Evolution of the Interface

https://www.asktog.com/columns/038MacUITrends.html
1•dhruv3006•2m ago•0 comments

Azure: Virtual network routing appliance overview

https://learn.microsoft.com/en-us/azure/virtual-network/virtual-network-routing-appliance-overview
1•mariuz•3m ago•0 comments

Seedance2 – multi-shot AI video generation

https://www.genstory.app/story-template/seedance2-ai-story-generator
1•RyanMu•6m ago•1 comments

Πfs – The Data-Free Filesystem

https://github.com/philipl/pifs
1•ravenical•9m ago•0 comments

Go-busybox: A sandboxable port of busybox for AI agents

https://github.com/rcarmo/go-busybox
2•rcarmo•10m ago•0 comments

Quantization-Aware Distillation for NVFP4 Inference Accuracy Recovery [pdf]

https://research.nvidia.com/labs/nemotron/files/NVFP4-QAD-Report.pdf
1•gmays•11m ago•0 comments

xAI Merger Poses Bigger Threat to OpenAI, Anthropic

https://www.bloomberg.com/news/newsletters/2026-02-03/musk-s-xai-merger-poses-bigger-threat-to-op...
1•andsoitis•11m ago•0 comments

Atlas Airborne (Boston Dynamics and RAI Institute) [video]

https://www.youtube.com/watch?v=UNorxwlZlFk
1•lysace•12m ago•0 comments

Zen Tools

http://postmake.io/zen-list
1•Malfunction92•14m ago•0 comments

Is the Detachment in the Room? – Agents, Cruelty, and Empathy

https://hailey.at/posts/3mear2n7v3k2r
1•carnevalem•15m ago•0 comments

The purpose of Continuous Integration is to fail

https://blog.nix-ci.com/post/2026-02-05_the-purpose-of-ci-is-to-fail
1•zdw•17m ago•0 comments

Apfelstrudel: Live coding music environment with AI agent chat

https://github.com/rcarmo/apfelstrudel
1•rcarmo•18m ago•0 comments

What Is Stoicism?

https://stoacentral.com/guides/what-is-stoicism
3•0xmattf•18m ago•0 comments

What happens when a neighborhood is built around a farm

https://grist.org/cities/what-happens-when-a-neighborhood-is-built-around-a-farm/
1•Brajeshwar•19m ago•0 comments

Every major galaxy is speeding away from the Milky Way, except one

https://www.livescience.com/space/cosmology/every-major-galaxy-is-speeding-away-from-the-milky-wa...
2•Brajeshwar•19m ago•0 comments

Extreme Inequality Presages the Revolt Against It

https://www.noemamag.com/extreme-inequality-presages-the-revolt-against-it/
2•Brajeshwar•19m ago•0 comments

There's no such thing as "tech" (Ten years later)

1•dtjb•20m ago•0 comments

What Really Killed Flash Player: A Six-Year Campaign of Deliberate Platform Work

https://medium.com/@aglaforge/what-really-killed-flash-player-a-six-year-campaign-of-deliberate-p...
1•jbegley•20m ago•0 comments

Ask HN: Anyone orchestrating multiple AI coding agents in parallel?

1•buildingwdavid•22m ago•0 comments

Show HN: Knowledge-Bank

https://github.com/gabrywu-public/knowledge-bank
1•gabrywu•27m ago•0 comments

Show HN: The Codeverse Hub Linux

https://github.com/TheCodeVerseHub/CodeVerseLinuxDistro
3•sinisterMage•28m ago•2 comments

Take a trip to Japan's Dododo Land, the most irritating place on Earth

https://soranews24.com/2026/02/07/take-a-trip-to-japans-dododo-land-the-most-irritating-place-on-...
2•zdw•28m ago•0 comments

British drivers over 70 to face eye tests every three years

https://www.bbc.com/news/articles/c205nxy0p31o
40•bookofjoe•29m ago•13 comments

BookTalk: A Reading Companion That Captures Your Voice

https://github.com/bramses/BookTalk
1•_bramses•30m ago•0 comments

Is AI "good" yet? – tracking HN's sentiment on AI coding

https://www.is-ai-good-yet.com/#home
3•ilyaizen•30m ago•1 comments

Show HN: Amdb – Tree-sitter based memory for AI agents (Rust)

https://github.com/BETAER-08/amdb
1•try_betaer•31m ago•0 comments

OpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
2•anhxuan•31m ago•0 comments

Show HN: Seedance 2.0 Release

https://seedancy2.com/
2•funnycoding•32m ago•0 comments