Show HN: Fast360 – A web tool to benchmark open-source OCR models side-by-side

https://fast360.xyz
2•yanaimngvov•1h ago
Hey HN,

Like many of you, I've been building RAG pipelines recently, and I kept hitting a wall at the very first step: getting clean, structured Markdown from PDFs.

I found myself stuck in a loop of "environment hell": spinning up separate Conda environments to test Marker, then PP-StructureV3, then MinerU, just to see which one worked best for a specific paper or financial report. It was a massive time sink. Static leaderboards weren't much help, because they don't tell you how a model will perform on your specific, messy document.

So, I built the tool I wished I had. It's a simple web utility that I call an "OCR Arena."

You can try it here: https://fast360.xyz

The idea is simple: upload a document, select from a lineup of 7 leading open-source models, and it runs them all in parallel, showing you the results side-by-side. The goal is to get you from "which parser should I use?" to having the best possible Markdown in under a minute.
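To make the "run them all in parallel" idea concrete, here is a minimal sketch of the fan-out pattern. It is not the actual Fast360 code: the parser functions (`fake_marker`, `fake_mineru`, `failing_parser`) and `run_all` are hypothetical stand-ins, and a real parser would call the model instead of returning a fixed string.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict

# Hypothetical stand-ins: each "parser" takes PDF bytes and returns Markdown.
def fake_marker(pdf: bytes) -> str:
    return "# marker output"

def fake_mineru(pdf: bytes) -> str:
    return "# mineru output"

def failing_parser(pdf: bytes) -> str:
    raise RuntimeError("model crashed")

def run_all(pdf: bytes, parsers: Dict[str, Callable[[bytes], str]]) -> Dict[str, str]:
    """Run every parser on the same document concurrently and collect the outputs."""
    results: Dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=max(1, len(parsers))) as pool:
        # Submit one task per parser so a slow model does not block the others.
        futures = {name: pool.submit(fn, pdf) for name, fn in parsers.items()}
        for name, future in futures.items():
            try:
                results[name] = future.result()
            except Exception as exc:
                # One failing model should not hide the other results.
                results[name] = f"ERROR: {exc}"
    return results

if __name__ == "__main__":
    outputs = run_all(b"%PDF-1.7", {"marker": fake_marker, "mineru": fake_mineru})
    for name, markdown in outputs.items():
        print(name, "->", markdown)
```

Catching exceptions per parser is a design choice made for this sketch: in a side-by-side view, one crashed model is more useful shown as an error column than as a failed request.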

It's completely free, and I made sure there's no login/signup required so you can try it with zero friction. Here’s a quick GIF of the workflow:

https://github.com/shijincai/fast360/blob/main/nologin.gif

The tech stack is a pretty standard setup: Next.js/React on the frontend, a Node.js/Express backend acting as a BFF (backend for frontend), and a Python service that manages model execution via a Redis/BullMQ queue.
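For readers unfamiliar with the queue-and-worker shape, here is a rough sketch of the pattern. It deliberately uses Python's in-memory `queue.Queue` as a stand-in for Redis/BullMQ so it runs without any services, and `Job`, `run_model`, `worker`, and `process` are hypothetical names, not the real service's API.

```python
import queue
import threading
from dataclasses import dataclass
from typing import Dict, List, Optional

@dataclass
class Job:
    job_id: str
    model: str       # which parser to run
    document: bytes  # the uploaded file

def run_model(model: str, document: bytes) -> str:
    # Hypothetical placeholder for invoking an OCR model.
    return f"[{model}] parsed {len(document)} bytes"

def worker(jobs: "queue.Queue[Optional[Job]]", results: Dict[str, str]) -> None:
    """Pull jobs off the queue until a None sentinel arrives."""
    while True:
        job = jobs.get()
        if job is None:
            break
        results[job.job_id] = run_model(job.model, job.document)

def process(job_list: List[Job]) -> Dict[str, str]:
    jobs: "queue.Queue[Optional[Job]]" = queue.Queue()
    results: Dict[str, str] = {}
    thread = threading.Thread(target=worker, args=(jobs, results))
    thread.start()
    for job in job_list:
        jobs.put(job)   # the web backend would enqueue here
    jobs.put(None)      # sentinel: no more work
    thread.join()
    return results

if __name__ == "__main__":
    print(process([Job("a", "marker", b"abc"), Job("b", "mineru", b"abcdef")]))
```

The point of the queue is decoupling: the web-facing backend only enqueues jobs and returns quickly, while slow model runs happen in a separate worker.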

This is a web service, not an open-source project, but I've set up a public GitHub repo to act as an information hub, a place to track community feedback, and a place to share more about the tech. You can find that here:

GitHub: https://github.com/shijincai/fast360

I built this to solve my own problem, but I'm hoping it might be useful to some of you as well. I'll be here all day to answer any questions and listen to your thoughts.

Comments

yanaimngvov•1h ago
One of the most fascinating (and challenging) parts of building this was seeing just how wildly different the "best" model can be depending on the document type.

For example, during testing, I found that Marker is an absolute champion for clean, single-column layouts like blog posts. But throw a dense, multi-column academic paper at it, and MinerU often produces a far superior, structured output with proper LaTeX. Then, for a complex invoice table, PP-StructureV3 frequently beats both of them.

This really solidified my belief that a "one-size-fits-all" parser is a myth. The future seems to be less about finding a single perfect model and more about building a quick, effective workflow for selecting the right specialist for the job. It's a classic "routing" problem, and this tool is my attempt at solving the first step of that puzzle.
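As a toy illustration of that routing idea, the observations above can be written as a lookup table. This is only a sketch: the document-type labels, the `pick_parser` helper, and the fallback default are assumptions made for the example, and it presumes some earlier step has already classified the document.

```python
# Mapping from a (hypothetical) document-type label to the parser that
# tended to do best on that type in the testing described above.
ROUTES = {
    "single_column": "Marker",           # clean layouts such as blog posts
    "academic_multi_column": "MinerU",   # dense papers with LaTeX
    "invoice_table": "PP-StructureV3",   # complex tables
}

def pick_parser(doc_type: str, default: str = "Marker") -> str:
    """Return the preferred parser for a document type.

    The default is an arbitrary choice for this sketch, not a recommendation.
    """
    return ROUTES.get(doc_type, default)

if __name__ == "__main__":
    for doc_type in ("single_column", "academic_multi_column", "invoice_table", "unknown"):
        print(doc_type, "->", pick_parser(doc_type))
```

The hard part in practice is producing the document-type label reliably, which is why comparing outputs side by side is a useful first step before automating the choice.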
