frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Mistral OCR 3

https://mistral.ai/news/mistral-ocr-3
234•pember•1d ago

Comments

pzo•1h ago
there has been so many open source OCR in the last 3 months that would be good to compare to those especially when some are not even 1B params and can be run on edge devices.

- paddleOCR-VL

- olmOCR-2

- chandra

- dots.ocr

I kind of miss there is not many leaderboard sections or arena for OCR and CV and providers hosting those. Neglected on both Artificial Analysis and OpenRouter.

pzo•1h ago
what I like in MistralOCR is that they have simple pricing $1/1k pages and API hosted on their servers. With other OCR is hard to compare pricing because are token based and you don't know how many tokens is the image unless you run your own test.

E.g. with Gemini 3.0 flash you might seem that model pricing increased only slightly comparing to Gemini 2.5 flash until you test it and will see that what used to be 258 per 384x384 input tokens now is around 3x more.

hereme888•46m ago
https://www.codesota.com/ocr
culi•31m ago
Someone posted a project here about a month ago where they compare models in head-to-head matchups similar to llmarena

https://www.ocrarena.ai/leaderboard

Hasn't been updated for Mistral but so far gemeni seems to top the leaderboard.

jeffbee•17m ago
OCR developers from decades past must be slapping their foreheads now that it seems users will wait a whole minute per page and be happy.
andai•4m ago
How can something have a very high ELO but a very low win rate?
andai•5m ago
I spent like three hours trying to get one of these running and then gave up. I think the paddleOCR one.

It took an hour and a half to install 12 gigabytes of pytorch dependencies that can't even run on my device, and then it told me it had some sort of versioning conflict. (I think I was supposed to use UV, but I had run out of steam by that point.)

Maybe I should have asked Claude to install it for me. I gave Claude root on a $3 VPS, and it seems to enjoy the sysadmin stuff a lot more than I do...

Incidentally I had a similar experience installing open web UI... It installed 12 GB of pytorch crap.. I rage quit and deleted the whole thing, and replicated the functionality I actually needed in 100 lines of HTML.... Too bad I can't do that with OCR ;)

Tiberium•1h ago
From a tweet: https://x.com/i/status/2001821298109120856

> can someone help folks at Mistral find more weak baselines to add here? since they can't stomach comparing with SoTA....

> (in case y'all wanna fix it: Chandra, dots.ocr, olmOCR, MinerU, Monkey OCR, and PaddleOCR are a good start)

belval•1h ago
I've worked on document extraction a lot and while the tweet is too flippant for my taste, it's not wrong. Mistral is comparing itself to non-VLM computer vision services. While not necessarily what everyone needs, they are a very different beasts compared to VLM based extraction because it gives you precise bounding boxes, usually at the cost of larger "document understanding".

Its failure mode are also vastly different. VLM-based extraction can misread entire sentences or miss entire paragraphs. Sonnet 3 had that issue. Computer vision models instead will make in-word typos.

petcat•1h ago
It seems like Mistral is just chasing around sort of "the fringes" of what could be useful AI features. Are they just getting out-classed by OAI, Google, Anthropic?

It seems like EU in general should be heavily invested in Mistral's development, but it doesn't seem like they are.

VWWHFSfQ•1h ago
I think there is a lot of broad support, but they're just kind of hamstrung by EU regulation on AI development at this stage. I think the end game will ultimately be getting acquired by an American company, and then relocating.
tensor•1h ago
I hope the EU blocks any acquisitions by American companies. The west needs to start protecting its strategic assets.
tensor•1h ago
Form processing is vastly more useful than meme generation. When people need to do real work this is the sort of tool they are going to reach for.
sbuttgereit•1h ago
Yep. I saw the title and got excited.... this is a particular problem area where I think these things can be very effective. There are so many data entry class tasks which don't require huge knowledge or judgement... just clear parsing and putting that into a more machine digestible form.

I don't know... feels like this sort of area, while not nearly so sexy as video production or coding or (etc.)... but seems like reaching a better-than-human performance level should be easier for these kinds of workloads.

lawlessone•1h ago
>It seems like EU in general should be heavily invested

Maybe, i think it will be to our benefit when the bubble pops that we are not heavily invested, no harm investing a little.

BoredPositron•1h ago
I guess it's better to do the same stuff everyone else is doing?
bee_rider•47m ago
Following the leaders too closely seems like a bad move, at least until a profitable business model for an AI model training company is discovered. Mistral’s models are pretty good, right? I mean they don’t have all the scaffolding around them that something like chatGPT does, but building all that scaffolding could be wasted effort until a profitable business model is shown.

Until then, they seem to be able to keep enough talent in the EU to train reasonably good models. The kernel is there, which seems like the attainable goal.

qwytw•20m ago
>Mistral’s models are pretty good, right

Are they? IIRC their best model is still worse than the gpt-oss-120B?

film42•1h ago
Is open router still sending all OCR jobs to Mistral? I wonder if they're trying to keep that spot. Seems like Mistral and Google are the best at OCR right now, with Google leading Mistral by a fair bit.
numlocked•28m ago
(I work at OpenRouter) If you send a PDF to our API we will:

1. Use native PDF parsing if the model supports it

2. Use this Mistral OCR model (we updated to this version yesterday)

3. UNLESS you override the "engine" param to use an alternate. We support a JS-based (non-LLM) parser as well [0]

So yes, in practice a lot of OCR jobs go to Mistral, but not all of them.

Would love to hear requests for other parsers if folks have them!

[0] https://openrouter.ai/docs/guides/overview/multimodal/pdfs#p...

hereme888•47m ago
I'm reading worse performance than many OSS offerings like Paddle, MinerU, MonkeyOCR, etc:

https://www.codesota.com/ocr

breadislove•34m ago
i just gave it a quick spin on my fav documents. quick check:

- table entries hallucinated - tables messed up (tables merged, forgot rows) - forgot to parse some text passages

if you are doing something serious, i would not use it

ipsum2•22m ago
You might want to mention that you are a competitor.
ghjv•20m ago
thought you were being flip or assuming that, but checked their profile and you are right. I agree that this should be disclosed in their comment.
vasco•26m ago
Gave it a birth registry from a Portuguese locality from 1755 which my dad and I often decipher to figure out geneology and it did a terrible job.

Regular Gemini Thinking can actually get 70-80% of the documents correct except lots of mistakes on given names. Chatgpt maybe understands like 50-60%.

This Mistral model butchered the whole text, literally not a word was usable. To the point I think I'm doing something wrong.

The test document: https://files.fm/u/3hduyg65a5

zzleeper•14m ago
Oh god, I'm sure I wouldn't come close to 50%; that's so hard to read
tecoholic•11m ago
> Mistral OCR 3 is ideal for both high-volume enterprise pipelines and interactive document workflows.

I don’t know how they can make this statement with 79% accuracy rate. For any serious use case, this is an unacceptable number.

I work with scientific journals and issues like 2.9+0.5 and 29+0.5 is something we regularly run into that has us never being able to fully trust automated processes and require human verification every step.

MallocVoidstar•8m ago
Where are you seeing 79% accuracy? 79% only occurs on the page as a win rate, not an accuracy
g947o•3m ago
And I believe the number is 74%, compared to OCR 2.

What matters is whether this is better than competition/alternatives. Of course nobody is just going to take the output as is. If you do that, that's your problem.

Mistral OCR 3

https://mistral.ai/news/mistral-ocr-3
235•pember•1d ago•29 comments

Garage – An S3 object store so reliable you can run it outside datacenters

https://garagehq.deuxfleurs.fr/
355•ibobev•6h ago•80 comments

TP-Link Tapo C200: Hardcoded Keys, Buffer Overflows and Privacy

https://www.evilsocket.net/2025/12/18/TP-Link-Tapo-C200-Hardcoded-Keys-Buffer-Overflows-and-Priva...
165•sibellavia•3h ago•41 comments

8-bit Boléro

https://linusakesson.net/music/bolero/index.php
90•Aissen•10h ago•14 comments

Graphite is joining Cursor

https://cursor.com/blog/graphite
128•fosterfriends•6h ago•151 comments

GotaTun -- Mullvad's WireGuard Implementation in Rust

https://mullvad.net/en/blog/announcing-gotatun-the-future-of-wireguard-at-mullvad-vpn
498•km•10h ago•105 comments

Amazon will allow ePub and PDF downloads for DRM-free eBooks

https://www.kdpcommunity.com/s/article/New-eBook-Download-Options-for-Readers-Coming-in-2026?lang...
475•captn3m0•12h ago•258 comments

Performance Hints (2023)

https://abseil.io/fast/hints.html
16•danlark1•4h ago•17 comments

Buteyko Method

https://en.wikipedia.org/wiki/Buteyko_method
6•rzk•19m ago•1 comments

Show HN: Stickerbox, a kid-safe, AI-powered voice to sticker printer

https://stickerbox.com/
29•spydertennis•2h ago•29 comments

Show HN: TinyPDF – 3kb pdf library (70x smaller than jsPDF)

https://github.com/Lulzx/tinypdf
61•lulzx•1d ago•8 comments

Qwen-Image-Layered: transparency and layer aware open diffusion model

https://huggingface.co/papers/2512.15603
12•dvrp•18h ago•2 comments

Rust's Block Pattern

https://notgull.net/block-pattern/
69•zdw•17h ago•22 comments

Believe the Checkbook

https://robertgreiner.com/believe-the-checkbook/
97•rg81•6h ago•38 comments

NOAA deploys new generation of AI-driven global weather models

https://www.noaa.gov/news-release/noaa-deploys-new-generation-of-ai-driven-global-weather-models
34•hnburnsy•1d ago•17 comments

The FreeBSD Foundation's Laptop Support and Usability Project

https://github.com/FreeBSDFoundation/proj-laptop
115•mikece•7h ago•41 comments

Ask HN: How are you LLM-coding in an established code base?

39•adam_gyroscope•3d ago•25 comments

Reverse Engineering US Airline's PNR System and Accessing All Reservations

https://alexschapiro.com/security/vulnerability/2025/11/20/avelo-airline-reservation-api-vulnerab...
69•bearsyankees•3h ago•31 comments

The pitfalls of partitioning Postgres yourself

https://hatchet.run/blog/postgres-partitioning
22•abelanger•3d ago•3 comments

Lite^3, a JSON-compatible zero-copy serialization format

https://github.com/fastserial/lite3
104•cryptonector•6d ago•29 comments

Wall Street Ruined the Roomba and Then Blamed Lina Khan

https://www.thebignewsletter.com/p/how-wall-street-ruined-the-roomba
138•connor11528•3h ago•87 comments

Show HN: I Made Loom for Mobile

https://demoscope.app
48•admtal•5h ago•31 comments

Postfix Macros and Let Place

https://nadrieril.github.io/blog/2025/12/09/postfix-macros-and-let-place.html
8•todsacerdoti•5d ago•1 comments

You can now play Grand Theft Auto Vice City in the browser

https://dos.zone/grand-theft-auto-vice-city/
205•Alifatisk•2h ago•57 comments

Response Healing: Reduce JSON defects by 80%+

https://openrouter.ai/announcements/response-healing-reduce-json-defects-by-80percent
17•numlocked•1d ago•13 comments

Detailed balance in large language model-driven agents

https://arxiv.org/abs/2512.10047
10•Anon84•3d ago•0 comments

Prompt caching for cheaper LLM tokens

https://ngrok.com/blog/prompt-caching/
252•samwho•3d ago•62 comments

History LLMs: Models trained exclusively on pre-1913 texts

https://github.com/DGoettlich/history-llms
726•iamwil•23h ago•357 comments

Getting bitten by Intel's poor naming schemes

https://lorendb.dev/posts/getting-bitten-by-poor-naming-schemes/
268•LorenDB•16h ago•141 comments

Show HN: Stepped Actions – distributed workflow orchestration for Rails

https://github.com/envirobly/stepped
77•klevo•5d ago•11 comments