frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Mistral OCR 4

https://mistral.ai/news/ocr-4/
172•meetpateltech•2h ago

Comments

Ducki•1h ago
I was processing 55 year old paper files, most of them severely degraded, with its predecessor model. I was very impressed! I also tried Abbyy Finereader but it didn't even come close in my experience.
philipkglass•1h ago
I used Abbyy Finereader for several years. I loved it. I completed some large projects with it. Modern VLMs put classic FineReader to shame for processing low-resolution/degraded/non-standard text.

I'm personally using the small Qwen 3.5 models. If you have an OCR problem, Mistral OCR 4 is probably great. Open weights models that you can run on a laptop may also work great.

jppope•1h ago
Is there something wrong with their certificate? Chromium is saying https isn't valid
collabs•1h ago
Looks good to me on both brave (on android) and firefox (on windows 11). Lets see what ssl labs says (it is running now)

https://www.ssllabs.com/ssltest/analyze.html?d=mistral.ai&la...

Looks good so far, A+ on ipv4 as well as ipv6

Edit: I also asked Gemini 3.1 Pro to analyze the certificate and it looks good

It looks like you have shared an `about:certificate` URL containing a chain of three Base64-encoded X.509 TLS/SSL certificates. This specific chain is used to secure connections to *mistral.ai*.

Here is the decoded breakdown of the certificate chain you provided:

## Certificate Chain Overview

This is a standard three-tier certificate chain issued by Google Trust Services for the Mistral AI domain.

---

### 1. Leaf Certificate (End-Entity)

This is the specific certificate issued to the website to verify its identity and encrypt traffic.

* *Subject (Common Name):* `mistral.ai` * *Subject Alternative Names (SANs):* `mistral.ai`, `workers.mistral.ai` * *Issuer:* WE1 (Google Trust Services) * *Valid From:* June 13, 2026 * *Valid To:* September 11, 2026 * *Key Type:* Elliptic Curve (ECDSA)

### 2. Intermediate Certificate

This certificate acts as a bridge between the website's certificate and the trusted Root CA.

* *Subject:* WE1 (Google Trust Services) * *Issuer:* GTS Root R4 (Google Trust Services LLC) * *Valid From:* December 13, 2023 * *Valid To:* February 20, 2029 * *Key Type:* Elliptic Curve (ECDSA)

### 3. Root Certificate

This is the foundational trust anchor pre-installed in browsers and operating systems.

* *Subject:* GTS Root R4 (Google Trust Services LLC) * *Issuer:* GTS Root R4 (Self-signed) * *Valid From:* June 22, 2016 * *Valid To:* June 22, 2036 * *Key Type:* Elliptic Curve (ECDSA)

jppope•42m ago
thanks I'm going to have to check whats going on with my setup then
mdrzn•1h ago
It'll be interesting to see how this ranks against https://github.com/baidu/Unlimited-OCR
cdnsteve•1h ago
Right, just announced https://x.com/BaiduAI_News/status/2069322806748410291
ge96•1h ago
1000 pages for $4? damn how does it compare to llama parse I wonder
thenthenthen•1h ago
Or Apples local OCR/Vision models?
aliljet•52m ago
I was just using infinity parser 2 (flash, to be fair) for pennies self-hosted to run through thousands of pages of documents with remarkable confidence. I decided to use https://huggingface.co/datasets/allenai/olmOCR-bench to determine what was the best OCR tool, yesterday, but I've got no idea what the best is now. What is the dominant OCR eval right now? Between Baidu and Mistral this morning, I wonder if there's a new tool to switch to..
freezed8•43m ago
(jerry from llamaindex here) we're gonna benchmark on ParseBench and report the results!
tdubey•1h ago
Are there benchmarks for how this performs on charts, or maybe more accurately, plots? I've yet to find a model that can digitize a plot into X,Y points with some accuracy in my use case of digitizing old datasheets.
utopiah•1h ago
"A note on out-of-scope use. OCR 4 is a document-understanding model, not a decision-maker. It is not intended for medical diagnosis, legal advice or judgment, high-stakes financial decisions, safety-critical systems, real-time/latency-sensitive processing, or non-document inputs (raw audio, video, etc.). "

Can't wait for the "oh so innovative" manager who will suggest during the next meeting "Ok... but what if WE used it for high-stakes financial decisions on non-document inputs like a photo from my phone?"

I guarantee you somebody on HN is going to comment about this "idea" next week.

weird-eye-issue•1h ago
Why would anybody do that you would simply get terrible results compared to dozens of other more capable models. It's for converting to text not answering questions. Just seems like you need some sort of weird angle to bring out an anti AI stance
alex43578•43m ago
I think his comment is referring to a scenario where a decision is made on financial numbers that are misrecognized. E.g. 9.0% actual is OCR’d as 90%
leoc•24m ago
“I delegated critical financial decisions to my OCR software, and you won’t believe what happened next.”
gpm•1h ago
Do these models (this one or its competitors) do handwriting recognition?
weird-eye-issue•1h ago
If you mean handwriting to text then yes
gpm•58m ago
Yep that's what I mean, thanks :)
9cb14c1ec0•58m ago
Yes, we have successfully used Mistral OCR for digitizing handwritten forms. You always have low percentage that need human review and adjustment, but overall Mistral has been highly accurate (their price is amazing, too).
observationist•52m ago
In the sense that you can get similarity scores for individual characters referenced against a known database of characters written by various individuals. You can get stylometry scores out of small LLMs that do demographic segmentation based on writing style using the same methods.

They won't have the capacity to be fed an image of handwritten text and say "Ahh, this is a note written by Winston Churchill!". You could very easily use these models and your agent framework of choice, like Hermes, the Segment Anything models, and other foss tooling to build a dedicated, specialist handwriting recognition system. Or facial recognition, or fingerprint recognition, etc - these sorts of things can be done very procedurally, without a lot of interpretive AI.

Insanity•1h ago
Recently I tied OCR with Opus 4.8. (I know, not technically right tool for the job). All I needed to do was extract dates from receipts. It got about 20% of the dates wrong yet rated all as “high confidence”.

Should have probably tried a more OCR specific model

nik736•1h ago
Opus is very good at OCR. Way better than the small 1-4B VLMs. If Opus failed, most likely those smaller models will fail as well.
MostlyStable•52m ago
How long have you been testing this? Have you noted a large improvement? I tested Opus for this quite a while ago (maybe 4.5? Whatever was out about a year ago), and it performed quite poorly on my use case.
nik736•48m ago
I have put together an internal benchmark on 1000s of business documents with weird tables, structure, etc. that I run on every relevant model release. Opus 4.8 performs very very well. But it is obviously overkill for the task (and expensive at doing so). I just wanted to respond to the OP.
Insanity•32m ago
I'm assuming that the reason I didn't have good success rate is because it was not scanned documents, but photographs, and lighting conditions weren't always ideal. I think scanned business documents are a happy-case scenario in a way. (obv, you seem to run it against some complex documents, so that's impressive)
stri8ted•1h ago
Way too expensive. Google vision OCR (which they failed to compare against), is $1.50 per 1k pages. Vs $4 from Mistral.
pmxi•59m ago
This has been a niche where Mistral has actually been successful. Btw, Hindi and Japanese are bucketed in "Rare Languages," which is odd.
ZiiS•52m ago
I read that as "languages under-represented in the training set".
greenleafone7•55m ago
After paying for Mistral and using it for a while I genuinely hated it. It's a productivity black hole and can't realistically compete with anyone. I chose it only because it was European, but no. I'd rather let my one year subscription go to waste than use anything 'Mistral'.
adlk•44m ago
what did you use it for and when?
amunozo•34m ago
Same, I got a refund 3 days later. It is unusable.
maelito•24m ago
Opposite advice. It's very useful to me for dev and general tasks.

Been using Claude in parallele, it's better not not that much, just 10x (or 100x ?) more expensive.

InsideOutSanta•19m ago
Mistral's coding models aren't on par with current SOTA US and Chinese models if that's what you're referring to, but I rather like their OCR models.
lxgr•11m ago
> After paying for Mistral and using it for a while I genuinely hated it

For OCR?

mcbetz•55m ago
Little on differences other than bounding boxes and double the price compared to their previous OCR v3 model from December - https://mistral.ai/news/mistral-ocr-3/ - other benchmarks were used back then.
MostlyStable•53m ago
Does anyone know of OCR benchmarks that include hand-written documents? I'm currently using Gemini pro 3 for this, and error rates are quite good, but it's a little bit pricey, and I'd be interested in a cheaper model that could perform as well, but almost all the OCR benchmarks I'm aware of (and I believe all the ones included in this announcement) are about printed/typeset text.
andrewmutz•43m ago
A tangential observation: the video on the linked page wasn't what I expected. I thought Mistral was a european AI company, so I didnt expect the video to be filmed in San Francisco featuring three people who don't seem to be european.

I'm not against them being a global organization, that's wonderful. I was just surprised. I expected a parisian office and european accents.

rjzzleep•35m ago
Unfortunately Europeans are terrible customers for making money. They ask a lot of questions and they're very stingy with their wallets. Americans on the other hand ...
rsynnott•6m ago
~Any borderline-large European tech company will have an office on the US west coast, for sales if nothing else. And probably sales engineering. The timezone difference is eight to ten hours; there is really no way around it.

(I did work for one which had an office in Vancouver, instead; same tz.)

flashfaffe2•3m ago
To the best of my knowledge, most of the founding team started their careers in the US ( meta,etc..) and their primary investors are US VCs. In that regard, they smartly benefit on both side : US funding and European brains
mrkn1•12m ago
This runs for free on CPU https://github.com/kouhxp/textsnap
bpodgursky•1h ago
I do not believe this story.

Opus 4.8 scanned hundreds of PDFs for me recently with the worst handwriting imaginable. 100% successful, other than one record where even I could not figure out what was written.

9cb14c1ec0•56m ago
I believe it. Makes me curious what your prompt was that got such a good result out of Opus.
Insanity•52m ago
I do not believe this story, because of the message I just posted above.

That's not really productive lol, I'm glad it worked for you but these models are non-deterministic and 'YMMV' very much applies everywhere. I had it parse receipts (in fairness, in variable lightning), all taken from iPhone cameras in the past year. And yeah, not a great job, about 20% failed to get the date correct. (Not outrageously wrong, e.g 05/20/2026 becomes 05/23/2026.

YMMV, glad it worked for you.

bpodgursky•46m ago
Are you sure you weren't using Sonnet or a low-effort reasoning mode?
Insanity•39m ago
Yes, lol
pwython•8m ago
Were your images larger than 3.75 megapixels (Opus max resolution)? iPhones generally take 24MP photos.

Show HN: TikZ Editor – WYSIWYG editor for figures in LaTeX

https://tikz.dev/editor/
137•DominikPeters•1h ago•24 comments

Unlimited OCR: One-Shot Long-Horizon Parsing

https://github.com/baidu/Unlimited-OCR
279•ingve•4h ago•77 comments

Spying on kids to save kids from spying is stupid

https://pluralistic.net/2026/06/23/destroy-the-village/
261•hn_acker•2h ago•166 comments

Lift4D: Harmonizing Single-View 3D Estimation for 4D Reconstruction In-the-Wild

https://lift4d.github.io/
32•ilreb•1h ago•1 comments

AI's Affordability Crisis

https://blog.dshr.org/2026/06/ais-affordability-crisis.html
88•ilreb•1h ago•78 comments

Mistral OCR 4

https://mistral.ai/news/ocr-4/
172•meetpateltech•2h ago•48 comments

Five monitors on a Commodore 128 [video]

https://www.youtube.com/watch?v=ul5hC3PY1Yg
9•EvanAnderson•21h ago•0 comments

Show HN: Bun-sqlgen – Type-safe raw SQL for Bun, no ORM

https://github.com/ilbertt/bun-sqlgen
27•ilbert•1h ago•13 comments

MSG Made Dossier on Activists Who Opposed Facial Recognition

https://www.404media.co/madison-square-garden-made-dossier-on-activists-who-opposed-facial-recogn...
139•cdrnsf•2h ago•28 comments

Plotnine

https://plotnine.org/
178•tosh•4d ago•57 comments

Open Source for IBM Z and LinuxONE

https://community.ibm.com/community/user/blogs/elizabeth-k-joseph1/2026/06/18/linuxone-open-sourc...
18•ncruces•3d ago•1 comments

Show HN: Treedocs: Documentation that automatically checks for staleness

https://dandylyons.github.io/treedocs/
13•DandyLyons•1h ago•7 comments

GLM-5.2 – How to Run Locally

https://unsloth.ai/docs/models/glm-5.2
521•TechTechTech•18h ago•248 comments

Lossless GIF recompression via exhaustive search

https://blog.arusekk.pl/posts/lossless-gif-recompression/
29•ZacnyLos•3h ago•3 comments

Will It Mythos?

https://swelljoe.com/post/will-it-mythos/
243•mindingnever•12h ago•180 comments

Crypto in 2026: Oh, This Is the Bad Place

https://www.stephendiehl.com/posts/bad_place_2026/
292•ibobev•6h ago•331 comments

VibeThinker: 3B param model that beats Opus 4.5 on reasoning with novel SFT+GRPO

https://arxiv.org/abs/2606.16140
309•timhigins•14h ago•165 comments

80386 Early Start Memory Access

https://nand2mario.github.io/posts/2026/80386_early_start/
17•nand2mario•3h ago•1 comments

Researchers used math to crack Wordle

https://www.binghamton.edu/news/story/6327/s-m-a-r-t-these-researchers-used-math-to-crack-wordle
20•hhs•2d ago•26 comments

Steam Machine launches today

https://store.steampowered.com/news/group/45479024/view/685257114654870245
1821•theschwa•23h ago•1555 comments

The Low-Tech AI of Elden Ring

https://nega.tv/posts/low-tech-ai-of-elden-ring.html
20•g0xA52A2A•4h ago•5 comments

Show HN: Neural Particle Automata

https://selforg-npa.github.io/
64•esychology•7h ago•14 comments

In praise of memcached

https://jchri.st/blog/in-praise-of-memcached/
236•j03b•15h ago•99 comments

The Traditional Vi

https://ex-vi.sourceforge.net/
50•exvi•7h ago•34 comments

Giant Banana Pulled Over: Driver Says Cops Have Stopped Him 100s of Times

https://cowboystatedaily.com/2026/06/18/giant-banana-pulled-over-in-montana-driver-says-cops-have...
167•speckx•2d ago•60 comments

Show HN: Shumai – open-source Frame.io alternative for creative work

https://github.com/shumaiOne/shumai
39•Yiling-J•6h ago•2 comments

Elevated error rate across multiple models

https://status.claude.com/incidents/jbhf20wjmzrf
163•rob•1h ago•186 comments

OpenAI DayBreak – GPT-5.5-Cyber

https://openai.com/index/daybreak-securing-the-world/
182•AaronO•14h ago•135 comments

8086 Segmented Memory was a good idea

https://owl.billpg.com/8086-segmented-memory-was-a-good-idea-almost/
54•billpg•2d ago•97 comments

My Mathematical Regression

https://blog.dahl.dev/posts/my-mathematical-regression/
342•aleda145•4d ago•134 comments