Mistral OCR 4

https://mistral.ai/news/ocr-4/
99•meetpateltech•1h ago

Comments

Ducki•41m ago
I was processing 55-year-old paper files, most of them severely degraded, with its predecessor model. I was very impressed! I also tried ABBYY FineReader, but it didn't even come close in my experience.
philipkglass•9m ago
I used ABBYY FineReader for several years, for some large projects. I loved it. Modern VLMs put FineReader to shame for processing low-resolution/degraded/non-standard text.

I'm personally using the small Qwen 3.5 models. If you have an OCR problem, Mistral OCR 4 is probably great. Open-weights models that you can run on a laptop may also work great.

jppope•39m ago
Is there something wrong with their certificate? Chromium is saying https isn't valid
collabs•27m ago
Looks good to me on both Brave (on Android) and Firefox (on Windows 11). Let's see what SSL Labs says (it is running now)

https://www.ssllabs.com/ssltest/analyze.html?d=mistral.ai&la...

Looks good so far, A+ on IPv4 as well as IPv6

Edit: I also asked Gemini 3.1 Pro to analyze the certificate and it looks good

It looks like you have shared an `about:certificate` URL containing a chain of three Base64-encoded X.509 TLS/SSL certificates. This specific chain is used to secure connections to *mistral.ai*.

Here is the decoded breakdown of the certificate chain you provided:

## Certificate Chain Overview

This is a standard three-tier certificate chain issued by Google Trust Services for the Mistral AI domain.

---

### 1. Leaf Certificate (End-Entity)

This is the specific certificate issued to the website to verify its identity and encrypt traffic.

* *Subject (Common Name):* `mistral.ai`
* *Subject Alternative Names (SANs):* `mistral.ai`, `workers.mistral.ai`
* *Issuer:* WE1 (Google Trust Services)
* *Valid From:* June 13, 2026
* *Valid To:* September 11, 2026
* *Key Type:* Elliptic Curve (ECDSA)

### 2. Intermediate Certificate

This certificate acts as a bridge between the website's certificate and the trusted Root CA.

* *Subject:* WE1 (Google Trust Services)
* *Issuer:* GTS Root R4 (Google Trust Services LLC)
* *Valid From:* December 13, 2023
* *Valid To:* February 20, 2029
* *Key Type:* Elliptic Curve (ECDSA)

### 3. Root Certificate

This is the foundational trust anchor pre-installed in browsers and operating systems.

* *Subject:* GTS Root R4 (Google Trust Services LLC)
* *Issuer:* GTS Root R4 (Self-signed)
* *Valid From:* June 22, 2016
* *Valid To:* June 22, 2036
* *Key Type:* Elliptic Curve (ECDSA)
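
As a quick sanity check on the validity windows quoted above, here is a minimal Python sketch. It only does date arithmetic on the dates as they appear in the breakdown; it does not fetch or verify the live certificate. The `CHAIN` table, the helper names, and the check date of June 23, 2026 are assumptions made for illustration.

```python
from datetime import date

# Validity dates copied from the breakdown above (not re-verified
# against the certificate actually served by mistral.ai).
CHAIN = {
    "leaf (mistral.ai)": (date(2026, 6, 13), date(2026, 9, 11)),
    "intermediate (WE1)": (date(2023, 12, 13), date(2029, 2, 20)),
    "root (GTS Root R4)": (date(2016, 6, 22), date(2036, 6, 22)),
}


def validity_days(not_before: date, not_after: date) -> int:
    """Length of the validity window in whole days."""
    return (not_after - not_before).days


def is_valid_on(day: date, not_before: date, not_after: date) -> bool:
    """True if `day` falls inside the validity window (inclusive)."""
    return not_before <= day <= not_after


# Hypothetical check date, chosen only for illustration.
CHECK_DATE = date(2026, 6, 23)

for name, (start, end) in CHAIN.items():
    status = "valid" if is_valid_on(CHECK_DATE, start, end) else "NOT valid"
    print(f"{name}: {validity_days(start, end)} days, {status} on {CHECK_DATE}")
```

By these dates the leaf certificate has a 90-day window, and all three certificates cover the assumed check date. That fits the "looks good" reports, although a date check alone cannot rule out a local trust-store or clock problem on the machine that saw the warning.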

mdrzn•37m ago
It'll be interesting to see how this ranks against https://github.com/baidu/Unlimited-OCR
cdnsteve•13m ago
Right, just announced https://x.com/BaiduAI_News/status/2069322806748410291
ge96•25m ago
1000 pages for $4? Damn. I wonder how it compares to LlamaParse.
thenthenthen•14m ago
Or Apple's local OCR/Vision models?
tdubey•21m ago
Are there benchmarks for how this performs on charts, or maybe more accurately, plots? I've yet to find a model that can digitize a plot into X,Y points with some accuracy in my use case of digitizing old datasheets.
utopiah•18m ago
"A note on out-of-scope use. OCR 4 is a document-understanding model, not a decision-maker. It is not intended for medical diagnosis, legal advice or judgment, high-stakes financial decisions, safety-critical systems, real-time/latency-sensitive processing, or non-document inputs (raw audio, video, etc.). "

Can't wait for the "oh so innovative" manager who will suggest during the next meeting "Ok... but what if WE used it for high-stakes financial decisions on non-document inputs like a photo from my phone?"

I guarantee you somebody on HN is going to comment about this "idea" next week.

weird-eye-issue•5m ago
Why would anybody do that? You would simply get terrible results compared to dozens of other, more capable models. It's for converting documents to text, not answering questions. It just seems like you need some sort of weird angle to bring out an anti-AI stance.
gpm•11m ago
Do these models (this one or its competitors) do handwriting recognition?
weird-eye-issue•4m ago
If you mean handwriting to text then yes
gpm•3m ago
Yep that's what I mean, thanks :)
9cb14c1ec0•3m ago
Yes, we have successfully used Mistral OCR for digitizing handwritten forms. You always have a low percentage that needs human review and adjustment, but overall Mistral has been highly accurate (their price is amazing, too).
Insanity•8m ago
Recently I tried OCR with Opus 4.8 (I know, not technically the right tool for the job). All I needed to do was extract dates from receipts. It got about 20% of the dates wrong, yet rated all of them as “high confidence”.

I probably should have tried a more OCR-specific model.

nik736•7m ago
Opus is very good at OCR. Way better than the small 1-4B VLMs. If Opus failed, most likely those smaller models will fail as well.
bpodgursky•6m ago
I do not believe this story.

Opus 4.8 scanned hundreds of PDFs for me recently with the worst handwriting imaginable. 100% successful, other than one record where even I could not figure out what was written.

stri8ted•7m ago
Way too expensive. Google Vision OCR (which they failed to compare against) is $1.50 per 1k pages, vs. $4 from Mistral.
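
To put the two quoted prices side by side, here is a rough Python sketch. It takes the per-1k-page figures exactly as stated in this thread and assumes flat pricing with no volume tiers, free quotas, or batch discounts, none of which are checked here. The page count is a hypothetical workload.

```python
# Prices per 1,000 pages as quoted in the thread (unverified, flat-rate assumption).
MISTRAL_OCR_PER_1K = 4.00
GOOGLE_VISION_PER_1K = 1.50


def cost_usd(pages: int, price_per_1k: float) -> float:
    """Flat-rate cost in USD for a given number of pages."""
    return pages / 1000 * price_per_1k


# Hypothetical workload, for illustration only.
pages = 250_000

mistral = cost_usd(pages, MISTRAL_OCR_PER_1K)
google = cost_usd(pages, GOOGLE_VISION_PER_1K)
ratio = MISTRAL_OCR_PER_1K / GOOGLE_VISION_PER_1K

print(f"Mistral OCR:   ${mistral:,.2f}")
print(f"Google Vision: ${google:,.2f}")
print(f"Difference:    ${mistral - google:,.2f} ({ratio:.2f}x)")
```

At these rates the gap is $2.50 per 1,000 pages, so whether it matters depends on volume and on how much human review each tool's errors cost downstream.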
pmxi•3m ago
This has been a niche where Mistral has actually been successful. Btw, Hindi and Japanese are bucketed in "Rare Languages," which is odd.
