frontpage.

Excited to share Nanonets-OCR-s, a powerful and lightweight (3B) VLM model that converts documents into clean, structured Markdown. This model is trained to understand document structure and content context (like tables, equations, images, plots, watermarks, checkboxes, etc.).

Key Features:

LaTeX Equation Recognition Converts inline and block-level math into properly formatted LaTeX, distinguishing between $...$ and $$...$$.

Image Descriptions for LLMs Describes embedded images using structured <img> tags. Handles logos, charts, plots, and so on.

Signature Detection & Isolation Finds and tags signatures in scanned documents, outputting them in <signature> blocks.

Watermark Extraction Extracts watermark text and stores it within <watermark> tag for traceability.

Smart Checkbox & Radio Button Handling Converts checkboxes to Unicode symbols like , , and for reliable parsing in downstream apps.

Complex Table Extraction Handles multi-row/column tables, preserving structure and outputting both Markdown and HTML formats.

Huggingface / GitHub / Try it out: https://huggingface.co/nanonets/Nanonets-OCR-s

Try it with Docext in Colab: https://github.com/NanoNets/docext/blob/main/PDF2MD_README.m...

Tailscale Founder Talks Future IPO as Revenue Surges on AI Adoption

On the Usability of Editable Software

Plotform – Product Hunt but for your book launches

$100 Hamburger

Pip's Quake

Exploring the Dangers of AI in Mental Health Care

Review: 'Print the Legend' gives form to 3-D printer companies' history (2014)

SIMD-friendly algorithms for substring searching

After 18 Years of Infertility, an AI Tool Let a Couple Conceive

Building a WordPress MCP Server for Claude: Automating Blog Posts with AI

Chebfun: Open-source package for computing with functions to 15-digit accuracy

Let's Play some Glider 4.0 with John Calhoun [video]

Show HN: I created an AI form builder and it's free

Experimental Spacetime Distortion: Generating Gravitational Waves in the Lab

UK unis to cough up to £10M on Java to keep Oracle off their backs

The secret fast track for animal drugs

Comment on the Illusion of Thinking

Premium accounts to fund the matrix.org homeserver

Anne Wojcicki to buy back 23andMe and its data for $305M

DHS is using CBP Home Mobile App to incentivize the voluntary self-deportation

Filedb: Disk Based Key-Value Store Inspired by Bitcask

Venusian pancake dome likely formed due to elastic lithosphere and dense lava

Culinary Ocean that Separates the US and Europe: innards (1993)

Why do French men pee on the street [video]

Baking the Y Combinator from scratch (again)

ArkFlow and Python: Easy Real-Time AI

Regenerate Your Land

Show HN: Hack to Save Any Videos from YouTube

The Tech Job Meltdown

Fastmigrate: Database Migrations for SQLite

Show HN: Open-source 3B param model better than Mistral OCR