They seem to focus on generating LLM-ready chunks using a mix of vision-language models and what they call "embedding-optimized" (i.e., intelligent) chunking. The idea is that it preserves document layout and meaning (tables, figures, etc.) before generating embeddings for RAG or vector-search systems.
I'm mostly wondering how this works in practice:
- Does their “embedding-aware” chunking noticeably improve retrieval or reduce hallucinations?
- Did you still need to run additional preprocessing or custom chunking on top of it?
- How well does it play with downstream systems like Elasticsearch or Pinecone?
Basically, I'm trying to understand whether Reducto's semantic chunking is a meaningful improvement over traditional fixed-size or recursive splits.
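For reference, this is the kind of naive baseline I'm comparing against: a fixed-size sliding window with character overlap. The function name and parameter values are just placeholders I made up for illustration, not anything from Reducto:

```python
def fixed_size_chunks(text: str, max_chars: int = 1000, overlap: int = 100) -> list[str]:
    """Naive baseline: split text into fixed-size chunks with a small overlap."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + max_chars, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        # Step back by the overlap so context spans chunk boundaries
        start = end - overlap
    return chunks

# Stand-in for extracted document text
doc = "Some extracted document text. " * 500
print(len(fixed_size_chunks(doc)))
```

This kind of split obviously knows nothing about tables, figures, or section boundaries, which is exactly where I'd expect layout-aware chunking to help, if it helps at all.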
Would appreciate hearing from anyone who’s tried it in production or at scale.