frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Anyone used Reducto for parsing? How good is their embedding-aware chunking?

1•Bahushruth•11h ago
Curious if anyone here has used Reducto for document parsing or retrieval pipelines.

They seem to focus on generating LLM-ready chunks using a mix of vision-language models and something they call “embedding-optimized” or intelligent chunking. The idea is that it preserves document layout and meaning (tables, figures, etc.) before generating embeddings for RAG or vector search systems.

I’m mostly wondering how this works in practice

- Does their “embedding-aware” chunking noticeably improve retrieval or reduce hallucinations?

- Did you still need to run additional preprocessing or custom chunking on top of it?

- How well does it play with downstream systems like Elasticsearch or Pinecone?

Basically trying to understand whether Reducto’s semantic chunking is a meaningful improvement over just doing traditional fixed-size or recursive splits.

Would appreciate hearing from anyone who’s tried it in production or at scale.

Hurricane Melissa poised to become catastrophic major hurricane, head to Jamica

https://yaleclimateconnections.org/2025/10/hurricane-melissa-poised-to-rapidly-intensify-as-it-he...
1•WarOnPrivacy•41s ago•0 comments

The Weekend SSL Certificate Expiration Pattern

https://www.haveibeenexpired.com/blog/weekend-certificate-expiration-pattern
1•adrukh•11m ago•1 comments

Belittled Magazine: Thirty years after the Sokal affair

https://thebaffler.com/salvos/belittled-magazine-robbins
1•Hooke•12m ago•0 comments

Sam Altman's next startup eyes using sound waves to read your brain

https://www.theverge.com/column/806666/sam-altman-merge-labs-brain-computer-interface-startup-hire
1•pedalpete•15m ago•1 comments

Haiku 4.5 – you'd be amazed if you gave it a chance

https://barazany.dev/blog/haiku-45-the-model-nobody-expected-to-care-about
2•barazany•22m ago•0 comments

LeafTok – Applied TikTok's Swipe UX to ePub/PDF Reading

https://leaftok.github.io/site/
2•iago-cavalcante•22m ago•1 comments

I want to build the next Silicon Valley in northern Mexico, can you help?

2•inodeman•24m ago•0 comments

Landonorris.com Stack Explained [video]

https://www.youtube.com/watch?v=HzL65tTeANs
2•ls-a•24m ago•0 comments

Leaving the Freedesktop.org Community

https://vt.social/@lina/115431232807081648
3•birdculture•26m ago•0 comments

Stablecoin Use for Payments Jumps 70% Since US Regulation

https://www.bloomberg.com/news/articles/2025-10-25/stablecoin-use-for-payments-jumps-70-since-us-...
2•gametorch•28m ago•0 comments

Landonorris.com

https://landonorris.com/
2•ls-a•29m ago•0 comments

How the 'cryptobro' mystique is taking over culture

https://english.elpais.com/lifestyle/2025-10-24/i-need-10k-a-month-to-live-well-how-the-cryptobro...
2•geox•30m ago•0 comments

AI agents require to-do lists to stay on track

https://blog.justcopy.ai/p/why-your-ai-agents-need-a-todo-list
6•anup_sia•35m ago•1 comments

Meta will ban rival AI chatbots from WhatsApp

https://www.techradar.com/ai-platforms-assistants/meta-will-ban-rival-ai-chatbots-from-whatsapp
1•JumpCrisscross•36m ago•0 comments

The Father of French Journalism Who Documented Paris's Socialist Revolution

https://hyperallergic.com/649093/bruno-braquehais-paris-commune-photos/
1•yubblegum•39m ago•0 comments

The A1200 – Full-Size. Full Keyboard. Full Nostalgia

https://www.indieretronews.com/2025/10/the-a1200-full-size-full-keyboard-full.html
1•ibobev•43m ago•0 comments

Cloudflare blocked in Spain; Proton VPN signups surge 200%

https://protonvpn.com/blog/spain-cloudflare-block
2•akyuu•46m ago•0 comments

Trader who made $190M shorting crash also apparently bet on CZ's pardon

https://cointelegraph.com/news/donald-trump-us-cz-binance-founder-pardon-crypto-trader-profit
3•amrrs•46m ago•0 comments

How programs get run: ELF binaries

https://lwn.net/Articles/631631/
10•st_goliath•46m ago•0 comments

An Efficient Implementation of Self, a Dynamically-Typed Object-Oriented Langua [pdf]

https://courses.cs.washington.edu/courses/cse501/15sp/papers/chambers.pdf
8•todsacerdoti•49m ago•2 comments

The Weaponized Internet Theory

https://saviradev.substack.com/p/the-weaponized-internet-theory
2•colinlevine•49m ago•2 comments

Any hardware tinkerers have suggestions for building this mechanism?

https://opus.cafe/goal/iammaker/0
1•eastoeast•49m ago•1 comments

Nisus Writer: Schrödinger's Word Processor

https://tidbits.com/2025/10/25/nisus-writer-schrodingers-word-processor/
1•zdw•53m ago•0 comments

Author, Director, Performer, Audience

https://bumbershootsoft.wordpress.com/2025/10/25/author-director-performer-audience/
1•ibobev•58m ago•0 comments

An Update on TinyKVM

https://fwsgonzo.medium.com/an-update-on-tinykvm-7a38518e57e9
8•ingve•59m ago•0 comments

SynthID-Image: Invisibly Watermarking AI-Generated Imagery

https://arxiv.org/abs/2510.09263
1•theanonymousone•59m ago•0 comments

AI vs. AI

https://www.newcartographies.com/p/ai-vs-ai
1•FromTheArchives•1h ago•0 comments

Scopist

https://en.wikipedia.org/wiki/Scopist
1•benbreen•1h ago•0 comments

Unprecedented UK heatwave created extreme temperate wildfire risk

https://www.nature.com/articles/s43247-025-02746-8
4•PaulHoule•1h ago•0 comments

Computational Complexity (2023) [pdf]

https://samuelhautamaki.eu/Computational_Complexity.pdf
1•todsacerdoti•1h ago•0 comments