Show HN: RAG Architecture for optimizing retrieval volume/relevancy tradeoff

1•Gregoryy•3w ago

We propose a RAG architecture that uses hierarchical semantic chunking and graph-based context exclusion to maximize relevant information while minimizing the total volume of retrieved context.

The system recursively splits documents into a hierarchical tree structure and dynamically selects the most optimally-sized chunk from each branch by identifying and excluding redundant ancestors and descendants during the search process.

This approach ensures a higher relevant-to-total information ratio by retrieving diverse segments from across the document without including overlapping or nested chunks

Comments

v_CodeSentinal•3w ago

Interesting approach. How does this architecture handle the 'lost in the middle' phenomenon when the retrieval volume increases? I've found that simply increasing volume often degrades reasoning quality unless re-ranking is extremely aggressive.

Gregoryy•3w ago

we consider this a separate issue that should be addressed using special approaches. however, the proposed technique generally tends to decrease the probability of something being lost in the middle by minimizing the total size of the retrieved documents.

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

A Bid-Based NFT Advertising Grid

AI readability score for your documentation

NASA Study: Non-Biologic Processes Don't Explain Mars Organics

I inhaled traffic fumes to find out where air pollution goes in my body

X said it would give $1M to a user who had previously shared racist posts

155M US land parcel boundaries

Private Inference

Font Rendering from First Principles

Show HN: Seedance 2.0 AI video generator for creators and ecommerce

Wally: A fun, reliable voice assistant in the shape of a penguin

Rewriting Pycparser with the Help of an LLM

Lobsters Vibecoding Challenge

E-Commerce vs. Social Commerce

Avoiding Modern C++ – Anton Mikhailov [video]

Show HN: AegisMind–AI system with 12 brain regions modeled on human neuroscience

Zig – Package Management Workflow Enhancements

AI-powered text correction for macOS

AppSecMaster – Learn Application Security with hands on challenges

Fibonacci Number Certificates

AI Overviews are killing the web search, and there's nothing we can do about it

City skylines need an upgrade in the face of climate stress

1979: The Model World of Robert Symes [video]

Satellites Have a Lot of Room

1980s Farm Crisis

Show HN: FSID - Identifier for files and directories (like ISBN for Books)

Show HN: Holy Grail: Open-Source Autonomous Development Agent

Show HN: Minecraft Creeper meets 90s Tamagotchi

Show HN: Termiteam – Control center for multiple AI agent terminals

The only U.S. particle collider shuts down