frontpage.

I keep hitting the same failure mode with agentic RAG over collections of similar PDFs, like monthly electricity and gas bills from the same utility provider.

It works well for retrieval: “Find my gas bill from January.”

Though even there similarity can be brittle. If I don’t specify the year, retrieval may surface the wrong January because multiple documents look nearly identical.

It really breaks down for aggregation: “How much did I spend on electricity and gas last year?” “Which months had the highest energy costs?”

At that point the problem feels misaligned with similarity search itself. You don’t want relevant chunks, you want structured values aggregated across documents.

Curious how people solve this. SQL tools? Structured extraction? Different agent patterns?

Show HN: Unusual Wikipedia

Show HN: Fenster – Run Chromes Local Gemini Nano as a CLI

Maker Camp: Shenzhen

A breakthrough in C/C++ dependency management

Learning to Orchestrate Agents in Natural Language with the Conductor

When LLMs Get Personal

Self-hosting isn't scary: a practical guide with Coolify and Hetzner

U.S. Department of State on Flickr

Does reading do us any good?

How to Get Traction or First Clients?

70x faster cold(ish) starts for SGLang

Durable, durable, durable: the AI infrastructure category is forming

EFF Challenges Secrecy in Eastern District of Texas Patent Case

Stop California's Paternalistic and Privacy-Destroying Social Media Ban

Shareholder primacy undermined its own logic

From Ms to 26 Ns: How a $20 eBay SFP Module Beat My NTP Setup

Agentic AI made DevOps and Agile obsolete

Will I ever retire? It doesn't look like it

Show HN: Slatewave – a single color palette across terminals, editors, and apps

I did no work for a year and no one noticed

Human biology is ill-adapted to modern cities

Tell HN: GitHub PRs disappearing but only from search

"Parse, don't validate" through the years with C++

Google's A2A Protocol: How AI Agents Will Talk to Each Other

The Signal is Broken

Three reasons why DeepSeek’s new model matters

Show HN: Terminal UI for managing SSH servers (users admin, file transfers)

Show HN: 2 weeks of coding, 3 months of OpenAI review, my ChatGPT App is live

Show HN: Vibe-coding video games with Claude (Day 14: Tetris)

GitHub Copilot is moving to usage-based billing

Ask HN: How do you solve aggregation when agentic RAG breaks down?