I ran into this building RAG over scientific literature. The standard approach (embed chunks, find top-k, generate) works fine for simple Q&A but falls apart when you need real research depth: multi-hop reasoning across papers, synthesizing conflicting results, tracing a finding back to the exact passage in a methods section. The problem wasn't the models; it was the retrieval layer.
Dewey treats documents, sections, and chunks as first-class API primitives. The section manifest (full heading hierarchy with titles and byte offsets) lets agents scan cheaply before committing to full chunk retrieval, the same way a researcher skims a table of contents before reading. The /research endpoint runs an agentic loop: at "exhaustive" depth it can traverse an entire corpus, querying iteratively, and returns a grounded answer with numbered inline citations that point to the exact source passages.
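To make the manifest-skim idea concrete, here is an illustrative sketch of the pattern (not the real Dewey SDK; the field names and helper functions are my assumptions): scan a cheap manifest of headings first, then fetch full chunks only for the sections that look relevant.

```python
# Illustrative sketch of the manifest-skim pattern, NOT the Dewey SDK.
# Field names (title, level, start, end) are assumptions for illustration.
from dataclasses import dataclass

@dataclass
class Section:
    title: str
    level: int   # heading depth (1 = top-level)
    start: int   # byte offset where the section starts
    end: int     # byte offset where the section ends

def skim(manifest, query_terms):
    """Select sections whose titles mention any query term,
    like a researcher scanning a table of contents before reading."""
    terms = [t.lower() for t in query_terms]
    return [s for s in manifest if any(t in s.title.lower() for t in terms)]

def fetch_chunks(document, sections):
    """Pull full text only for the byte ranges the skim selected."""
    return [document[s.start:s.end] for s in sections]

manifest = [
    Section("Introduction", 1, 0, 40),
    Section("Methods", 1, 40, 90),
    Section("Methods / Cell culture", 2, 40, 62),
    Section("Results", 1, 90, 140),
]
document = "x" * 140  # stand-in for the raw document bytes

hits = skim(manifest, ["methods"])
chunks = fetch_chunks(document, hits)
print([s.title for s in hits])  # only the matched sections get chunk retrieval
```

The point of the two-phase shape is cost: the manifest is tiny compared to the chunk store, so an agent can reject most of a corpus before paying for any full retrieval.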
Two ways in:
- REST API + TypeScript/Python SDKs for developers building research or document Q&A into their apps
- MCP server (@meetdewey/mcp on npm) for anyone using Claude, ChatGPT, or Cursor. Your document collections become tools without writing any code.
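For the MCP route, wiring the server into a client is typically a one-entry config change. A sketch for a Claude Desktop-style config, assuming the package follows the standard npx launch convention (the exact args are my assumption, not from the docs):

```json
{
  "mcpServers": {
    "dewey": {
      "command": "npx",
      "args": ["-y", "@meetdewey/mcp"]
    }
  }
}
```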
Bring your own OpenAI key and depth becomes a quality setting rather than a billing one. That includes AI image captioning, which makes figures and diagrams searchable alongside your text. No markup on generation.
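As a sketch of what a BYO-key research call might look like over plain HTTP (the URL, field names, and header names here are assumptions for illustration, not the documented API):

```python
# Hypothetical request shape for a /research call; every name below
# (base URL, "depth" field, header names) is an assumption, not the real API.
import json

payload = {
    "query": "Do reported binding affinities conflict across these papers?",
    "depth": "exhaustive",  # depth as a quality knob, paid for by your own key
}
headers = {
    "Authorization": "Bearer <DEWEY_API_KEY>",  # placeholder
    "X-OpenAI-Key": "<YOUR_OPENAI_KEY>",        # assumed BYO-key header
}
body = json.dumps(payload)
print(body)  # what would be POSTed to the (assumed) /research endpoint
```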
Built this solo. Happy to answer questions about the architecture, the retrieval design, or anything else. Curious whether others have found section-aware retrieval makes a meaningful difference vs. flat chunking in practice.
Free tier, no credit card required: https://meetdewey.com