frontpage.

Smart code reading for humans and AI agents. Tilth is what happens when you give ripgrep, tree-sitter, and cat a shared brain. --

v0.4.4: Added adaptive 2nd-hop impact analysis to callers search — when a function has ≤10 unique callers, tilth automatically traces callers-of-callers in a single scan. First full 26-task Opus baseline (previously 5 hard tasks only). Haiku adoption improved from 42% to 78%, flipping Haiku from a cost regression to -38% $/correct.

v0.4.5: Bumped TOKEN_THRESHOLD from 3500 to 6000 estimated tokens (~24KB), so mid-sized files return full content instead of an outline that agents then read back via 5–7 sequential --section calls. Fixed two major regressions: gin_radix_tree (+35% → ~tie) and rg_search_dispatch (+90% → -26% win). Sonnet hit 100% accuracy (52/52) and -34% $/correct overall.

https://github.com/jahala/tilth/

Full results: https://github.com/jahala/tilth/blob/main/benchmark/README.m...

-- PS: I dont have the budget to run the benchmark a lot (especially with Opus), so if any token whales has capacity to run some benchmarks, please feel free to PR results.

Maester – The Knowledge Engine of Your Company

Show HN: Anti-regression setup Claude Code – subagents, hooks, and Claude.md

SpawnAgent: Real-time on-chain intelligence and wallet monitoring platform

Confirmation: A Canadian Grocery Store and the Failure of Privacy Law

BBC Journalist SEO-Hacks ChatGPT and Google's AI

Show HN: SeaRoutes, find the shortest navigable sea routes on the globe

The Rise of the Financial Engineer

Show HN: Next job comes from someone you barely know

The Predatory Hegemon

US Draft Rules for Power over Nvidia's Global Sales

A Guide to Wine Certification Programs

Iranian strikes on Amazon data centers highlight industry's vulnerability

The Download: The startup that says it can stop lightning, and inside OpenAI's

Building a Database on S3

The largest open-source humanized voice library

Congress Is Considering Abolishing Your Right to Be Anonymous Online

Olmo Hybrid

Show HN: RedDragon, LLM-assisted IR analysis of code across languages

Exfiltrating passwords with no interaction using autofill

Show HN: Plought – Reduce noise in decision making

The Brand Age

We Only Accept Pre-Revenue Projects

My application programmer instincts failed when debugging assembler

Launch HN: Vela (YC W26) – AI for complex scheduling

Which H100 Instance to Train Nanochat – Benchmarking PCIe, SXM, and NVL

Düren's Hydrogen Bet: The Math Behind a Looming Liability

Using Structured Light Scanning and Photogrammetry in Cultural Heritage

Financial AGI announced – outperforms human experts on 12 professional exams

Most AI agent demos won't survive enterprise security review

Show HN: Experiment- enforcing accessibility guardrails during AI UI generation

Show HN: Reduce LLM token use by ~30% with this MCP/CLI tool(Claude benchmarked)

Comments