frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Focused input cuts LLM output tokens by 63% bench on CC with FastAPI

2•nicola_alessi•2h ago
I built an MCP server (vexp) that pre-indexes a codebase into a dependency graph and serves only relevant code to AI coding agents. While benchmarking it, I found something I wasn't looking for. The expected results were straightforward: less input context → lower cost, fewer tool calls → faster. But the output token reduction was the surprise.

Benchmark: 7 tasks on FastAPI (the OSS repo, ~800 Python files), 3 runs/task/arm, 42 total runs, Claude Sonnet 4.6, both arms in --strict-mcp-config isolation. Without graph: ~23 tool calls, ~40K input tokens, 504 output tokens, $0.78/task With graph: ~2.3 tool calls, ~8K input tokens, 189 output tokens, $0.33/task The 58% cost reduction and 22% speed improvement were expected. The 63% output token reduction was not. When Claude gets 40K tokens of context (most irrelevant), it generates a lot of "let me look at this file... I can see that..." narration while it orients itself. When it gets 8K tokens of pre-filtered, graph-ranked context, it skips straight to the answer. The exploration filler disappears. This seems like a general property of these models: noisy input → verbose output, focused input → focused output. I'd be curious if others have observed this in different contexts.

The approach: tree-sitter AST parsing → dependency graph in SQLite → single MCP tool (run_pipeline) that takes a task description, walks the graph, returns ranked context. Full source for high-centrality pivot nodes, compact skeletons for supporting code. Savings varied by task type — code understanding tasks saved the most (-64%), bug fixes the least (-30%). Makes sense: the more exploration a task normally requires, the more waste there is to cut.

Code: the graph resolution is handwritten Rust. The MCP transport, SQLite schema, and benchmark harness were built with Claude Code (felt appropriate). The benchmark analysis scripts were 100% Claude.

Free tier at https://vexp.dev — 2K nodes, 1 repo, no time limit. Runs locally (tree-sitter + SQLite, no cloud).

Show HN: I build a free topical authority map generator for blog

https://kitful.ai/write-tools/topical-map-generator
1•eashish93•30s ago•0 comments

Show HN: Headless Obsidian Sync Client

https://github.com/alexjbarnes/vault-sync
1•recouptreadmill•1m ago•0 comments

Show HN: VibeDiff – Blocks Claude Code from shipping breaking changes

https://github.com/SallahBoussettah/vibe-diff
1•Boussettah•1m ago•0 comments

Buckle Up for Bumpier Skies

https://www.newyorker.com/projects/interactive/2026/20260226-bilger-turbulence-header-prod/202602...
1•rbanffy•3m ago•0 comments

How To Put 30 Languages Into 1.1MB – hypher, a fast hyphenation library for Rust

https://laurmaedje.github.io/posts/hypher/
1•zdw•3m ago•0 comments

Prediction markets on Deutsche Bahn departure delays

https://bahn.bet
1•dancric•3m ago•0 comments

AI causing programmers to work longer hours fixing bugs

https://www.scientificamerican.com/article/why-developers-using-ai-are-working-longer-hours/
3•timoth3y•5m ago•1 comments

Show HN: A Free, interactive API course for product managers

https://api101.org/en
1•matb31240•6m ago•0 comments

Qwen 3.5: best open-weight vision models, now on live video at 200ms

https://blog.overshoot.ai/blog/qwen3.5-on-overshoot
1•YounElh•6m ago•0 comments

Voice Can Make Coding Agents Better (In Some Cases)

https://nimasadri11.github.io/random/voice-input-agents.html
2•nimasadri11•8m ago•0 comments

A Vindication of Bjorn Lomborg

https://humanprogress.org/a-vindication-of-bjorn-lomborg/
1•mpweiher•9m ago•0 comments

Study: LLMs Able to De-Anonymize User Accounts on Reddit, Hacker News

https://wjamesau.substack.com/p/warning-llms-able-to-de-anonymize
1•SLHamlet•9m ago•0 comments

A Soft-Landing Manual for the Second Gilded Age

https://www.joanwestenberg.com/a-soft-landing-manual-for-the-second-gilded-age/
1•spking•11m ago•0 comments

Claude Code skills for modern xOS (iOS, iPadOS, watchOS, tvOS) development

https://github.com/CharlesWiltgen/Axiom
1•rob•15m ago•0 comments

How Teens Use and View AI

https://www.pewresearch.org/internet/2026/02/24/how-teens-use-and-view-ai/
1•bookofjoe•16m ago•0 comments

Three scientists who said no to Epstein

https://www.science.org/content/article/meet-three-scientists-who-said-no-epstein
4•klipt•17m ago•0 comments

TrustLoop – Real-time policy enforcement and audit logging for AI agents

https://www.trustloop.live/
1•soji_mathew•19m ago•0 comments

Cybersecurity Forecast 2026 [pdf]

https://services.google.com/fh/files/misc/cybersecurity-forecast-2026-en.pdf
1•bookofjoe•23m ago•0 comments

Show HN: Interactive WordNet Visualizer-Explore Semantic Relations as a Graph

https://wordnet-vis.onrender.com/
1•ricky_risky•24m ago•0 comments

How to Manage Team Offsites Across Multiple Departments Without Micromanaging

https://daydreamsinruby.com/blog/2026-02-23-aligned-offsite-outcomes/
1•mooreds•25m ago•0 comments

Clud – super light-weight tool to turn natural language to terminal commands

https://github.com/oskob/clud
1•oskob•26m ago•2 comments

Log messages are mostly for the people operating your software

https://utcc.utoronto.ca/~cks/space/blog/programming/LogMessagesAreForOperation
1•todsacerdoti•27m ago•0 comments

A Race Within a Race: Exploiting CVE-2025-38617 in Linux Packet Sockets

https://blog.calif.io/p/a-race-within-a-race-exploiting-cve
3•WalterSobchak•27m ago•0 comments

So long, and thanks for all the logs

https://jerodsanto.net/2026/03/so-long-changelog/
2•mooreds•28m ago•0 comments

Computer Use Protocol – AI agents can perceive and interact with any desktop UI

https://github.com/computeruseprotocol/computeruseprotocol
3•k4cper-g•28m ago•0 comments

Why we love Vim (2021) [audio]

https://changelog.com/podcast/450
1•mooreds•29m ago•0 comments

Show HN: Limabean – a new implementation of Beancount in Clojure/Rust

https://github.com/tesujimath/limabean
1•tesujimath•29m ago•0 comments

Light-responsive porous aromatic frameworks manipulate CO2 uptake

https://www.pnas.org/doi/10.1073/pnas.2520024123
1•PaulHoule•30m ago•0 comments

Tech Legend Stewart Brand on Musk, Bezos and His Extraordinary Life

https://www.theguardian.com/technology/2026/feb/25/tech-legend-stewart-brand-on-musk-bezos-and-hi...
1•rmason•30m ago•0 comments

GoodSeed: A beautiful ML experiment tracker

https://goodseed.ai
1•gqsoqa•31m ago•0 comments