I benchmarked GitHub CLI, MCP, Tool Search, Code Mode so we know the differences

https://medium.com/@kunchenguid/i-benchmarked-github-cli-vs-mcp-vs-tool-search-vs-code-mode-turns-out-the-best-solution-is-none-93528d5039e4
2•akane8•1h ago

Comments

akane8•1h ago
MCP or CLI? There's a massive debate happening in the AI community about the best way to expose tools to agents. Out of curiosity, I spent the weekend doing an independent study and benchmarked GitHub CLI, MCP, MCP with Tool Search, and MCP with Code Mode, all with real data and practical tasks.

The numbers are quite interesting. A few key findings:

1. GitHub MCP is 2–3x more expensive to use than the CLI, which is probably not surprising. There’s almost no practical reason to use their MCP, apart from its different handling of security.

2. Tool Search saves upfront tokens but spends them on extra turns. Whether that trade-off pays off depends on task complexity. Tool Search also introduces a new failure mode due to imperfect search accuracy.

3. Code Mode is the cheapest way to use MCP, but it's still 2x more expensive than the CLI, and it’s very slow. Code Mode also introduces a unique failure mode when the agent writes buggy code or handles errors poorly.

4. It's possible to push CLIs further, toward a higher success rate at the lowest cost and latency, with a principled design approach that treats agent ergonomics as a first-class concern. I detailed this in https://axi.md.

The benchmark harness, results, and the reference implementation of gh-axi are all open source at https://github.com/kunchenguid/axi.
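To make the Tool Search trade-off in finding 2 concrete, here is a rough back-of-the-envelope sketch. It is not taken from the benchmark harness: the function names, the flat per-turn cost model, and every number below are assumptions chosen purely for illustration. It assumes each turn resends the tool definitions plus a fixed amount of context, and that Tool Search swaps the full schema set for a small search stub at the price of some extra search turns.

```python
def input_tokens(turns: int, tool_tokens: int, context_tokens: int) -> int:
    """Input tokens summed over all turns, assuming every turn resends
    the tool definitions plus a fixed-size context (a simplification)."""
    return turns * (tool_tokens + context_tokens)


def tool_search_pays_off(
    task_turns: int,
    extra_search_turns: int,
    full_schema_tokens: int,
    search_stub_tokens: int,
    context_tokens: int,
) -> bool:
    """True if loading only a search stub (plus extra search turns) uses
    fewer input tokens than loading every tool schema upfront."""
    upfront = input_tokens(task_turns, full_schema_tokens, context_tokens)
    searched = input_tokens(
        task_turns + extra_search_turns, search_stub_tokens, context_tokens
    )
    return searched < upfront


# Hypothetical numbers, not benchmark results.
# Large schema set, few extra search turns: the stub wins.
large_schema_case = tool_search_pays_off(3, 2, 20_000, 1_000, 4_000)
# Small schema set, many extra search turns: loading upfront wins.
small_schema_case = tool_search_pays_off(3, 4, 3_000, 1_000, 4_000)
print(large_schema_case, small_schema_case)
```

The model ignores context growth, output tokens, and failed searches, so treat it only as a way to see why the answer flips with the size of the schema set and the number of extra turns.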

hightechmind•51m ago
The concurrency finding is the most striking part — MCP forcing sequential API calls because of schema retransmission between calls is a structural penalty that compounds exactly on the multi-step tasks where you'd most want MCP's composition benefits. The ci_failure_investigation example makes it concrete: 15 turns vs 3, 12× cost difference.

Have you looked at whether client-side schema caching could recover that? If the agent doesn't re-transmit tool definitions on every turn, the sequential vs parallel gap should narrow significantly.

akane8•42m ago
Technically MCP tools can be called in parallel as well, but it seems the agent is generally less likely to do so.

In the evaluation I did account for prompt caching, so the multi-turn penalty is already minimized, yet it was still significant enough to make the difference.
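As a rough illustration of why extra turns can still cost more with prompt caching on, here is a small sketch. It is not the evaluation's actual cost model: the function name, the 10% cached-token rate, and the token counts are all hypothetical. It assumes every turn resends the full history, with previously seen tokens billed at a discounted rate and newly added tokens billed in full.

```python
def billed_input_tokens(
    turns: int,
    prefix_tokens: int,
    tokens_per_turn: int,
    cached_rate: float,
) -> float:
    """Full-price-equivalent input tokens over a conversation.

    Assumes each turn resends everything so far: tokens already seen on an
    earlier turn are billed at `cached_rate` (a fraction of full price),
    and tokens added this turn are billed at full price.
    """
    total = 0.0
    for turn in range(turns):
        if turn == 0:
            # First turn: nothing is cached yet.
            previously_seen = 0
            fresh = prefix_tokens + tokens_per_turn
        else:
            previously_seen = prefix_tokens + turn * tokens_per_turn
            fresh = tokens_per_turn
        total += previously_seen * cached_rate + fresh
    return total


# Hypothetical numbers: 10k-token prefix, 1k new tokens per turn,
# cached tokens billed at 10% of the normal rate.
short_run = billed_input_tokens(3, 10_000, 1_000, 0.1)
long_run = billed_input_tokens(15, 10_000, 1_000, 0.1)
print(short_run, long_run, long_run / short_run)
```

Under these made-up numbers the 15-turn run still bills roughly three times the input of the 3-turn run, since every extra turn pays for its own new tokens plus a discounted re-read of a growing history.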
