I built callmux to fix this. It's an MCP proxy that sits between your agent (Claude, Codex, etc.) and any MCP server, adding parallel execution, batching, pipelining, and caching as meta-tools. Instead of 7 sequential get_issue calls, the agent makes 1 callmux_parallel call. The actual data transferred is identical. What you eliminate is the per-call overhead.
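To make the batching concrete, here's roughly what the argument to a single callmux_parallel call could look like. This is my own sketch of the shape, not callmux's documented schema, and the repo/issue values are made up:

```python
# Hypothetical argument shape for one callmux_parallel meta-tool call
# that replaces 7 sequential get_issue calls. The real callmux schema
# may differ; this is an illustration only.
batch = {
    "calls": [
        {
            "tool": "get_issue",
            "arguments": {"owner": "octocat", "repo": "hello-world", "issue_number": n},
        }
        for n in range(1, 8)
    ]
}

# One tool call goes out; the results come back together as one response.
print(len(batch["calls"]))  # 7
```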
The math surprised me. For a batch of 7 operations:
Without callmux: ~525 tokens of structural overhead + ~900 tokens of intermediate reasoning = ~1,425 tokens of pollution
With callmux: ~75 tokens total
That's roughly a 19:1 reduction in context pollution from only a 7:1 reduction in tool calls.
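The arithmetic behind those numbers, using the post's per-call estimates (the per-call and per-gap token counts are assumed averages, not measured constants):

```python
# Back-of-envelope context math for a batch of N = 7 operations.
N = 7
per_call_overhead = 75    # tokens of tool-call structure per call (assumed average)
per_gap_reasoning = 150   # tokens of intermediate reasoning between calls (assumed average)

# Sequential: every call pays structural overhead, and each gap between
# calls accumulates an intermediate reasoning turn.
without = N * per_call_overhead + (N - 1) * per_gap_reasoning  # 525 + 900 = 1425

# Batched: one call's worth of overhead, no intermediate turns.
with_callmux = per_call_overhead  # ~75

print(without, with_callmux, round(without / with_callmux))  # 1425 75 19
```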
Prompt caching helps with the cost side of re-reading previous turns, but it doesn't shrink your context window: every intermediate reasoning turn still sits there taking up space, and compaction still triggers at the same threshold.

In practice, callmux reduces tool calls to about 15% of the original count. The context savings are larger than that ratio suggests, though, because you're also eliminating the intermediate reasoning between those calls, which is the biggest source of pollution.
The result: sessions last longer before hitting context limits, and the context window has less noise competing with your actual conversation.
I wrote up the full context math with diagrams here: https://longgamedev.substack.com/p/your-ai-agent-is-re-readi...
Setup is one line:
npx -y callmux -- npx -y @modelcontextprotocol/server-github
Works with Claude Code, Codex, and Claude Desktop. Also supports multi-server mode, remote HTTP/SSE servers, and tool filtering.
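For Claude Desktop specifically, you'd wrap the same one-liner in its standard mcpServers config (claude_desktop_config.json); the "github" key name is arbitrary, and the token placeholder is yours to fill in:

```json
{
  "mcpServers": {
    "github": {
      "command": "npx",
      "args": ["-y", "callmux", "--", "npx", "-y", "@modelcontextprotocol/server-github"],
      "env": { "GITHUB_PERSONAL_ACCESS_TOKEN": "<your-token>" }
    }
  }
}
```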