The Problem: MCP (Model Context Protocol) is great for giving LLMs access to external tools. But if you connect multiple servers (GitHub, Linear, Postgres, Slack), you end up with 40-50k tokens of tool definitions injected into every request – before the agent even does anything.
On a 200k-context model, that's 20-25% of the window gone. On smaller models, it's worse. And most runs only use 1-2 tools.
The Solution: MCPlexor sits between your agent and your MCP servers. Instead of loading all tool definitions upfront:
- The agent asks for a capability ("create an issue")
- MCPlexor routes to the right server using semantic matching
- Only the relevant tools get exposed

Result: ~500 tokens of overhead instead of ~20k.
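To make the matching step concrete, here's a minimal sketch of embedding-based routing. This is not MCPlexor's actual code; the `Tool` struct, field names, and vectors are made up for illustration. The idea is just: compare the query embedding against precomputed embeddings of each tool's description and pick the nearest.

```go
package main

import (
	"fmt"
	"math"
)

// Tool pairs an MCP tool with a precomputed embedding of its description.
// (Hypothetical structure for illustration only.)
type Tool struct {
	Server    string
	Name      string
	Embedding []float64
}

// cosine returns the cosine similarity between two equal-length vectors.
func cosine(a, b []float64) float64 {
	var dot, na, nb float64
	for i := range a {
		dot += a[i] * b[i]
		na += a[i] * a[i]
		nb += b[i] * b[i]
	}
	if na == 0 || nb == 0 {
		return 0
	}
	return dot / (math.Sqrt(na) * math.Sqrt(nb))
}

// route picks the tool whose description embedding is closest to the query embedding.
func route(query []float64, tools []Tool) Tool {
	best, bestScore := tools[0], -1.0
	for _, t := range tools {
		if s := cosine(query, t.Embedding); s > bestScore {
			best, bestScore = t, s
		}
	}
	return best
}

func main() {
	tools := []Tool{
		{Server: "linear", Name: "create_issue", Embedding: []float64{0.9, 0.1, 0.0}},
		{Server: "github", Name: "open_pr", Embedding: []float64{0.1, 0.9, 0.0}},
	}
	// In practice the query embedding comes from an embedding model (e.g. via Ollama).
	query := []float64{0.85, 0.2, 0.05} // stand-in embedding of "create an issue"
	t := route(query, tools)
	fmt.Printf("routing to %s/%s\n", t.Server, t.Name)
}
```

Only the winning tool's definition then needs to be injected into the agent's context, which is where the token savings come from.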
Technical Details:
- Written in Go, single binary, no runtime deps
- Supports both stdio and HTTP transports
- Stores credentials in the OS keychain (macOS Keychain, Windows Credential Manager, Linux keyring via zalando/go-keyring)
- Works with Claude Desktop, Cursor, Augment Code, or any MCP-compatible client
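For reference, this is roughly what credential storage through zalando/go-keyring looks like. It's a sketch: the service/account naming here is an assumption, not MCPlexor's actual key layout.

```go
package main

import (
	"fmt"
	"log"

	"github.com/zalando/go-keyring"
)

func main() {
	// Hypothetical service/account names for illustration.
	const service = "mcplexor"
	const account = "github-mcp-token"

	// Store a credential in the OS keychain (Keychain / Credential Manager / Secret Service).
	if err := keyring.Set(service, account, "ghp_example_token"); err != nil {
		log.Fatal(err)
	}

	// Read it back when the proxied server needs to authenticate.
	token, err := keyring.Get(service, account)
	if err != nil {
		log.Fatal(err)
	}
	fmt.Println("retrieved token of length", len(token))
}
```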
The routing logic can run entirely locally using Ollama. No API calls, no cost, works offline.
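If you're curious what the local path looks like, here's a rough sketch of fetching an embedding from a locally running Ollama instance over its HTTP API. The model name and default port are assumptions for illustration, not MCPlexor's configuration.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"log"
	"net/http"
)

// embed asks a local Ollama instance for an embedding of the given text.
func embed(text string) ([]float64, error) {
	body, _ := json.Marshal(map[string]string{
		"model":  "nomic-embed-text", // assumed embedding model
		"prompt": text,
	})
	resp, err := http.Post("http://localhost:11434/api/embeddings", "application/json", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	defer resp.Body.Close()

	var out struct {
		Embedding []float64 `json:"embedding"`
	}
	if err := json.NewDecoder(resp.Body).Decode(&out); err != nil {
		return nil, err
	}
	return out.Embedding, nil
}

func main() {
	vec, err := embed("create an issue")
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("got %d-dimensional embedding\n", len(vec))
}
```

Since everything stays on localhost, there are no per-request API costs and it keeps working without a network connection.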
Business Model (for transparency):
- Local/Ollama users: completely free
- Cloud tier (on waitlist): we run inference on efficient small models instead of Opus/Pro, pass on the savings, and take a margin

The bet is that routing is a narrow enough task that a fine-tuned 7B model does it as well as a frontier model. Early testing suggests this works.
Repo: https://github.com/arustagi101/mcplexor
Install: `curl -fsSL https://mcplexor.com/install.sh | bash`

Happy to answer questions about the architecture, the routing approach, or anything else.