I’ve been heavily using AI Agents (specifically the Antigravity environment) for coding, but I ran into three major engineering frustrations that made the experience painful on Windows:
Infinite Loading: Without a dedicated GPU setup, the agent was unresponsive for ~58s out of every minute.
Context Bloat: The agent kept requesting the full codebase context (repomix) repeatedly whenever it loaded MCP tools, locking up my local servers.
UI Lag & Zombie Processes: On Windows, the agent would often hang due to permission issues and lingering processes on the port.
I couldn't find a tool that solved the infrastructure side of this, so I built MCP Vault (mcpv).
Repo: https://github.com/thekeunpie-hash/mcpvault
How it works
mcpv acts as a "Smart Middleware" proxy between your Agent and your actual MCP tools. It safely hijacks the mcp_config.json to optimize traffic, manage system resources, and route tool calls efficiently.
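To make the gateway idea concrete, here is a minimal sketch of what taking over the config could look like. It assumes mcp_config.json uses the common {"mcpServers": {...}} layout. The function name wrap_config, the empty args list, and the sample server names are hypothetical illustrations, not mcpv's actual code.

```python
import copy


def wrap_config(config: dict, proxy_command: str = "mcpv") -> tuple[dict, dict]:
    """Return (new_config, upstream_backup) without mutating the input.

    Assumption: the config keeps its servers under the "mcpServers" key.
    """
    # Keep a copy of the real servers so the proxy can reach them later.
    upstream = copy.deepcopy(config.get("mcpServers", {}))
    new_config = copy.deepcopy(config)
    # The agent now sees exactly one server: the proxy.
    # The empty args list is a placeholder, not a real mcpv invocation.
    new_config["mcpServers"] = {"mcpv": {"command": proxy_command, "args": []}}
    return new_config, upstream


# Hypothetical original config with two upstream servers.
original = {
    "mcpServers": {
        "server_a": {"command": "python", "args": ["a.py"]},
        "server_b": {"command": "python", "args": ["b.py"]},
    }
}
new_config, upstream = wrap_config(original)
```

The agent only ever talks to the single proxy entry, while the backup holds everything needed to reach the real servers or restore the original file.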
Key Technical Features:
Zero-Latency Startup (Lazy Loading): Instead of indexing the entire file tree or initializing all upstream connections at boot, mcpv starts instantly (<0.1s). It initializes connections and scans files only when the agent explicitly requests them via the protocol.
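A rough sketch of the lazy-loading idea, assuming each upstream can be represented by a zero-argument factory that opens the connection. The class name LazyUpstreams and the demo server names are made up for illustration and are not taken from the mcpv source.

```python
from typing import Callable, Dict


class LazyUpstreams:
    """Opens an upstream connection the first time it is requested."""

    def __init__(self, factories: Dict[str, Callable[[], object]]):
        self._factories = factories  # nothing is connected at boot
        self._live: Dict[str, object] = {}

    def get(self, name: str) -> object:
        # Connect on first use, then reuse the same connection.
        if name not in self._live:
            self._live[name] = self._factories[name]()
        return self._live[name]


opened = []  # records which connections were actually opened


def make_factory(name: str) -> Callable[[], object]:
    def factory() -> object:
        opened.append(name)
        return f"conn:{name}"
    return factory


pool = LazyUpstreams({"files": make_factory("files"),
                      "search": make_factory("search")})
opened_at_boot = list(opened)  # still empty: startup did no work
first = pool.get("files")
second = pool.get("files")     # reuses the existing connection
```

Startup cost stays constant no matter how many upstream servers are configured, because the expensive step runs only for servers the agent actually touches.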
Smart Router (Unified Interface): It decouples the agent from specific server configurations. The agent calls a single flattened command (run_tool), and mcpv dynamically routes it to the correct upstream server. It even includes auto-correction to handle agent typos or hallucinations (e.g., if the agent tries to call a server name as a tool).
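The routing and auto-correction could be sketched roughly like this. It assumes upstream tools can be modeled as plain callables grouped by server name. The SmartRouter class, the 0.8 similarity cutoff, and the tool names are assumptions for illustration. Fuzzy matching uses Python's standard difflib, which may differ from what mcpv actually does.

```python
import difflib
from typing import Callable, Dict, Tuple


class SmartRouter:
    def __init__(self, servers: Dict[str, Dict[str, Callable]]):
        self._servers = servers
        # Flatten every tool into one index: tool name -> owning server.
        self._tool_index = {tool: server
                            for server, tools in servers.items()
                            for tool in tools}

    def resolve(self, name: str) -> Tuple[str, str]:
        if name in self._tool_index:
            return self._tool_index[name], name
        # Agent used a server name as a tool: accept it if unambiguous.
        if name in self._servers and len(self._servers[name]) == 1:
            return name, next(iter(self._servers[name]))
        # Agent made a typo: fall back to the closest known tool name.
        close = difflib.get_close_matches(name, list(self._tool_index),
                                          n=1, cutoff=0.8)
        if close:
            return self._tool_index[close[0]], close[0]
        raise KeyError(f"unknown tool: {name}")

    def run_tool(self, name: str, **kwargs):
        server, tool = self.resolve(name)
        return self._servers[server][tool](**kwargs)


# Hypothetical upstream servers and tools.
router = SmartRouter({
    "repomix": {"pack_codebase": lambda path: f"packed {path}"},
    "fs": {"read_file": lambda path: f"contents of {path}",
           "list_dir": lambda path: f"listing of {path}"},
})
exact = router.run_tool("pack_codebase", path=".")
via_server_name = router.run_tool("repomix", path=".")
via_typo = router.run_tool("read_fle", path="a.txt")
```

A fairly strict cutoff keeps the correction conservative: a near miss is repaired, while a name that resembles nothing raises an error instead of silently running the wrong tool.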
Smart Context Valve: It inspects traffic to detect if the agent is requesting a massive context dump (like a full repo summary) more than once in a session.
First request: Allowed.
Subsequent requests: Blocked with a "Context already cached" message (~20 tokens). This stops the agent from spending tokens and bandwidth on duplicate data.
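The valve behavior above can be sketched as a small wrapper around the upstream call. One simplifying assumption here: heavy requests are recognized by tool name, whereas the real detection in mcpv may inspect the traffic differently. ContextValve and pack_codebase are hypothetical names.

```python
from typing import Callable, Set


class ContextValve:
    """Lets each heavy context request through once per session."""

    BLOCKED = "Context already cached. Reuse the earlier result."

    def __init__(self, heavy_tools: Set[str]):
        self._heavy = heavy_tools
        self._served: Set[str] = set()

    def call(self, tool: str, forward: Callable[[], str]) -> str:
        if tool in self._heavy and tool in self._served:
            # Duplicate dump: answer with a short stub, skip the upstream.
            return self.BLOCKED
        result = forward()
        if tool in self._heavy:
            # Mark as served only after the upstream call succeeded.
            self._served.add(tool)
        return result


upstream_calls = []


def fake_dump() -> str:
    upstream_calls.append("dump")
    return "<very large repo summary>"


valve = ContextValve({"pack_codebase"})
first = valve.call("pack_codebase", fake_dump)
second = valve.call("pack_codebase", fake_dump)
```

Creating one valve per agent session gives the "once per session" semantics, and the expensive upstream call is never made for the blocked request.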
Booster Injection (Windows/Antigravity): It auto-generates an optimized launcher script that injects hidden Chromium flags (--enable-gpu-rasterization) to force GPU acceleration. It also drops Admin rights via RunAsInvoker to fix drag-and-drop bugs and automatically kills zombie processes on port 26646.
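As a hedged illustration of the booster, the sketch below builds a batch launcher and finds the PIDs listening on a port from `netstat -ano` output. The function names and the executable path are hypothetical. It assumes the English netstat column layout, and it is not how mcpv necessarily implements this.

```python
from typing import Set


def build_launcher(exe_path: str) -> str:
    """Return the text of a .bat launcher (nothing is written to disk)."""
    lines = [
        "@echo off",
        # Run with the caller's privileges instead of elevating.
        "set __COMPAT_LAYER=RunAsInvoker",
        f'start "" "{exe_path}" --enable-gpu-rasterization',
    ]
    return "\r\n".join(lines) + "\r\n"


def pids_listening_on(netstat_output: str, port: int) -> Set[int]:
    """Parse `netstat -ano` text and return PIDs listening on `port`."""
    pids = set()
    for line in netstat_output.splitlines():
        fields = line.split()
        # Expected row: TCP  127.0.0.1:26646  0.0.0.0:0  LISTENING  4321
        if len(fields) == 5 and fields[0] == "TCP" and fields[3] == "LISTENING":
            if fields[1].rsplit(":", 1)[-1] == str(port):
                pids.add(int(fields[4]))
    return pids


sample = (
    "  TCP    127.0.0.1:26646    0.0.0.0:0          LISTENING    4321\n"
    "  TCP    127.0.0.1:5000     127.0.0.1:26646    ESTABLISHED  77\n"
    "  TCP    127.0.0.1:266460   0.0.0.0:0          LISTENING    99\n"
)
zombies = pids_listening_on(sample, 26646)
launcher = build_launcher(r"C:\Path\To\Agent.exe")  # placeholder path
# Each PID could then be ended with, for example:
#   subprocess.run(["taskkill", "/PID", str(pid), "/F"])
```

Matching the full port after the last colon avoids false hits on ports that merely start with the same digits, and rows that are not in the LISTENING state are ignored.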
Tech Stack
Python & uv: Uses uv for isolated virtual environments to prevent dependency hell.
FastMCP: Built on the FastMCP framework for high-throughput handling.
Installation
pip install mcpv
mcpv install  # Required to hijack config and inject the booster

Current State
It is currently optimized for Windows and the Antigravity environment, but the core logic (Gateway Pattern, Smart Router, Smart Valve) is applicable to any MCP-compliant agent.
It's open source (MIT). I’d love to hear your feedback on the architecture or feature requests for other OS/Agents.