Show HN: MCPSpec – Ship reliable MCP servers without writing test code

https://light-handle.github.io/mcpspec/

5•warmcat•1h ago

Hi HN,

I've been building MCPSpec, an open-source CLI for MCP server reliability. Record sessions, generate mock servers, catch Tool Poisoning, and fail your CI build when something's wrong — without writing test code.

There are ways to validate MCP servers today — the MCP Inspector, ad-hoc SDK scripts, unit tests for server internals — but nothing that handles regression detection, security auditing, mock generation, and CI pass/fail checks in one tool. MCPSpec does that:

1. Record a session against your real server, replay it after changes to catch regressions

2. Generate a standalone .js mock from any recording — no API keys, no live server needed in CI

3. Security audit with 8 rules including Tool Poisoning (prompt injection hidden in tool descriptions)

4. 0-100 quality score across documentation, schema quality, error handling, responsiveness, and security

5. One command to generate GitHub Actions / GitLab CI configs

No LLMs in the loop. Deterministic and fast. Ships with 70 ready-to-run tests for 7 popular MCP servers.

GitHub: https://github.com/light-handle/mcpspec Docs: https://light-handle.github.io/mcpspec/

Would love feedback or feature ideas.

Comments

shalakasurya•1h ago

Good way to go. But are there no other projects that have shipped this? Are there any industry standards?

warmcat•1h ago

There are a few tools in the space, but they each focus on parts:

- MCP Inspector (Anthropic's official tool) — interactive debugging, not CI-oriented - MCP-Scan (Invariant Labs, now Snyk) — security scanning, focused on tool poisoning detection - Promptfoo — LLM red teaming tool that added MCP support recently - MCP Protocol Validator — spec compliance checking

MCPSpec tries to be the one tool that covers the full workflow: record, replay, mock, security audit, quality scoring, and CI setup. None of the above do recording/replay or mock generation.

As for standards — there aren't any yet. MCP itself moved under the Linux Foundation's Agentic AI Foundation in December 2025, and NIST launched an AI Agent Standards Initiative last month. But no conformance or testing standard exists. It's still early.

Testing Can Be Fun

Show HN: A high-performance Hex Editor with Yara-X support in C#

Trump's Dollar

Open source project to map relationships and analyze w AI

GridCalc: An RPN Spreadsheet for iOS

Show HN: Inkwell – A lightweight, portable Markdown editor

Companies cutting jobs as investments shift toward AI

If code is cheap, intent is the currency

Show HN: Better Hub – we tried to improve GitHub

Gatekeeper – open-source policy engine and sandbox for AI coding agents

Show HN: Quoroom – local AI swarm (public research)

FireNation – Free Net Worth Dashboard and Fire Planner

Why isn't LA repaving streets?

Railway.gov.gr: Greek Train Tracker

Show HN: Well-net – a friends-only IPv6 network with no central server

Show HN: FilmLink – The Wiki Game for Movies (Daily Puzzle and Multiplayer Beta)

Money in Postgres

The Great Creative Extraction: AI Content Generation Rebuilds Colonial Economics

Racket v9.1

Major gap in Earth's rock record likely due to tectonics–not glaciers

The Rule of Four vs. RFC 3021: Temporal Conflicts in LLM Weights

Large-scale online deanonymization with LLMs (using HN posts)

Following 35% growth, solar has passed hydro on US grid

I Failed 3 Times Building This with AI. In 2026, It Took Days

Some More Game Theory, This Time on the AMD-Meta Platforms Deal

BBNs Toward Universal Fabricators – By Eric Gilliam

A 3D printed iPad tray for a compact dual-screen setup

Dinosaur eggshells can reveal the age of other fossils

Show HN: Engram – Open-source agent memory that beats Mem0 by 20% on LOCOMO

Show HN: Mlut – Tailwind CSS alternative for custom websites and creative coding