Show HN: MCPSpec – Ship reliable MCP servers without writing test code

https://light-handle.github.io/mcpspec/
5•warmcat•1h ago
Hi HN,

I've been building MCPSpec, an open-source CLI for MCP server reliability. Record sessions, generate mock servers, catch Tool Poisoning, and fail your CI build when something's wrong — without writing test code.

There are ways to validate MCP servers today — the MCP Inspector, ad-hoc SDK scripts, unit tests for server internals — but nothing that handles regression detection, security auditing, mock generation, and CI pass/fail checks in one tool. MCPSpec does that:

1. Record a session against your real server, replay it after changes to catch regressions

2. Generate a standalone .js mock from any recording — no API keys, no live server needed in CI

3. Security audit with 8 rules including Tool Poisoning (prompt injection hidden in tool descriptions)

4. 0-100 quality score across documentation, schema quality, error handling, responsiveness, and security

5. One command to generate GitHub Actions / GitLab CI configs
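To make the record/replay idea in item 1 concrete, here is a rough sketch of how a regression check over two recordings could work. It is not MCPSpec's actual code or recording format: the `RecordedCall` shape and the `findRegressions` and `canonical` helpers are hypothetical names made up for illustration.

```typescript
// Hypothetical recording entry; MCPSpec's real on-disk format may differ.
interface RecordedCall {
  tool: string;
  input: Record<string, unknown>;
  output: unknown;
}

interface Regression {
  index: number;
  tool: string;
  expected: unknown;
  actual: unknown;
}

// Serialize with sorted object keys so key order alone never counts as a change.
function canonical(value: unknown): string {
  return JSON.stringify(value, (_key, v) => {
    if (v === null || typeof v !== "object" || Array.isArray(v)) return v;
    const sorted: Record<string, unknown> = {};
    for (const k of Object.keys(v).sort()) {
      sorted[k] = (v as Record<string, unknown>)[k];
    }
    return sorted;
  });
}

// Compare a baseline recording with the outputs captured when the same
// calls were replayed against the changed server.
function findRegressions(
  baseline: RecordedCall[],
  replayed: RecordedCall[]
): Regression[] {
  const regressions: Regression[] = [];
  baseline.forEach((call, index) => {
    const actual = replayed[index]?.output; // undefined if the call is missing
    if (canonical(actual) !== canonical(call.output)) {
      regressions.push({ index, tool: call.tool, expected: call.output, actual });
    }
  });
  return regressions;
}
```

A CI step could then fail the build whenever `findRegressions` returns a non-empty list. Comparing canonical JSON keeps the check deterministic, at the cost of flagging outputs that legitimately vary between runs (timestamps, generated IDs), which a real tool would need some way to ignore.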

No LLMs in the loop. Deterministic and fast. Ships with 70 ready-to-run tests for 7 popular MCP servers.
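For a sense of what a deterministic, LLM-free Tool Poisoning check could look like, here is a minimal sketch that pattern-matches tool descriptions. The rule names, the regular expressions, and the `scanTools` function are assumptions for illustration only; they are not MCPSpec's actual 8 rules or its API.

```typescript
// Minimal view of a tool as advertised by an MCP server.
interface ToolDefinition {
  name: string;
  description: string;
}

interface Finding {
  tool: string;
  rule: string;
}

// Illustrative patterns only; a real audit would need a broader, tuned rule set.
const SUSPICIOUS_PATTERNS: { rule: string; pattern: RegExp }[] = [
  {
    rule: "instruction-override",
    pattern: /ignore (all |any )?(previous|prior) instructions/i,
  },
  {
    rule: "hidden-from-user",
    pattern: /do not (tell|mention|reveal)[^.]*\buser\b/i,
  },
  {
    rule: "hidden-markup",
    pattern: /<(important|system|secret)>/i,
  },
];

// Return one finding per (tool, rule) pair whose pattern matches the description.
function scanTools(tools: ToolDefinition[]): Finding[] {
  const findings: Finding[] = [];
  for (const tool of tools) {
    for (const { rule, pattern } of SUSPICIOUS_PATTERNS) {
      if (pattern.test(tool.description)) {
        findings.push({ tool: tool.name, rule });
      }
    }
  }
  return findings;
}
```

Because the check is plain pattern matching, the same description always produces the same findings, which is what makes it usable as a CI gate. The trade-off is that it only catches phrasings the rules anticipate.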

GitHub: https://github.com/light-handle/mcpspec

Docs: https://light-handle.github.io/mcpspec/

Would love feedback or feature ideas.

Comments

shalakasurya•1h ago
Good way to go. But are there no other projects that have shipped this? Are there any industry standards?
warmcat•1h ago
There are a few tools in the space, but they each focus on parts:

- MCP Inspector (Anthropic's official tool): interactive debugging, not CI-oriented
- MCP-Scan (Invariant Labs, now Snyk): security scanning, focused on tool poisoning detection
- Promptfoo: LLM red teaming tool that added MCP support recently
- MCP Protocol Validator: spec compliance checking

MCPSpec tries to be the one tool that covers the full workflow: record, replay, mock, security audit, quality scoring, and CI setup. None of the above do recording/replay or mock generation.

As for standards — there aren't any yet. MCP itself moved under the Linux Foundation's Agentic AI Foundation in December 2025, and NIST launched an AI Agent Standards Initiative last month. But no conformance or testing standard exists. It's still early.
