news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I built a CLI to test and eval MCP servers

https://www.npmjs.com/package/@mcpjam/cli

4•matt8p•1h ago

Comments

matt8p•1h ago

Hi folks, we've been working on a CLI tool to programatically test and eval MCP servers. Looking to get some initial feedback on the project.

Let's say you're testing PayPal MCP. You can write a test case prompt "Create a refund order for order 412". The test will run the prompt and check if the right PayPal tool was called.

The CLI helps with: 1. Test different prompts and observe how LLMs interact with your MCP server. The CLI shows a trace of the conversation. 2. Examine your server's tool name / description quality. See where LLMs are hallucinating using your server. 3. Analyze your MCP server's performance, like token consumption, and performance with different models. 4. Benchmarking your MCP server's performance to catch future regressions.

The nice thing about CLI is that you can run these tests iteratively! Please give it a try, and would really appreciate your feedback.

Letter Format – Professional Letter Templates and Editor

https://letterformat.org

1•wsljhint•33s ago•0 comments

Trump administration rehiring workers laid off by DOGE

https://thehill.com/homenews/administration/5519449-trump-administration-doge-rehiring/

1•MilnerRoute•55s ago•0 comments

Candlestick Pattern Practice – Quiz

https://trendlinegala.com

1•ExclusiveVirtue•2m ago•1 comments

The "Wage Level" Mirage: H-1B proposal could help outsourcers and hurt US talent

https://ifp.org/the-wage-level-mirage/

1•johntfella•2m ago•0 comments

America Leads the World in AI Skepticism

https://benjamingrayzel.substack.com/p/america-leads-the-world-in-ai-skepticism

1•The_Gray•2m ago•0 comments

Greatest 404 Page

https://statmodeling.stat.columbia.edu/2025/09/23/worlds-greatest-404-page/

1•neehao•7m ago•1 comments

Startup Modular raises $250M to challenge Nvidia's software dominance

https://www.ft.com/content/bc07f94d-de30-437b-8109-d15781abf77f

1•throwaway2037•11m ago•0 comments

Thinking Is Doing

https://takezo.bearblog.dev/thinking-is-doing/

2•trev9065•15m ago•0 comments

Bearie – a cuddly AI plush that kids can talk to in real time

https://www.lifetoy.ai

1•a_r_cheraghi•15m ago•3 comments

Got annoyed with Domains so I made a Free tool

https://www.wonetic.com/

1•itsolidude•19m ago•1 comments

Omni-Bodied Robot Brain

https://www.skild.ai/blogs/omni-bodied

1•ricardobeat•21m ago•0 comments

OpenAI will devour as much power as NYC and San Diego combined

https://fortune.com/2025/09/24/sam-altman-ai-empire-new-york-city-san-diego-scary/

2•geox•23m ago•1 comments

A Curated List of 700 Dictionary Domains from 10 Categories

https://lexicondomains.com

1•piranhas•23m ago•1 comments

The Question Concerning Technology

https://en.wikipedia.org/wiki/The_Question_Concerning_Technology

1•doener•28m ago•0 comments

Fenghua No.3 GPU – CUDA Compatibility, RT Support and 112GB+ of HBM Memory

https://www.tomshardware.com/pc-components/gpus/chinas-latest-gpu-arrives-with-claims-of-cuda-com...

1•CaptainOfCoit•32m ago•0 comments

Show HN: PVE VM API, support one-click download official images and deployment

https://github.com/pardnchiu/go-qemu

1•pardnchiu•33m ago•0 comments

Python to win them all – revisited

https://substack.com/inbox/post/174492902

1•mathattack•34m ago•0 comments

Our plan for a more secure NPM supply chain

https://github.blog/security/supply-chain-security/our-plan-for-a-more-secure-npm-supply-chain/

4•nnx•34m ago•0 comments

Are unique brand names better than generic ones for visibility in AI search?

1•piranhas•35m ago•0 comments

Fill probability estimates in institutional bond trading with quantum computers

https://arxiv.org/abs/2509.17715

1•polrjoy•36m ago•1 comments

The mystery of the dog in Rembrandt's 'The Night Watch' is solved

https://www.nytimes.com/2025/09/23/arts/rembrandt-the-night-watch-dog.html

1•bookofjoe•36m ago•1 comments

Rustroid, a Rust IDE for Android

https://rustroid.is-a.dev/story

2•coolcoder613•39m ago•0 comments

TV in the United States

https://mastodon.online/@mastodonmigration/115261985305708087

2•doener•41m ago•0 comments

The Third Chair

https://www.henrikkarlsson.xyz/p/third-chair

2•delichon•53m ago•0 comments

Argentina's Javier Milei lost the markets and turned to Donald Trump

https://www.ft.com/content/e5e314d0-31cf-44e0-9167-63a787baac47

7•doener•53m ago•5 comments

Calling all innovators Free multi-week collaborative tech series starting soon

https://www.girlhacks.net/events/innovate-series-2025

2•shuchiag•59m ago•1 comments

Fewer H-1B Visas Did Not Mean More Employment for Natives (2017)

https://www.nber.org/digest/dec17/fewer-h-1b-visas-did-not-mean-more-employment-natives

23•tuan•1h ago•26 comments

Minus World – The Glitch in Super Mario That Obsessed Gamers

https://www.bbc.com/culture/article/20250923-the-glitch-in-super-mario-bros-that-obsessed-gamers

6•jeffwass•1h ago•0 comments

The Graphing Calculator Story

https://www.pacifict.com/Story/

3•animal_spirits•1h ago•1 comments

Cloudflare Enters the Robots.txt Fray with a Content Signals Policy for AI Bots

https://www.searchengineworld.com/cloudflare-enters-the-robots-txt-fray-with-a-content-signals-po...

2•bhartzer•1h ago•0 comments