Show HN: MCP App Template designed for coding agents

https://github.com/sebderhy/mcp-app-template

4•sebderhy•1h ago

Comments

sebderhy•1h ago

Hi, author of the repo speaking here!

When I tried building MCP Apps [1], the official repos (https://github.com/openai/openai-apps-sdk-examples, https://github.com/modelcontextprotocol/ext-apps/tree/main/e...) were great starting points, but they're designed for human developers. When I used them with Claude Code, I ended up in the usual loop: agent writes code → I manually test the app on ChatGPT → describe errors back → repeat. Plus, we didn't know what the best practices were, and we struggled to enforce them.

So I built an MCP App template designed for coding agents to work as autonomously as possible on an MCP app.

The key idea: orthogonal testing. 450+ tests parameterized across 12 widget modules that verify infrastructure (protocol compliance, best practices grade, browser rendering), not business logic. Modify widgets, change data, add features — the tests should still pass. Agents iterate freely and get feedback without a human in the loop.
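To make "orthogonal" concrete, here is a minimal sketch of the idea, not the repo's actual test suite. The `Widget` class, the `check_infrastructure` function, and the `show_qr_code` tool name are hypothetical, and the three checks are simplified assumptions about what an infrastructure test might look at. The point is that the same checks run against every widget and never inspect what the widget computes.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Widget:
    """Hypothetical stand-in for one widget module in the template."""
    tool_name: str
    input_schema: dict
    handler: Callable[[dict], dict]


def check_infrastructure(widget: Widget) -> list[str]:
    """Return infrastructure violations; business logic is never inspected."""
    problems = []
    if not widget.tool_name.isidentifier():
        problems.append("tool name is not a valid identifier")
    if widget.input_schema.get("type") != "object":
        problems.append("input schema must describe a JSON object")
    result = widget.handler({})
    if not isinstance(result, dict) or "content" not in result:
        problems.append("handler result lacks a 'content' field")
    return problems


# Illustrative widgets only; the real template ships 12 modules.
WIDGETS = [
    Widget("show_carousel", {"type": "object"}, lambda args: {"content": ["slide"]}),
    Widget("show_qr_code", {"type": "object"}, lambda args: {"content": ["qr"]}),
]


def run_all(widgets: list[Widget]) -> dict[str, list[str]]:
    """Apply the same checks to every widget, keyed by tool name."""
    return {w.tool_name: check_infrastructure(w) for w in widgets}


if __name__ == "__main__":
    for name, problems in run_all(WIDGETS).items():
        print(name, "OK" if not problems else problems)
```

Changing what a handler puts inside `content` leaves every check green, while breaking the schema or the result shape fails immediately. That is the property that lets an agent iterate without a human in the loop.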

Other features:
- Hierarchical documentation that includes the MCP-App & OpenAI Apps SDK official llms.txt files
- Local chat simulator app that works even without API keys via Puter.js
- Visual testing of every widget: pnpm run ui-test --tool show_carousel → screenshot at /tmp/ui-test/screenshot.png
- 12 working examples (QR codes to 3D solar system) gathered from the official repos mentioned above

The repo includes an unedited ~15 min video of Claude Code building an app autonomously; the resulting app worked directly within ChatGPT.

I'd love to hear how it goes if you try it. Or even better: ask your agent for feedback, and post it here!

[1] MCP Apps (https://modelcontextprotocol.io/docs/extensions/apps) let you build interactive widgets that run inside Claude, ChatGPT, VS Code, and other AI hosts. Unlike smartphone apps, the same code can be deployed to all platforms.

aflam•1h ago

First time I've seen a clear split where the README markets to humans and the AGENTS.md has a clean tutorial for LLMs.

I'll give it a try. MCP apps are full of promise, but the protocols are so unstable that I wouldn't want to write the boilerplate myself.

sebderhy•1h ago

Yeah, I have to say that finding the right balance between what to write and not to write in the AGENTS.md is quite hard.

Regarding the protocols being unstable, that's quite a fair point. Maybe it is possible to automate this? That is, detecting changes in the official docs automatically, and having a coding agent adapt the template's docs and tests accordingly.
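The detection half of that idea could be as simple as fingerprinting each official doc and comparing against the last known value. The sketch below is only an illustration under that assumption, not something the repo does today. The function names and the doc keys are hypothetical, and fetching the docs is left out on purpose.

```python
import hashlib


def fingerprint(text: str) -> str:
    """Stable SHA-256 fingerprint of a document's text."""
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


def changed_docs(known: dict[str, str], fetched: dict[str, str]) -> list[str]:
    """Names of docs whose content is new or differs from the stored fingerprint.

    known:   doc name -> fingerprint recorded on the previous run
    fetched: doc name -> freshly downloaded text (download step not shown)
    """
    return sorted(
        name for name, text in fetched.items()
        if known.get(name) != fingerprint(text)
    )


if __name__ == "__main__":
    # Hypothetical doc names and contents, for illustration only.
    known = {"apps-sdk-llms.txt": fingerprint("v1 of the docs")}
    fetched = {
        "apps-sdk-llms.txt": "v2 of the docs",
        "mcp-apps-llms.txt": "first fetch",
    }
    print(changed_docs(known, fetched))
```

Anything returned by `changed_docs` would then be handed to the coding agent as the list of docs to re-read before updating the template.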

kevco75•1h ago

Crazy! Testing now

sebderhy•1h ago

Hope to hear *your coding agent's* feedback!
