nsomani•3h ago
I open-sourced a repo that (1) identifies all of the clickable components in a web app using `computer-use-preview`, (2) traverses the tree of actions using Browserbase/Stagehand, and (3) generates a reasonable MCP interface using GPT-5: https://github.com/neelsomani/web2mcp
Here's an example of it running on a demo app: https://drive.google.com/file/d/1dy5lllRKkc7_usiiwsdOEMQTbdq...
A less trivial example, where it generates a video using Hedra Character 3: https://drive.google.com/file/d/1JeFidakOB8NYyB7LfrgPum40Pce...
You just need to set your login credentials in the .env file as specified in the README. Let me know if you have any questions - happy to share reasoning on the design!
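For anyone skimming, the three steps above can be sketched roughly like this. This is a toy illustration and not the code in the repo: `FAKE_APP`, `Action`, `discover_actions`, `traverse`, and `to_mcp_tools` are all hypothetical names, the hard-coded dictionary stands in for what a vision model and a real browser session would discover, and a simple name-mangling rule stands in for the LLM that writes the interface. The only part taken from MCP itself is the tool shape (`name`, `description`, `inputSchema`).

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field


@dataclass
class Action:
    """One clickable element found on a page (hypothetical shape)."""
    label: str                                       # visible text of the element
    target: str                                      # page reached after clicking
    inputs: list[str] = field(default_factory=list)  # form fields it needs


# Stand-in for step (1): a hard-coded map instead of model-detected elements.
FAKE_APP: dict[str, list[Action]] = {
    "home": [Action("Open settings", "settings"), Action("New post", "editor")],
    "settings": [Action("Save profile", "home", ["display_name"])],
    "editor": [Action("Publish", "home", ["title", "body"])],
}


def discover_actions(page: str) -> list[Action]:
    """Return the clickable actions on a page (assumed to be known here)."""
    return FAKE_APP.get(page, [])


def traverse(start: str) -> list[tuple[list[str], Action]]:
    """Step (2): breadth-first walk that records the click path to each action."""
    seen = {start}
    queue: deque[tuple[str, list[str]]] = deque([(start, [])])
    found: list[tuple[list[str], Action]] = []
    while queue:
        page, path = queue.popleft()
        for action in discover_actions(page):
            found.append((path + [action.label], action))
            if action.target not in seen:  # visit each page only once
                seen.add(action.target)
                queue.append((action.target, path + [action.label]))
    return found


def to_mcp_tools(found: list[tuple[list[str], Action]]) -> list[dict]:
    """Step (3): emit one MCP-style tool per action that takes input."""
    tools = []
    for path, action in found:
        if not action.inputs:
            continue  # pure navigation clicks are folded into the click path
        tools.append({
            "name": action.label.lower().replace(" ", "_"),
            "description": "Click path: " + " > ".join(path),
            "inputSchema": {
                "type": "object",
                "properties": {name: {"type": "string"} for name in action.inputs},
                "required": list(action.inputs),
            },
        })
    return tools


if __name__ == "__main__":
    for tool in to_mcp_tools(traverse("home")):
        print(tool["name"], "<-", tool["description"])
```

The sketch keeps navigation-only clicks out of the tool list and records them as the path needed to reach each form, which is one plausible way to turn a click tree into a small set of callable tools.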