Show HN: Magnitude MCP – vision-first browser interaction for Claude Code

https://github.com/sagekit/magnitude/tree/main/packages/magnitude-mcp

1•anerli•1h ago

Hey HN - Anders and Tom here. We made a browser MCP using the same vision-first scaffolding as our SOTA (94% WebVoyager) browser agent Magnitude. This approach is more flexible and robust than DOM-based interactions and avoids all the associated edge cases. It works particularly well with Claude Code, since Claude is already trained for computer-use tasks and can interact with browser elements precisely using vision alone. It's 100% open source.

The setup is very simple if you want to try it:
```
npm i -g magnitude-mcp@latest
claude mcp add magnitude -- npx magnitude-mcp
```
Then spin up Claude Code and ask it to do something in the browser.

The MCP allows Claude Code (or any coding agent) to:
- Open a browser with a persistent profile
- Click, type, drag, etc. using pixel-based coordinates
- Automatically stay aware of current state with a screenshot after each interaction
- Coordinate multiple actions at once for efficiency
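To make the "pixel-based coordinates" and "multiple actions at once" points a bit more concrete, here is a rough sketch of what a batch of coordinate-based actions could look like, along with a bounds check against the screenshot size. The `Action`, `Viewport`, and `validateBatch` names are made up for illustration and are not Magnitude's actual tool schema; check the repo for the real one.

```typescript
// Hypothetical action shapes for illustration only; not Magnitude's real tool schema.
type Point = { x: number; y: number };

type Action =
  | { kind: "click"; at: Point }
  | { kind: "type"; text: string }
  | { kind: "drag"; from: Point; to: Point };

type Viewport = { width: number; height: number };

// True when the point lies inside the screenshot's pixel grid.
function inBounds(p: Point, v: Viewport): boolean {
  return p.x >= 0 && p.y >= 0 && p.x < v.width && p.y < v.height;
}

// Collect one readable error per coordinate that falls outside the viewport.
function validateBatch(actions: Action[], v: Viewport): string[] {
  const errors: string[] = [];
  actions.forEach((action, i) => {
    const points: Point[] =
      action.kind === "click"
        ? [action.at]
        : action.kind === "drag"
          ? [action.from, action.to]
          : [];
    for (const p of points) {
      if (!inBounds(p, v)) {
        errors.push(
          `action ${i} (${action.kind}): (${p.x}, ${p.y}) is outside ${v.width}x${v.height}`
        );
      }
    }
  });
  return errors;
}

// One batch: focus a field, type into it, then drag a slider too far to the right.
const batch: Action[] = [
  { kind: "click", at: { x: 412, y: 230 } },
  { kind: "type", text: "hello" },
  { kind: "drag", from: { x: 100, y: 500 }, to: { x: 1400, y: 500 } },
];

const problems = validateBatch(batch, { width: 1280, height: 720 });
console.log(problems);
```

In this sketch the whole batch is checked before anything runs, so a bad coordinate in the last action is caught without the earlier ones being half-applied.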

We made this MCP because we realized it would be really helpful in our own engineering workloads. We are building a massive number of first-party integrations for Sagekit (our workflow automation platform), and to do so we need to give Claude Code as many tools as possible to act with a high degree of autonomy, including a browser it can use reliably.

The only downside is that only certain models can use the MCP effectively, because not all models can pinpoint exact pixel coordinates based on a screenshot. These models roughly include Claude (Sonnet 3.7, Sonnet 4, Opus 4), Qwen 2.5 VL-based models (Qwen 2.5 VL 72B, UI TARS, etc.), and a few others specifically trained for it. Given that we personally use Claude for all our coding anyway, this seemed acceptable.

After using Claude Code to build 15+ integrations in a week, it became obvious that certain tools are necessary to take us out of the dev loop more often and produce high quality code autonomously. A browser was an obvious place to start, and we already had a browser agent we could repurpose.

So far, this is what we've personally found it useful for:
- Seeing and interacting with web apps as it builds features or fixes issues
- Configuring dummy data that can't be accessed programmatically
- Browsing documentation on sites where fetch doesn't work
- Improvised frontend testing

Anyway, we thought we would share in case other engineers find it useful in their workloads. It can also be used in non-engineering work (like booking a flight!!) if you so desire.
