frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Deterministic browser control for AI agents (~90% on Mind2Web)

https://github.com/theredsix/agent-browser-protocol
12•theredsix•8h ago

Comments

theredsix•8h ago
Hi HN, op here! This is an open source browser protocol for LLM agents.

The browser shows the model the current page, the model chooses the next action, and the browser returns the new state. Between steps, JavaScript and time are frozen so the page stays still while the model thinks.

That makes things like ecommerce shopping and popup-heavy web app workflows much more reliable.

Using this setup, the project gets ~90% on Online Mind2Web. My bet is that browser agents need a protocol designed for models, not just wrappers around CDP.

bignoggins•7h ago
what do you do differently compared to other options?
theredsix•7h ago
The difference is that we make browser use turn-based and return a single structured result per action.

With most other tools, the model is interacting with a live browser and effectively has to reason through a stream of low-level events while the page keeps changing. We instead freeze the page, let the model request one action, execute it, allow all resulting browser events to play out, then freeze again and return one bundled response with everything that happened plus the new stable page state.

So the model isn’t chasing a moving UI or event stream. It gets one grounded step at a time. A big part of the performance gain seems to come from that holistic action envelope.

shane-moran•7h ago
This is a great example of breaking the trend thinking it’s always model improvement needed, but sometimes the model doesn’t have the best way to interface with the data or system.

The cleanliness of this approach that improves the ability for the model to interact without having to completely redefine the interface system with respect to still being able to use websites and the computer as is and not having to develop an entirely new interface protocol at the machine level.

theredsix•7h ago
Exactly, the harness or protocol can matter just as much!
Thors3n•5h ago
Very exciting stuff! Most agent browser stacks still feel clunky to me. This is very promising, turning browsing into deterministic, atomic steps should definitely improve user interaction and E2E utility.
greggberry•5h ago
This project is incredible!

I already have it set up for local Claude agent use and seeing significant improvement, both in accuracy and task efficiency: `claude mcp add browser -- npx -y agent-browser-protocol@rc --mcp`

Additionally, if you want to configure with Claude Desktop, add the following to your `claude_desktop_config.json` after installing the MCP:

``` "mcpServers": { "browser": { "command": "npx", "args": [ "-y", "agent-browser-protocol@rc", "--mcp" ] } } ```

Russian Ransomware Administrator Pleads Guilty to Wire Fraud Conspiracy

https://www.justice.gov/usao-md/pr/russian-ransomware-administrator-pleads-guilty-wire-fraud-cons...
1•737min•20s ago•0 comments

Show HN: Rust-First L3 Limit Order Book Backtesting Engine with Python Bindings

https://github.com/chasemetoyer/Backtesting-Engine
1•chasemetoyer•26s ago•0 comments

Show HN: Ovumcy – self-hosted menstrual cycle tracker

https://github.com/terraincognita07/ovumcy
1•terrain07•48s ago•0 comments

Show HN: Sheila, an AI agent that replaced our accounting flow

https://soapbox.pub/blog/announcing-sheila/
3•knewter•9m ago•1 comments

Qualcomm CEO: 'Resistance Is Futile' as 6G Mobile Revolution Approaches

https://fortune.com/2026/03/03/qualcomm-ceo-resistance-is-futile-6g-mobile-revolution-approaches/
2•m463•10m ago•1 comments

Show HN: NeoNetrek – modernizing the internet's first team game (1988)

https://neonetrek.com
1•yuriksan•12m ago•0 comments

Show HN: Natural language queries for Prometheus Kafka metrics (StreamLens)

https://github.com/muralibasani/streamlens
1•muralibasani•12m ago•0 comments

Satellite firm pauses imagery after revealing Iran's attacks on US bases

https://arstechnica.com/space/2026/03/satellite-firm-pauses-imagery-after-revealing-irans-attacks...
1•consumer451•14m ago•0 comments

China Suspected in Breach of FBI Surveillance Network

https://www.wsj.com/politics/national-security/china-suspected-in-breach-of-fbi-surveillance-netw...
2•JumpCrisscross•14m ago•0 comments

Show HN: I created list of directories (1000) to create free backlinks

https://kitful.ai/directories
1•eashish93•16m ago•0 comments

Fishing crews in the Atlantic keep accidentally dredging up chemical weapons

https://arstechnica.com/health/2026/03/fishing-crews-in-the-atlantic-keep-accidentally-dredging-u...
2•jnord•18m ago•0 comments

The National Videogame Museum Has Acquired the Mythical Nintendo PlayStation

https://www.engadget.com/gaming/the-national-videogame-museum-has-acquired-the-mythical-nintendo-...
2•breve•21m ago•0 comments

C# Strings Silently Kill Your SQL Server Indexes in Dapper

https://consultwithgriff.com/dapper-nvarchar-implicit-conversion-performance-trap
5•PretzelFisch•22m ago•0 comments

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

https://github.com/willtobyte/reprobate
1•delduca•22m ago•0 comments

The White House: Touchdown

https://twitter.com/WhiteHouse/status/2030051395294941427
2•TheAlchemist•23m ago•3 comments

Capability-Tiered AI Governance Architecture (CEGP)

https://github.com/babyblueviper1/ai-governance-architecture
2•babyblueviper1•25m ago•1 comments

A new chapter for the Nix language, courtesy of WebAssembly

https://determinate.systems/blog/builtins-wasm/
2•birdculture•26m ago•0 comments

Shipping a Button in 2026 [video]

https://www.youtube.com/watch?v=xE9W9Ghe4Jk
1•Dhvani35729•26m ago•0 comments

Show HN: Stream-native AI that never sleeps, an alternative to OpenClaw

https://github.com/timeplus-io/PulseBot
1•gangtao•32m ago•0 comments

Show HN: Flompt – Visual prompt builder that decomposes prompts into blocks

https://github.com/Nyrok/flompt
1•hkonte•32m ago•0 comments

FBI investigating 'suspicious' cyber activity on system holding wiretaps

https://abcnews.com/Technology/wireStory/fbi-investigating-suspicious-cyber-activity-system-holdi...
1•campuscodi•33m ago•0 comments

Show HN: key-carousel - Key rotation for LLM agents

https://github.com/HalfEmptyDrum/Key-Carousel
4•EmptyDrum•33m ago•1 comments

Device that can extract 1k liters of clean water a day from desert air

https://www.tomshardware.com/tech-industry/device-that-can-extract-1-000-liters-of-clean-water-a-...
3•PaulHoule•37m ago•0 comments

Show HN: Sqry – semantic code search using AST and call graphs

https://sqry.dev
2•verivusai•37m ago•0 comments

The Window Chrome of Our Discontent

https://pxlnv.com/blog/window-chrome-of-our-discontent/
2•zdw•39m ago•0 comments

When Batteries Heat Up, This Membrane "Sweats" It Out

https://axial.acs.org/nanoscience/when-batteries-heat-up-this-membrane-sweats-it-out
1•geox•39m ago•0 comments

Show HN: Stratum - a pure JVM columnar SQL engine using the Java Vector API

https://datahike.io/stratum/
1•whilo•39m ago•1 comments

Wild crows in Sweden help clean up cigarette butts

https://www.samodobrevijesti.com/en/news/wild-crows-in-sweden-help-clean-up-cigarette-butts/
10•jhncls•40m ago•4 comments

Show HN: BLOBs in MariaDB's Memory Engine – No More Disk Spills for Temp Tables

https://jira.mariadb.org/browse/MDEV-38975
1•arcivanov•43m ago•1 comments

Tip me, my life depends on it (2021)

https://idiallo.com/blog/tip-me
2•foxfired•43m ago•0 comments