
Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
8•theredsix•1h ago
Hi HN, I forked Chromium and built agent-browser-protocol (ABP) after noticing that most browser-agent failures aren’t really about the model misunderstanding the page. Instead, the model is reasoning from stale state.

ABP is designed to keep the acting agent synchronized with the browser at every step. After each action (click, type, etc.), it freezes JavaScript execution and rendering, then captures the resulting state. It also compiles the notable events that occurred during that action, such as navigation, file pickers, permission prompts, alerts, and downloads, and sends them back to the agent along with a screenshot of the frozen page state.

The result is that browser interaction starts to feel more like a multimodal chat loop. The agent takes an action, gets back a fresh visual state and a structured summary of what happened, then decides what to do next from there. That fits much better with how LLMs already work.
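A minimal sketch of that action loop in Python, under stated assumptions: `StepResult`, `FakeBrowser`, and `run_agent` are hypothetical names made up for illustration, not ABP's actual API, and the "browser" here is a stub that only simulates freezing and capturing state.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class StepResult:
    """Hypothetical shape of what the agent receives after one action."""
    screenshot: bytes                                  # image of the frozen page
    events: List[str] = field(default_factory=list)    # e.g. "navigation"


class FakeBrowser:
    """Stub standing in for a browser that freezes the page after each action."""

    def __init__(self) -> None:
        self.log: List[str] = []

    def step(self, action: str) -> StepResult:
        # A real browser would perform the action, pause JavaScript and
        # rendering, and capture the page. This stub only records the action
        # and pretends that every click causes a navigation.
        self.log.append(action)
        events = ["navigation"] if action.startswith("click") else []
        frame = f"frame-{len(self.log)}".encode()
        return StepResult(screenshot=frame, events=events)


def run_agent(
    browser: FakeBrowser,
    decide: Callable[[StepResult], Optional[str]],
    first_action: str,
    max_steps: int = 10,
) -> List[StepResult]:
    """Act, observe the fresh state, decide; repeat until decide returns None."""
    history: List[StepResult] = []
    action: Optional[str] = first_action
    while action is not None and len(history) < max_steps:
        result = browser.step(action)   # state is captured after the action
        history.append(result)
        action = decide(result)         # next decision uses only fresh state
    return history


def scripted_policy(plan: List[str]) -> Callable[[StepResult], Optional[str]]:
    """Toy policy that replays a fixed plan; a real agent would call an LLM."""
    remaining = iter(plan)
    return lambda result: next(remaining, None)
```

Calling `run_agent(FakeBrowser(), scripted_policy(["type:hello", "click:submit"]), "click:search-box")` runs three steps. The point of the shape is that every decision is made from the result of the most recent action rather than from an older screenshot.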

A few common browser-use failures ABP helps eliminate:

* A modal appears after the last Playwright screenshot and blocks the input the agent was about to use
* Dynamic filters cause the page to reflow between steps
* An autocomplete dropdown opens and covers the element the agent intended to click
* alert() / confirm() interrupts the flow
* Downloads are triggered, but the agent has no reliable way to know when they’ve completed
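To illustrate how a structured event summary could guard against failures like these, here is a small Python sketch. The event names (`"alert"`, `"download_started"`, and so on) and the `next_action` helper are assumptions invented for this example; ABP's real event format may differ.

```python
from typing import List

# Hypothetical event names; these are not taken from ABP's documentation.
BLOCKING_EVENTS = {"alert", "confirm", "file_picker", "permission_prompt"}


def next_action(planned: str, events: List[str]) -> str:
    """Choose what to do next given the events reported for the last action."""
    # A dialog or prompt must be dealt with before the planned action runs.
    for event in events:
        if event in BLOCKING_EVENTS:
            return f"handle:{event}"
    # A download that has started but not finished means the agent should wait.
    if "download_started" in events and "download_completed" not in events:
        return "wait:download"
    # Nothing interrupted the flow, so the planned action is still valid.
    return planned
```

For example, `next_action("click:buy", ["alert"])` yields `"handle:alert"`, while an empty event list leaves the planned action unchanged.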

As proof, ABP with Opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark. I think modern LLMs already understand websites; they just need a better tool to interact with them. Happy to answer questions about the architecture, forking Chrome, or anything else in the comments below.

Try it out: `claude mcp add browser -- npx -y agent-browser-protocol --mcp` (Codex/OpenCode instructions in the docs)

Demo video: https://www.loom.com/share/387f6349196f417d8b4b16a5452c3369

Comments

theredsix•1h ago
OP here, happy to answer any questions!
giancarlostoro•12m ago
Interesting. I wonder if this would help other projects too. One that comes to mind is ArchiveBox: I don't know if they still have the issue I'm thinking of, but its Chrome instances would eventually (as the meme goes) consume all available RAM. If freezing execution could stop that, this could be useful for more than just AI agents.
Retr0id•8m ago
> As proof, ABP with opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark

And what does Opus score with "regular" browser harnesses?

LLMs as a Direct Interface to Event Streams

https://www.eventsourcing.ai/deeper-insights/llms-as-a-direct-interface-to-event-streams/
1•goloroden•1m ago•0 comments

Chat Control Vote in the EU to End Untargeted Mass Scanning of Private Chats

https://www.patrick-breyer.de/en/historic-chat-control-vote-in-the-eu-parliament-meps-vote-to-end...
2•dinowars•2m ago•0 comments

Nielsen's Gracenote sues OpenAI over use of metadata in AI training

https://www.reuters.com/business/media-telecom/nielsens-gracenote-sues-openai-over-use-metadata-a...
2•1vuio0pswjnm7•3m ago•0 comments

Alex Gerko funds telescope showing 'cartography of the universe'

https://www.ft.com/content/74e5ce31-ca84-4ae0-96da-df517a015148
1•samizdis•3m ago•1 comments

The Internet Is for End Users

https://datatracker.ietf.org/doc/html/rfc8890
1•flipped•4m ago•0 comments

Major US tech firms are potential Iranian targets, state media says

https://www.cnn.com/world/live-news/iran-war-us-israel-trump-03-11-26
3•mrtksn•4m ago•0 comments

An unappetizing shrub became different vegetables

https://www.worksinprogress.news/p/many-of-the-tastiest-vegetables-are
1•bensouthwood•4m ago•0 comments

What Is a City When Its Wealthiest Leave?

https://www.wsj.com/real-estate/miami-property-taxes-billionaires-future-cities-a9bdda45
2•simonebrunozzi•5m ago•2 comments

Review of the Agent Internet RFCs

https://github.com/andre-git/agent-internet-rfcs
1•andre-hn•6m ago•1 comments

OpenBSD Ext4fs Update

https://www.kmx.io/blog/openbsd-ext4fs-update
1•thodg•7m ago•0 comments

Effects of Deliberate, Contingency and Reactive Planning on Effectiveness (2008)

https://www.researchgate.net/publication/247720525
1•pitched•8m ago•0 comments

AI Commoditization: Open-Source Parity Is a Pricing Problem

https://philippdubach.com/posts/ai-models-are-the-new-rebar/
1•7777777phil•8m ago•0 comments

Claude Login Outage

https://old.reddit.com/r/ClaudeAI/comments/1rqvucq/claude_status_update_elevated_errors_on_claudeai/
1•nurettin•8m ago•0 comments

All Models Are Wrong, Some Are Useful

https://docs.eventsourcingdb.io/blog/2026/03/12/all-models-are-wrong-some-are-useful/
1•goloroden•8m ago•0 comments

Is It Impossible to Make a Living as an Animator in Japan?

https://www.animenewsnetwork.com/feature/2026-02-20/is-it-really-impossible-to-make-a-living-as-a...
1•PaulHoule•9m ago•0 comments

Our ancestors used mushrooms to change the course of human history

https://www.newscientist.com/article/2516720-how-our-ancestors-used-mushrooms-to-change-the-cours...
1•Brajeshwar•11m ago•0 comments

Mathematics is undergoing the biggest change in its history

https://www.newscientist.com/article/2518526-mathematics-is-undergoing-the-biggest-change-in-its-...
2•Brajeshwar•11m ago•0 comments

New African species confirms evolutionary origin of magic mushrooms

https://phys.org/news/2026-03-african-species-evolutionary-magic-mushrooms.html
1•Brajeshwar•11m ago•0 comments

Show HN: I built an interactive globe for verified combat events

https://defogwar.com
2•IMAYousaf•11m ago•2 comments

I built an AI agent in Zig that runs on Windows XP with 64 MB RAM

1•bcorp•14m ago•1 comments

Volume, Ambition, Clarity

https://tern.sh/blog/volume-ambition-clarity/
1•trjordan•15m ago•0 comments

AI-powered refund abuse and dispute fraud: The democratization of deception

https://www.ravelin.com/blog/ai-powered-refund-abuse-dispute-fraud
1•speckx•15m ago•0 comments

Track idle, typing, and agent work time across Claude Code sessions

https://github.com/martinambrus/claude_timings_wrapper
1•martinambrus•15m ago•1 comments

Show HN: Rewriting Mongosh in Golang Using Claude

https://github.com/adaptive-scale/go-mongosh
6•debarshri•16m ago•1 comments

Show HN: Klaus – OpenClaw on a VM, batteries included

https://klausai.com/
9•robthompson2018•17m ago•2 comments

ugit: DIY Git in Python

https://www.leshenko.net/p/ugit/#
1•tosh•18m ago•0 comments

YouTube expands AI deepfake detection to politicians, officials, journalists

https://techcrunch.com/2026/03/10/youtube-expands-ai-deepfake-detection-to-politicians-government...
1•Vaslo•18m ago•0 comments

Revolut secures full UK banking licence after lengthy wait

https://uk.finance.yahoo.com/news/revolut-finally-secures-full-uk-144307017.html
2•mmarian•18m ago•0 comments

Okmain: You have an image but you want a colour

https://dgroshev.com/blog/okmain/
1•dgroshev•19m ago•0 comments

OpenUI – The Open Standard for Generative UI

https://github.com/thesysdev/openui
1•Topfi•21m ago•0 comments