frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
28•theredsix•3h ago
Hi HN, I forked chromium and built agent-browser-protocol (ABP) after noticing that most browser-agent failures aren’t really about the model misunderstanding the page. Instead, the problem is that the model is reasoning from a stale state.

ABP is designed to keep the acting agent synchronized with the browser at every step. After each action (click, type, etc), it freezes JavaScript execution and rendering, then captures the resulting state. It also compiles the notable events that occurred during that action loop, such as navigation, file pickers, permission prompts, alerts, and downloads, and sends that along with a screenshot of the frozen page state back to the agent.

The result is that browser interaction starts to feel more like a multimodal chat loop. The agent takes an action, gets back a fresh visual state and a structured summary of what happened, then decides what to do next from there. That fits much better with how LLMs already work.

A few common browser-use failures ABP helps eliminate: * A modal appears after the last Playwright screenshot and blocks the input the agent was about to use * Dynamic filters cause the page to reflow between steps * An autocomplete dropdown opens and covers the element the agent intended to click * alert() / confirm() interrupts the flow * Downloads are triggered, but the agent has no reliable way to know when they’ve completed

As proof, ABP with opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark. I think modern LLMs already understand websites, they just need a better tool to interact with them. Happy to answer questions about the architecture, forking chrome or anything else in the comments below.

Try it out: `claude mcp add browser -- npx -y agent-browser-protocol --mcp` (Codex/OpenCode instructions in the docs)

Demo video: https://www.loom.com/share/387f6349196f417d8b4b16a5452c3369

Comments

theredsix•3h ago
Op here, happy to answer any question!
esafak•1h ago
How does it compare with https://agent-browser.dev/ ? It would be great if you could add it to your table: https://github.com/theredsix/agent-browser-protocol?#compari...
theredsix•17m ago
agent-browser's biggest selling point is a CLI wrapper around CDP/puppeteer for context management. It'll have mostly the same pros/cons as CDP on the table.
giancarlostoro•1h ago
Interesting, I wonder if this would help with other projects too, one project that comes to mind is archivebox, I don't know if they still have the issue I'm thinking of, but archivebox eventually had the Chrome instances (as the meme goes) basically consume all available RAM. If by freezing execution this could stop that, it could be useful for more than just AI agents.
theredsix•13m ago
Yeah, I noticed CPU use goes to near zero during the pausing phase. You can also trigger pause via REST/MCP so a script can take advantage of these abilities as well.
Retr0id•1h ago
> As proof, ABP with opus 4.6 as the driver scores 90.5% on the Online Mind2Web benchmark

And what does opus score with "regular" browser harnesses?

esafak•1h ago
https://huggingface.co/spaces/osunlp/Online_Mind2Web_Leaderb...
Retr0id•1h ago
Hm I can't see Opus 4.6 on there
theredsix•19m ago
I tweeted at the OSUNLP and they're backed up on eval validation. In the meantime, here's the benchmark repo with the saved runs and also instructions on how to run it locally. https://github.com/theredsix/abp-online-mind2web-results
9wzYQbTYsAIc•40m ago
90% easy or 90% average?
theredsix•20m ago
90% average with 85.51% hard!
9wzYQbTYsAIc•18m ago
Nice! Will take a look at this for my homelab - was debating using crawl.cloudflare.com to try it out, as browser rendering was my next stretch goal.
gregpr07•29m ago
Love it! From first principles: this kinda answers the "do we really even need CDP" I always have in my head building browser use...
theredsix•21m ago
Totally, I feel that CDP was designed for a different category of automations.

Temporal: A nine-year journey to fix time in JavaScript

https://bloomberg.github.io/js-blog/post/temporal/
165•robpalmer•2h ago•69 comments

Entities enabling scientific fraud at scale are large, resilient, growing (2025)

https://doi.org/10.1073/pnas.2420092122
175•peyton•4h ago•81 comments

Making WebAssembly a first-class language on the Web

https://hacks.mozilla.org/2026/02/making-webassembly-a-first-class-language-on-the-web/
104•mikece•13h ago•40 comments

Where Some See Strings, She Sees a Space-Time Made of Fractals

https://www.quantamagazine.org/where-some-see-strings-she-sees-a-space-time-made-of-fractals-2026...
46•tzury•1h ago•2 comments

BitNet: 100B Param 1-Bit model for local CPUs

https://github.com/microsoft/BitNet
206•redm•5h ago•108 comments

Show HN: Klaus – OpenClaw on a VM, batteries included

https://klausai.com/
33•robthompson2018•1h ago•18 comments

Elevated errors on login with Claude Code

https://status.claude.com/incidents/jm3b4jjy2jrt
45•zurfer•2h ago•28 comments

Wiz joins Google

https://www.wiz.io/blog/google-closes-deal-to-acquire-wiz
79•aldarisbm•2h ago•53 comments

Lego's 0.002mm specification and its implications for manufacturing (2025)

https://www.thewave.engineer/articles.html/productivity/legos-0002mm-specification-and-its-implic...
270•scrlk•4h ago•211 comments

Launch HN: Prism (YC X25) – Workspace and API to generate and edit videos

https://www.prismvideos.com
14•aliu327•1h ago•6 comments

Faster asin() was hiding in plain sight

https://16bpp.net/blog/post/faster-asin-was-hiding-in-plain-sight/
114•def-pri-pub•3h ago•56 comments

Launch HN: Sentrial (YC W26) – Catch AI Agent Failures Before Your Users Do

https://www.sentrial.com/
9•anayrshukla•1h ago•5 comments

AI Agent Hacks McKinsey

https://codewall.ai/blog/how-we-hacked-mckinseys-ai-platform
211•mycroft_4221•7h ago•87 comments

Show HN: Open-source browser for AI agents

https://github.com/theredsix/agent-browser-protocol
28•theredsix•3h ago•14 comments

The MacBook Neo

https://daringfireball.net/2026/03/the_macbook_neo
36•etothet•6h ago•152 comments

Show HN: Vanilla JavaScript refinery simulator built to explain job to my kids

https://fuelingcuriosity.com/game.html
19•fuelingcurious•49m ago•13 comments

Show HN: I built an ISP infrastructure emulator from scratch with a custom vBNG

https://aether.saphal.me/dashboard/default
16•saphalpdyl•4h ago•1 comments

Fungal Electronics

https://arxiv.org/abs/2111.11231
8•byt3h3ad•53m ago•2 comments

Swiss e-voting pilot can't count 2,048 ballots after decryption failure

https://www.theregister.com/2026/03/11/swiss_evote_usb_snafu/
34•jjgreen•4h ago•77 comments

Building a TB-303 from Scratch

https://loopmaster.xyz/tutorials/tb303-from-scratch
179•stagas•3d ago•69 comments

Searching for the Agentic IDE

https://twitter.com/karpathy/status/2031616709560610993
4•bigwheels•1h ago•1 comments

PeppyOS: A simpler alternative to ROS 2 (now with containers support)

https://peppy.bot/
56•Ekami•3d ago•18 comments

Zig – Type Resolution Redesign and Language Changes

https://ziglang.org/devlog/2026/#2026-03-10
362•Retro_Dev•16h ago•199 comments

Why the global elite gave up on spelling and grammar

https://www.wsj.com/lifestyle/jeffrey-epstein-files-bad-grammar-spelling-trump-ellison-dorsey-gat...
39•matthieu_bl•2h ago•100 comments

Writing my own text editor, and daily-driving it

https://blog.jsbarretto.com/post/text-editor
186•todsacerdoti•15h ago•101 comments

Visualizing Ukkonen's Suffix Tree Algorithm

https://www.abahgat.com/blog/visualizing-ukkonens-algorithm/
10•gsky•1d ago•2 comments

Cloudflare crawl endpoint

https://developers.cloudflare.com/changelog/post/2026-03-10-br-crawl-endpoint/
433•jeffpalmer•19h ago•165 comments

Yann LeCun raises $1B to build AI that understands the physical world

https://www.wired.com/story/yann-lecun-raises-dollar1-billion-to-build-ai-that-understands-the-ph...
572•helloplanets•1d ago•466 comments

Bypassing PatchGuard on Windows x64 (2005)

http://uninformed.org/index.cgi?v=3&a=3&t=txt
11•davikr•3d ago•1 comments

Whistleblower claims ex-DOGE member says he took Social Security data to new job

https://www.washingtonpost.com/politics/2026/03/10/social-security-data-breach-doge-2/
485•raldi•3h ago•200 comments