Show HN: Browser4 – an open-source browser engine for agents and concurrency

7•galaxyeye•1mo ago

Hi HN,

I’d like to share an open-source project we’ve been working on for a while: Browser4.

The motivation came from a recurring frustration: most browser automation tools (Playwright, Selenium, Puppeteer) are excellent for human-written scripts, but start to show friction when used as a core execution layer for AI agents or at very high concurrency.

So instead of building “another wrapper around Playwright”, we experimented with a different direction: designing a browser engine where AI agents are first-class citizens.

### What Browser4 is

Browser4 is a browser automation engine built on native Chrome DevTools Protocol (CDP), with a focus on:

* Coroutine-safe concurrency (designed to run many browser sessions in parallel)

* Agent-oriented APIs (navigation, interaction, extraction as composable actions)

* Hybrid extraction: ML agent driven extraction + LLM extraction + structured selectors + an SQL-like DOM query language (X-SQL)

* Low-level control without Playwright-style abstraction overhead

It’s written in Kotlin/JVM, mainly because we needed predictable concurrency behavior and long-running stability under load.

The project is fully open-source (Apache 2.0).

### What it’s not

* It’s not a drop-in Playwright replacement.

* It’s not a no-code RPA tool.

* It’s not “LLM magic” — LLMs sit outside the browser engine.

Browser4 intentionally stays close to the browser execution layer and leaves planning/reasoning to external agent loops.

### Current use cases we’re testing

* Large-scale web data extraction

* Agentic workflows (search → navigate → extract → summarize)

* Price / content monitoring with frequent revisits

* High-concurrency crawling where browser startup and context switching are bottlenecks

On a single machine, we can sustain very high daily page visits, though we’re still validating benchmarks across different workloads.

### Open questions (where I’d love feedback)

* For agentic systems, does it make sense to bypass Playwright entirely and work closer to CDP?

* Where do you see the biggest pain points when combining LLMs with browser automation today?

* Is JVM a reasonable choice here, or is Python still the better tradeoff despite concurrency limits

* What abstractions would you want in a browser engine built for AI agents?

### Links

* GitHub: https://github.com/platonai/browser4

* Website (light overview): https://browser4.io

Happy to answer technical questions or hear criticism — especially from people running browser automation or agent systems in production.

Thanks for reading.

Comments

SkyRocknRoll•1mo ago

Very much excited about this. *Hybrid extraction: ML agent driven extraction

This is what I have been missing in the existing systems

galaxyeye•1mo ago

Thank you for your comment. I think X-SQL based extraction could also be useful: its ideal for high-complexity data-extraction pipelines with multiple-dozen entities and several hundred fields per entity. What are your thoughts on this idea?

tomfox2•1mo ago

Therefore, we should not be paying for tokens, but for more effective agents.

galaxyeye•1mo ago

Yes, that's why we develop it. And I think X-SQL based extraction could also be useful: its ideal for high-complexity data-extraction pipelines with multiple-dozen entities and several hundred fields per entity. What are your thoughts on this idea?

password-app•1mo ago

The concurrency aspect is interesting - we're building password automation and one of the pain points is that most sites have rate limiting / bot detection that gets triggered if you try to parallelize password changes too aggressively.

Sequential execution with realistic timing delays is actually necessary for our use case. But I can see how other agent applications would benefit from true concurrency.

Are you handling session isolation between concurrent agents? That seems like it would be critical for avoiding state pollution.

galaxyeye•1mo ago

Yes. Browser4 supports concurrent multi-agent execution. At the moment, we support the following scenarios:

A single agent operating on multiple pages (tabs) within the same browser context

Multiple agents operating in parallel across multiple browser contexts, where each context has an isolated profile

Are you specifically looking for multiple agents concurrently operating on the same browser context? If so, could you describe the concrete use case?

Show HN: Stacky – certain block game clone

AIII: A public benchmark for AI narrative and political independence

SectorC: A C Compiler in 512 bytes

The API Is a Dead End; Machines Need a Labor Economy

Digital Iris [video]

New wave of GLP-1 drugs is coming–and they're stronger than Wegovy and Zepbound

Convert tempo (BPM) to millisecond durations for musical note subdivisions

Show HN: Tasty A.F.

The Contagious Taste of Cancer

U.S. Jobs Disappear at Fastest January Pace Since Great Recession

Bithumb mistakenly hands out $195M in Bitcoin to users in 'Random Box' giveaway

Beyond Agentic Coding

OpenClaw ClawHub Broken Windows Theory – If basic sorting isn't working what is?

OpenBSD Copyright Policy

OpenClaw Creator: Why 80% of Apps Will Disappear

What Happens When Technical Debt Vanishes?

AI Is Finally Eating Software's Total Market: Here's What's Next

Computer Science from the Bottom Up

Show HN: A toy compiler I built in high school (runs in browser)

You don't need Mac mini to run OpenClaw

Learning to Reason in 13 Parameters

Convergent Discovery of Critical Phenomena Mathematics Across Disciplines

Ask HN: Will GPU and RAM prices ever go down?

From hunger to luxury: The story behind the most expensive rice (2025)

Substack makes money from hosting Nazi newsletters

A New Crypto Winter Is Here and Even the Biggest Bulls Aren't Certain Why

Moltbook was peak AI theater

Why Claude Cowork is a math problem Indian IT can't solve

Show HN: Built an space travel calculator with vanilla JavaScript v2

Why a 175-Year-Old Glassmaker Is Suddenly an AI Superstar

Show HN: Stacky – certain block game clone

AIII: A public benchmark for AI narrative and political independence

SectorC: A C Compiler in 512 bytes

The API Is a Dead End; Machines Need a Labor Economy

Digital Iris [video]

New wave of GLP-1 drugs is coming–and they're stronger than Wegovy and Zepbound

Convert tempo (BPM) to millisecond durations for musical note subdivisions

Show HN: Tasty A.F.

The Contagious Taste of Cancer

U.S. Jobs Disappear at Fastest January Pace Since Great Recession

Bithumb mistakenly hands out $195M in Bitcoin to users in 'Random Box' giveaway

Beyond Agentic Coding

OpenClaw ClawHub Broken Windows Theory – If basic sorting isn't working what is?

OpenBSD Copyright Policy

OpenClaw Creator: Why 80% of Apps Will Disappear

What Happens When Technical Debt Vanishes?

AI Is Finally Eating Software's Total Market: Here's What's Next

Computer Science from the Bottom Up

Show HN: A toy compiler I built in high school (runs in browser)

You don't need Mac mini to run OpenClaw

Learning to Reason in 13 Parameters

Convergent Discovery of Critical Phenomena Mathematics Across Disciplines

Ask HN: Will GPU and RAM prices ever go down?

From hunger to luxury: The story behind the most expensive rice (2025)

Substack makes money from hosting Nazi newsletters

A New Crypto Winter Is Here and Even the Biggest Bulls Aren't Certain Why

Moltbook was peak AI theater

Why Claude Cowork is a math problem Indian IT can't solve

Show HN: Built an space travel calculator with vanilla JavaScript v2

Why a 175-Year-Old Glassmaker Is Suddenly an AI Superstar

Show HN: Browser4 – an open-source browser engine for agents and concurrency

Comments