Show HN: I built a Chrome extension to let my OpenClaw Bot remote in

1•gideon-claws•1h ago
Sharing a build-in-public update.

I’ve been working with my assistant “Gideon” (running inside OpenClaw) to solve a very specific problem:

I want the agent to control my real browser (logged-in sites, my normal cookies, my actual tabs) - not a sandboxed headless browser - while still keeping the control surface simple and auditable. That way my OpenClaw setup doesn't break the moment a site gets "clever" about detecting automation.

So... we built it! I say “we”, but it was mostly Gideon; I was along for the ride as QA.

Why did we bother?

Well, because the real world is messy.

Headless is fine until you need:

• a session that already exists in your day-to-day browser

• sites like X/Gmail/anything modern that behaves differently under automation

• human-in-the-loop flows where the agent drives, then hands off, then resumes

This connector is basically: agent → my laptop Chrome → real work.

How it works (high level)

There are 3 pieces:

1. Chrome extension (MV3)

• You pair it to a relay URL once

• You explicitly choose what the agent can touch using an OpenClaw tab group

• Actions (click/type/scroll/navigate) are optional and gated

2. Relay service

• Extension connects over WebSocket

• The agent sends commands to the relay (HTTP)

• Relay forwards to the extension; extension returns results (and screenshots)

3. Agent

• Issues actions (navigate/click/type/scroll)

• Requests screenshots for “eyes”

• Can extract some page structure when possible
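
To make the three pieces concrete, here is a rough TypeScript sketch of the relay's job: accept a command from the agent, forward it to the paired extension, and hand the result back. Every name in it (RelayCommand, pairingId, Relay, ExtensionSocket and so on) is made up for illustration and is not our actual wire format, and the socket is a stand-in interface rather than a real WebSocket.

```typescript
// Hypothetical wire format: the field names below are assumptions for illustration.
type AgentAction =
  | { kind: "navigate"; url: string }
  | { kind: "click"; x: number; y: number }
  | { kind: "type"; text: string }
  | { kind: "scroll"; deltaY: number }
  | { kind: "screenshot" };

interface RelayCommand {
  id: string;        // lets the relay match a result to its command
  pairingId: string; // identifies which paired extension should receive it
  tabId: number;
  action: AgentAction;
}

interface RelayResult {
  id: string;
  ok: boolean;
  error?: string;
  screenshotDataUrl?: string;
}

// Stand-in for the open WebSocket to one extension.
interface ExtensionSocket {
  send(data: string): void;
}

type ResultHandler = (result: RelayResult) => void;

class Relay {
  private sockets: Record<string, ExtensionSocket | undefined> = {};
  private pending: Record<string, ResultHandler | undefined> = {};

  // Called when an extension connects over WebSocket.
  attach(pairingId: string, socket: ExtensionSocket): void {
    this.sockets[pairingId] = socket;
  }

  // Called when that WebSocket closes.
  detach(pairingId: string): void {
    delete this.sockets[pairingId];
  }

  // Called by the HTTP handler when the agent posts a command.
  dispatch(cmd: RelayCommand, onResult: ResultHandler): void {
    const socket = this.sockets[cmd.pairingId];
    if (!socket) {
      onResult({ id: cmd.id, ok: false, error: "extension offline" });
      return;
    }
    this.pending[cmd.id] = onResult;
    socket.send(JSON.stringify(cmd));
  }

  // Called when the extension sends a result back over the WebSocket.
  onExtensionMessage(raw: string): void {
    const result = JSON.parse(raw) as RelayResult;
    const handler = this.pending[result.id];
    if (handler) {
      delete this.pending[result.id];
      handler(result);
    }
  }
}
```

The command id is what lets the relay answer the agent's HTTP request once the matching result arrives over the WebSocket. A real relay would also want a timeout for commands that never get a result.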

The security model (non-negotiable)

I don’t want an agent that can randomly click around every tab on my machine.

So the rule is:

• Only tabs I explicitly “Start controlling” (in the OpenClaw group) are eligible

• “Allow Actions” is a separate toggle (so I can keep it read-only most of the time)

• We log what happens so it’s not a black box
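
Those three rules fit in one small check that runs before any command touches a tab. This is a minimal sketch under my own assumptions, not the extension's real code: ControlState, AuditEntry and authorize are hypothetical names, and treating screenshots as the only read-only operation is an assumption too.

```typescript
type ActionKind = "navigate" | "click" | "type" | "scroll" | "screenshot";

interface ControlState {
  controlledTabIds: number[]; // tabs placed in the OpenClaw group via "Start controlling"
  allowActions: boolean;      // the separate "Allow Actions" toggle
}

interface AuditEntry {
  at: number; // timestamp in ms
  tabId: number;
  kind: ActionKind;
  allowed: boolean;
  reason: string;
}

// Assumption: only screenshots count as read-only.
const READ_ONLY_KINDS: ActionKind[] = ["screenshot"];

// Decides whether a command may run and records the decision either way.
function authorize(
  state: ControlState,
  tabId: number,
  kind: ActionKind,
  auditLog: AuditEntry[],
  now: number = Date.now()
): boolean {
  let reason = "ok";
  if (state.controlledTabIds.indexOf(tabId) === -1) {
    reason = "tab not in controlled group";
  } else if (!state.allowActions && READ_ONLY_KINDS.indexOf(kind) === -1) {
    reason = "actions disabled (read-only)";
  }
  const allowed = reason === "ok";
  auditLog.push({ at: now, tabId: tabId, kind: kind, allowed: allowed, reason: reason });
  return allowed;
}
```

Denied requests get logged too, so the audit trail shows what the agent attempted and not just what it was allowed to do.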

What we learned (a.k.a. Chrome MV3 is a gremlin)

Some fun discoveries:

• MV3 service workers love to go to sleep. If your WS lives in the background SW, you’ll see connections that “work… until they don’t” (accept → close loops). We had to build reconnection logic and then work on keeping the SW alive during active control sessions.

• UI needs to match the real state machine. Pairing / connecting / controlling are different states. If you let users do them out of order, it feels broken even when it’s technically working. We’re tightening it so the “happy path” is idiot-proof.

• Modern sites don’t type like normal websites. X in particular uses contenteditable + React event plumbing. “Just set value” doesn’t cut it. We’re upgrading the action layer so typing works reliably.
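
For the first two points, here is roughly the shape I mean: an explicit state machine that rejects out-of-order steps, plus a capped exponential backoff with jitter for reconnects. It is a sketch with names I picked for illustration (ConnState, ConnEvent, transition, reconnectDelayMs); the states mirror pairing / connecting / controlling, and the base and cap values are arbitrary.

```typescript
type ConnState = "unpaired" | "paired" | "connected" | "controlling";
type ConnEvent =
  | "pair" | "unpair"
  | "socketOpen" | "socketClose"
  | "startControl" | "stopControl";

// Allowed transitions. Anything not listed is an out-of-order step.
const TRANSITIONS: Record<ConnState, Partial<Record<ConnEvent, ConnState>>> = {
  unpaired: { pair: "paired" },
  paired: { socketOpen: "connected", unpair: "unpaired" },
  connected: { startControl: "controlling", socketClose: "paired", unpair: "unpaired" },
  controlling: { stopControl: "connected", socketClose: "paired", unpair: "unpaired" },
};

// Returns the next state, or throws so the UI can disable the control instead.
function transition(state: ConnState, event: ConnEvent): ConnState {
  const next = TRANSITIONS[state][event];
  if (next === undefined) {
    throw new Error(`cannot ${event} while ${state}`);
  }
  return next;
}

// Exponential backoff with full jitter: a random delay in [0, min(cap, base * 2^attempt)).
function reconnectDelayMs(
  attempt: number,
  baseMs: number = 500,
  capMs: number = 30000,
  random: () => number = Math.random
): number {
  const ceiling = Math.min(capMs, baseMs * Math.pow(2, attempt));
  return Math.floor(random() * ceiling);
}
```

Usage: on a socket close, call transition(state, "socketClose"), wait reconnectDelayMs(attempt), then retry and reset attempt to 0 once the socket opens. On keeping the SW alive: as far as I can tell from Chrome's docs, from Chrome 116 WebSocket traffic resets the service worker idle timer, so a small keepalive message every 20 seconds or so should keep it up during an active session.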

Where it’s at right now

It can:

• pair to a relay

• control a selected tab group

• navigate / click / scroll

• take screenshots from the controlled tab (so the agent can actually see)

And we’re iterating quickly on:

• connection stability

• better typing for rich editors

• clearer “controlled” visuals (so it’s unmistakable when the agent has the wheel)

If you’re building something similar…

I’d love to hear how other HN folks building around OpenClaw would do this:

• What’s your ideal safety model for “agent drives my real browser”?

• Any proven MV3 patterns for stable long-lived connections?

• UX ideas that make control state obvious without being obnoxious?

If people want, I can share more implementation details / the approach we took to the relay + tab-group gating.
