Show HN: Browser Brawl – What if browser agents battled to generate training data?

https://www.browser-brawl.com/
12•HrubyOnRails•1h ago
I remember watching the AlphaGo documentary in 2017. What stood out to me was that the model got drastically better when it started competing against itself. GANs clicked for me similarly: a generator and discriminator competing, and somehow the competition is what produces something remarkable.

I've been curious whether this principle generalizes to today's agents.

So mehulkalia and I built Browser Brawl at the YC / BrowserUse hackathon last weekend and won first place. It is a fun experiment in which an attacker agent tries to complete tasks on live websites while a defender agent injects JavaScript to sabotage it.

The analogy isn't perfect, because browser tasks aren't zero-sum. But our hypothesis is that an agent faced with an adversary should produce more interesting training data than one navigating clean, static environments.

Try it at: http://browser-brawl.com

GitHub: https://github.com/RichardHruby/browser-brawl

Demo Video: https://youtu.be/NIoFXv-JvBY

(Skip to [0:55](https://www.youtube.com/watch?v=NIoFXv-JvBY&t=55s) to see the agents “brawling” in the arena :), [1:52](https://www.youtube.com/watch?v=NIoFXv-JvBY&t=1m52s) to see the browser traces generated)

Would love to chat with anyone building or training browser agents. Happy to dive in below!

Comments

SobjectiveTruth•1h ago
This is hilarious
julian2k•1h ago
love it
mehulkalia•1h ago
Thanks!
mehulkalia•1h ago
Mehul here. One thing that surprised me while building this was how creative the defender agent became. It runs Claude Haiku on a timer and can choose from prebuilt disruptions like fake “Session Expired” popups, or generate custom JavaScript injections based on what the attacker is doing, like inserting fake “Search disabled” buttons. Digging through the traces and seeing the before/after screenshots of what the defender agent came up with was pretty funny, and kind of mind-blowing.
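
A minimal sketch of the defender behaviour described above, written in Python purely for illustration. Every name here (`PREBUILT`, `AttackerStep`, `custom_fake_button`, `choose_disruption`, `defender_turns`) and both JavaScript snippets are hypothetical and not taken from the Browser Brawl repo. A simple rule stands in for the Claude Haiku call, and the timer is reduced to one decision per observed attacker step.

```python
import json
from dataclasses import dataclass

# Hypothetical prebuilt disruptions: name -> JavaScript snippet to inject.
PREBUILT = {
    "session_expired": (
        "const d = document.createElement('div');"
        "d.textContent = 'Session Expired';"
        "d.style.cssText = 'position:fixed;inset:0;background:#fff;z-index:9999';"
        "document.body.appendChild(d);"
    ),
}


@dataclass
class AttackerStep:
    action: str  # e.g. "click", "type", "search" (illustrative labels)
    target: str  # CSS selector the attacker touched (illustrative)


def custom_fake_button(label: str) -> str:
    """Build JS that inserts a disabled decoy button.

    json.dumps turns the label into a quoted, escaped JS string literal.
    """
    return (
        "const b = document.createElement('button');"
        f"b.textContent = {json.dumps(label)};"
        "b.disabled = true;"
        "document.body.prepend(b);"
    )


def choose_disruption(step: AttackerStep) -> tuple[str, str]:
    """Stand-in for the model call: return (disruption name, JavaScript)."""
    if step.action == "search":
        # Custom injection tailored to what the attacker is doing.
        return "custom_search_disabled", custom_fake_button("Search disabled")
    # Otherwise fall back to a prebuilt disruption.
    return "session_expired", PREBUILT["session_expired"]


def defender_turns(steps: list[AttackerStep]) -> list[tuple[str, str]]:
    """One defender decision per tick; each tick sees one attacker step."""
    return [choose_disruption(s) for s in steps]


if __name__ == "__main__":
    trace = [AttackerStep("search", "#q"), AttackerStep("click", "#checkout")]
    for name, js in defender_turns(trace):
        print(name, len(js))
```

In a real setup the returned JavaScript would be evaluated in the attacker's page by whatever browser automation layer is in use; that part is left out here because the thread does not describe it.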
etash•50m ago
This is sick. Love the gamified creativity of the project + the actual real-world use case
sohilbhatia85•50m ago
Super cool!!
rohoswagger1•22m ago
This is so sick! Another crazy extension could be to give them certain handicaps / abilities to add more noise

The Disappearing American Mortgage

https://www.theatlantic.com/ideas/2026/03/mortgage-decline/686178/
1•FinnLobsien•1m ago•0 comments

macOS code injection for fun and no profit (2024)

https://mariozechner.at/posts/2024-07-20-macos-code-injection-fun/
1•jstrieb•1m ago•0 comments

Ask HN: Maintainers, do LLM-only users often clutter your issues/PRs?

1•lucrbvi•1m ago•0 comments

Show HN: Marra AI – AI consulting built for healthcare and occupational medicine

https://www.marraai.io/
1•rweale•1m ago•0 comments

Computer Is Fast Enough

https://lukaswerner.com/post/2026-03-03@computer-fast-enough
1•chilipepperhott•2m ago•0 comments

Models have some pretty funny attractor states

https://www.lesswrong.com/posts/mgjtEHeLgkhZZ3cEx/models-have-some-pretty-funny-attractor-states
1•debesyla•2m ago•0 comments

Pancreatic tumor cells 'choose' to either grow or tolerate treatment

https://medicalxpress.com/news/2026-02-local-fibers-pancreatic-tumors-cancer.html
1•PaulHoule•2m ago•0 comments

Anthropic investors push to de-escalate Pentagon clash over AI safeguards

https://www.reuters.com/business/retail-consumer/anthropic-investors-push-de-escalate-pentagon-cl...
1•giuliomagnifico•2m ago•0 comments

Zembed-1: The Best Text-Embedding Model

https://www.zeroentropy.dev/articles/introducing-zembed-1-the-worlds-best-multilingual-text-embed...
2•medbar•3m ago•0 comments

Agent memory is structured, not fuzzy. Why are we all using vector DBs for it?

https://old.reddit.com/r/aiagents/comments/1rklfid/comment/o8md7qv/
1•JosephjackJR•3m ago•1 comments

Show HN: Health optimization as agent-guiding gradient descent

https://github.com/artiedins/vital_loss_func
1•dingmuti•4m ago•0 comments

Why Write in the Time of LLMs?

https://hexaray.com/blog/writing-words-AI-could-never
1•gatinsama•5m ago•0 comments

From Claiming Land to Claiming Agents

https://medium.com/@mukun2045/from-claiming-land-to-claiming-agents-ca89157462a5
1•Demi369•5m ago•0 comments

Don't Fear AI: Why Humanity Will Thrive in the Age of Artificial Intelligence

https://medium.com/@mukun2045/dont-fear-ai-why-humanity-will-thrive-in-the-age-of-artificial-inte...
1•Demi369•6m ago•0 comments

Six Things I Learned Watching a Robotics Startup Die from the Inside

https://ruixu.us/posts/six-things-robotics-startup
1•pr337h4m•7m ago•0 comments

Show HN: SuperKey – JMeter Command Center

https://github.com/QAInsights/superkey
1•qainsights•8m ago•0 comments

Show HN: Secure Agent Starter – A minimal template for building safer AI agents

https://github.com/timbuctoo/secure-agent-starter
1•timbucto2•10m ago•2 comments

How the War Department Learned to Stop Worrying and Love AI (With Naomi Klein)

https://www.buzzsprout.com/2126417/episodes/18749317-how-the-war-department-learned-to-stop-worry...
2•ibobev•10m ago•0 comments

Show HN: Turn .cursorrules / repo guidelines into GitHub pre-merge checks (OSS)

https://watchflow.dev
2•dkargatzis•11m ago•1 comments

Ordered Dithering with Arbitrary or Irregular Colour Palettes

https://matejlou.blog/2023/12/06/ordered-dithering-for-arbitrary-or-irregular-palettes/
1•ibobev•11m ago•0 comments

Teleprinter

https://notebook.zoeblade.com/Teleprinter.html
1•ibobev•11m ago•0 comments

What I talk about when I talk about prompting

https://poyo.co/note/20260217T130137/
1•minikomi•11m ago•0 comments

Show HN: Novum – Automated ML Research Pipeline with Anti-Fabrication Guards

https://github.com/euanai/novum
1•euanai•11m ago•1 comments

New York could prohibit chatbot medical, legal, engineering advice

https://folding-sky.com/blog/ny-senate-bill-s7263-chatbot-liability
4•bluepeter•14m ago•0 comments

Show HN: JSON API for US HTS and Canadian Customs Tariff, with Change Detection

https://tradefacts.io
1•PowMan•15m ago•1 comments

Judge Blocks Virginia 1-Hour Social Media Law for Minors

https://reclaimthenet.org/judge-blocks-virginia-social-media-limit-minors
2•bilsbie•15m ago•0 comments

Show HN: I spent 1000 hours building my dream personal finance dashboard

https://finzen.org/
1•novaheic•16m ago•0 comments

Higher risk of later-life memory, mental health issues in American football players

https://medicalxpress.com/news/2026-03-american-football-players-higher-life.html
2•bikenaga•16m ago•1 comments

FDA sends warning to 30 telehealth companies selling 'illegal' GLP-1s

https://thehill.com/policy/healthcare/5765337-fda-telehealth-companies-compounded-glp-1/
2•randycupertino•16m ago•1 comments

Matthew Explains

https://matthewexplains.com/
1•todsacerdoti•18m ago•0 comments