Hey HN, I was building a project where I needed to compare multiple browser-agents at once, so I built The Browser Arena.
It’s a website where you can run several agents side-by-side and see, in real time, which model performs best on the exact same task. You also get some metrics, such as cost and speed.