Show HN: rtrvr.ai – New Free SOTA AI Web Agent Beats Even Operator

https://www.rtrvr.ai/blog/web-bench-results
5•arjunchint•2h ago
We just benchmarked our agent, rtrvr.ai, on the Halluminate (YC S25) Web Bench, and it achieved new state-of-the-art performance with an 81% success rate. For perspective, this surpasses not only all other autonomous agents but also the human-intervention baseline of OpenAI's Operator (76.5%).

It also completes tasks an astonishing 7x faster than the next leading alternative.

This isn't just an incremental improvement; it's a validation of our core architectural philosophy. Our performance stems from two key differentiators:

- Local-First Operation: As a Chrome Extension, rtrvr.ai operates directly within the user's browser. This eliminates the latency, bot detection and access issues that plague cloud browser agents.

- DOM-Based Interaction: Instead of relying on brittle visual parsing (CUA), our agent interacts directly with the page's HTML structure, which lets it skip clicks and stay resilient to pop-ups and overlays. We can also use the latest and fastest models, such as Gemini Flash, for superior performance.
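To make the DOM-based idea concrete, here is a minimal sketch in Python. It is not rtrvr.ai's implementation: the `INTERACTIVE_TAGS` set, the `find_interactive` helper, and the sample `PAGE` markup are all assumptions made up for illustration. It only shows that interactive elements and link targets can be read straight from the HTML, so a pop-up that visually covers the page hides nothing from the agent.

```python
from html.parser import HTMLParser

# Tags treated as "interactive" for this illustration (an assumption).
INTERACTIVE_TAGS = {"a", "button", "input", "select", "textarea"}


class InteractiveElementCollector(HTMLParser):
    """Collects interactive elements from raw HTML, ignoring visual layout."""

    def __init__(self):
        super().__init__()
        self.elements = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs supplied by HTMLParser.
        if tag in INTERACTIVE_TAGS:
            self.elements.append({"tag": tag, **dict(attrs)})


def find_interactive(html):
    """Hypothetical helper: return every interactive element found in html."""
    collector = InteractiveElementCollector()
    collector.feed(html)
    return collector.elements


# Made-up page: a newsletter overlay sits on top of a search form and a link.
PAGE = """
<div class="overlay">Subscribe to our newsletter! <button id="close">x</button></div>
<form><input name="q" type="text"><button id="search" type="submit">Search</button></form>
<a href="/results?page=2">Next</a>
"""

elements = find_interactive(PAGE)
for element in elements:
    print(element)
```

In this toy page the overlay's close button is just one more element in the list, and the "Next" link's `href` can be read without clicking anything, which is roughly what skipping clicks could look like.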

This leads to a critical industry insight: Cloud Browser Agents are not a viable long-term solution for reliable web automation.

Our benchmark analysis shows that over 94% of rtrvr.ai's failures were "agent errors" (fixable AI logic), while only 5% were "infrastructure errors." For cloud agents, this ratio is often inverted. You can't build a reliable agent if you can't even guarantee access to the environment.
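As a rough back-of-the-envelope check, those shares can be turned into percentage points of all benchmark tasks. The sketch below assumes the failure rate is simply 100% minus the 81% success rate, and it treats "over 94%" and "only 5%" as exactly 94% and 5%. The `failure_breakdown` function is a hypothetical helper, not something from the report.

```python
def failure_breakdown(success_rate, agent_share, infra_share):
    """Convert shares of failures into percentage points of all tasks."""
    failure_rate = 1.0 - success_rate
    return {
        "failure_pp": failure_rate * 100,
        "agent_error_pp": failure_rate * agent_share * 100,
        "infra_error_pp": failure_rate * infra_share * 100,
    }


# Figures from the post; the exact 94% / 5% split is an assumption.
breakdown = failure_breakdown(success_rate=0.81, agent_share=0.94, infra_share=0.05)
for name, value in breakdown.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions, infrastructure errors account for roughly one task in a hundred, while agent errors account for nearly all of the remaining failures.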

Finally, it cost us only ~$40 to run this benchmark, whereas we estimate that Halluminate spent over ~$1k in infra costs for each agent.

The future of web automation won't be fought from remote data centers. It will be run symbiotically from your browser. Our results are the first major data point proving this thesis and putting the first nail in the coffin for cloud browser agents.

Full Report: https://www.rtrvr.ai/blog/web-bench-results

Or, if you just want some Agentic-SMR of a web agent doing tasks online, tune into the playlist: https://www.youtube.com/watch?v=HWPZI8PjuLY&list=PL5rk1YARPB...

Try out the magic of a working web agent yourself. Install at: https://chromewebstore.google.com/detail/rtrvrai-ai-web-agen...

Bring your own API key from ai.studio and use Google's Gemini Free Tier to run our web agent for free! We literally have a button that gets our agent to open AI Studio, create a key, and configure itself automatically.

Comments

quarkcarbon279•2h ago
How did you keep your costs so low? Eval costs, especially with agents, can add up quickly. And what did it cost for the other agents?

Hawaii Highways

http://www.hawaiihighways.com/
1•yakattak•50s ago•0 comments

Tell HN: Knowledge is dead. Insight is currency in the age of AI

1•INKidea•7m ago•0 comments

Neil Sloane's favourite integer sequences

https://www.theguardian.com/science/alexs-adventures-in-numberland/2014/oct/07/neil-sloane-the-man-who-loved-only-integer-sequences
1•qifzer•7m ago•1 comments

Wave of syringe attacks mar France's street music festival

https://www.france24.com/en/live-news/20250622-wave-of-syringe-attacks-mar-france-s-street-music-festival
1•pizza•9m ago•0 comments

Vintage Supermarket Photos

https://theimaginaryworld.com/groceryA1.html
1•gaws•11m ago•0 comments

CS 325: CXML Parser

https://courses.cs.northwestern.edu/325/readings/cxml.php
1•susam•16m ago•0 comments

Why Meta and Apple want Perplexity AI, even if it's just a glorified chatbot

https://gizmodo.com/the-14-billion-ai-google-killer-2000618755
1•rntn•18m ago•0 comments

In Just a Few Minutes, This Music Will Change Your Day

https://www.nytimes.com/2025/06/20/arts/music/brahms-romance-piano.html
1•whack•25m ago•0 comments

Dandelion root extract affects colorectal cancer proliferation & survival (2016)

https://pmc.ncbi.nlm.nih.gov/articles/PMC5341965/
1•throwaway992673•29m ago•0 comments

Lp(a) blood test shows 114% higher heart attack risk

https://www.empirical.health/blog/lipoprotein-a-blood-test/
1•brandonb•29m ago•0 comments

Best ECG smartwatch: Our experiences and ECG explained

https://www.wareable.com/health-and-wellbeing/ecg-heart-rate-monitor-watch-guide-6508
2•teleforce•29m ago•1 comments

A new stem cell therapy for treating Type 1 diabetes

https://www.hsci.harvard.edu/news/new-therapy-treating-type-1-diabetes
1•mitchbob•30m ago•0 comments

The Value and Importance of Women Who Take Up Space

https://lithub.com/standing-tall-on-the-value-and-importance-of-women-who-take-up-space/
2•mooreds•33m ago•0 comments

Show HN: Mqutils – Universal Go message queue library

https://mqutils.dev/
1•DjGilcrease•34m ago•0 comments

Atrial Fibrillation (AFib): A Guide to Wearable ECG Smart Watches

https://afibinstitute.com.au/atrial-fibrillation-a-guide-to-wearable-ecg-smart-watches/
2•teleforce•36m ago•1 comments

Markdown (2004)

https://daringfireball.net/projects/markdown/
1•thomassmith65•39m ago•0 comments

Air India crash points to systemic problems at Boeing that CEO Ortberg must fix

https://leehamnews.com/2025/06/15/five-for-five-air-india-crash-points-to-systemic-problems-at-boeing-ceo-ortberg-must-fix/
13•andrewfromx•39m ago•12 comments

DevRel is developer zero [video]

https://www.youtube.com/shorts/8Ox_XHYoZ_c
1•mooreds•45m ago•0 comments

Highest-Paying Jobs in Germany

https://www.euronews.com/business/2025/06/21/highest-paying-jobs-in-germany-official-data-and-job-postings-reveal-top-salaries
1•e2e4•46m ago•0 comments

Tiny orange beads found by Apollo astronauts reveal Moon's explosive past

https://www.sciencedaily.com/releases/2025/06/250616040233.htm
1•anigbrowl•47m ago•0 comments

Kelp UI Library

https://kelpui.com/
2•mitchbob•47m ago•0 comments

A broker-less distributed messaging system from the previous century

https://aivarsk.com/2025/06/22/brokerless-distributed-messaging/
1•aivarsk•51m ago•0 comments

Did Contexts Kill Phoenix?

https://arrowsmithlabs.com/blog/did-contexts-kill-phoenix
2•mitchbob•56m ago•1 comments

The Art of Hanakami, or Flower-Petal Folding

https://origamiusa.org/thefold/article/art-hanakami-or-flower-petal-folding
1•s4074433•56m ago•0 comments

The Fine Art of Nesting

https://roberthoward.com.au/fine-art-nesting/
2•s4074433•58m ago•0 comments

Ask HN: How is US entering war affecting your AGI timelines?

1•ozzyphantom•58m ago•3 comments

Taking the wind out of dangerous cyclones

https://reporter.anu.edu.au/all-stories/taking-the-wind-out-of-dangerous-cyclones
1•geox•1h ago•0 comments

Show HN: REPL is the memory layer for multi-agent AI apps – Sherlog‑MCP

https://github.com/GetSherlog/Sherlog-MCP
2•teenvan_1995•1h ago•0 comments

Children in England growing up 'sedentary, scrolling and alone', say experts

https://www.theguardian.com/society/2025/jun/11/children-sedentary-scrolling-alone-lack-of-play-england
10•PaulHoule•1h ago•1 comments

Conscience and the New Cartography of War

https://blogs.timesofisrael.com/conscience-and-the-new-cartography-of-war/
1•bryanrasmussen•1h ago•1 comments