frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

You Are Here

https://brooker.co.za/blog/2026/02/07/you-are-here.html
1•mltvc•1m ago•0 comments

Why social apps need to become proactive, not reactive

https://www.heyflare.app/blog/from-reactive-to-proactive-how-ai-agents-will-reshape-social-apps
1•JoanMDuarte•2m ago•0 comments

How patient are AI scrapers, anyway? – Random Thoughts

https://lars.ingebrigtsen.no/2026/02/07/how-patient-are-ai-scrapers-anyway/
1•samtrack2019•2m ago•0 comments

Vouch: A contributor trust management system

https://github.com/mitchellh/vouch
1•SchwKatze•3m ago•0 comments

I built a terminal monitoring app and custom firmware for a clock with Claude

https://duggan.ie/posts/i-built-a-terminal-monitoring-app-and-custom-firmware-for-a-desktop-clock...
1•duggan•4m ago•0 comments

Tiny C Compiler

https://bellard.org/tcc/
1•guerrilla•5m ago•0 comments

Y Combinator Founder Organizes 'March for Billionaires'

https://mlq.ai/news/ai-startup-founder-organizes-march-for-billionaires-protest-against-californi...
1•hidden80•5m ago•1 comments

Ask HN: Need feedback on the idea I'm working on

1•Yogender78•6m ago•0 comments

OpenClaw Addresses Security Risks

https://thebiggish.com/news/openclaw-s-security-flaws-expose-enterprise-risk-22-of-deployments-un...
1•vedantnair•6m ago•0 comments

Apple finalizes Gemini / Siri deal

https://www.engadget.com/ai/apple-reportedly-plans-to-reveal-its-gemini-powered-siri-in-february-...
1•vedantnair•7m ago•0 comments

Italy Railways Sabotaged

https://www.bbc.co.uk/news/articles/czr4rx04xjpo
2•vedantnair•7m ago•0 comments

Emacs-tramp-RPC: high-performance TRAMP back end using MsgPack-RPC

https://github.com/ArthurHeymans/emacs-tramp-rpc
1•fanf2•9m ago•0 comments

Nintendo Wii Themed Portfolio

https://akiraux.vercel.app/
1•s4074433•13m ago•1 comments

"There must be something like the opposite of suicide "

https://post.substack.com/p/there-must-be-something-like-the
1•rbanffy•15m ago•0 comments

Ask HN: Why doesn't Netflix add a “Theater Mode” that recreates the worst parts?

2•amichail•16m ago•0 comments

Show HN: Engineering Perception with Combinatorial Memetics

1•alan_sass•22m ago•2 comments

Show HN: Steam Daily – A Wordle-like daily puzzle game for Steam fans

https://steamdaily.xyz
1•itshellboy•24m ago•0 comments

The Anthropic Hive Mind

https://steve-yegge.medium.com/the-anthropic-hive-mind-d01f768f3d7b
1•spenvo•24m ago•0 comments

Just Started Using AmpCode

https://intelligenttools.co/blog/ampcode-multi-agent-production
1•BojanTomic•25m ago•0 comments

LLM as an Engineer vs. a Founder?

1•dm03514•26m ago•0 comments

Crosstalk inside cells helps pathogens evade drugs, study finds

https://phys.org/news/2026-01-crosstalk-cells-pathogens-evade-drugs.html
2•PaulHoule•27m ago•0 comments

Show HN: Design system generator (mood to CSS in <1 second)

https://huesly.app
1•egeuysall•27m ago•1 comments

Show HN: 26/02/26 – 5 songs in a day

https://playingwith.variousbits.net/saturday
1•dmje•28m ago•0 comments

Toroidal Logit Bias – Reduce LLM hallucinations 40% with no fine-tuning

https://github.com/Paraxiom/topological-coherence
1•slye514•31m ago•1 comments

Top AI models fail at >96% of tasks

https://www.zdnet.com/article/ai-failed-test-on-remote-freelance-jobs/
5•codexon•31m ago•2 comments

The Science of the Perfect Second (2023)

https://harpers.org/archive/2023/04/the-science-of-the-perfect-second/
1•NaOH•32m ago•0 comments

Bob Beck (OpenBSD) on why vi should stay vi (2006)

https://marc.info/?l=openbsd-misc&m=115820462402673&w=2
2•birdculture•35m ago•0 comments

Show HN: a glimpse into the future of eye tracking for multi-agent use

https://github.com/dchrty/glimpsh
1•dochrty•36m ago•0 comments

The Optima-l Situation: A deep dive into the classic humanist sans-serif

https://micahblachman.beehiiv.com/p/the-optima-l-situation
2•subdomain•36m ago•1 comments

Barn Owls Know When to Wait

https://blog.typeobject.com/posts/2026-barn-owls-know-when-to-wait/
1•fintler•37m ago•0 comments
Open in hackernews

Show HN: rtrvr.ai – New Free SOTA AI Web Agent Beats Even Operator

https://www.rtrvr.ai/blog/web-bench-results
8•arjunchint•7mo ago
We just benchmarked our agent, rtrvr.ai, on the Halluminate (YC S25) Web Bench, and rtrvr.ai achieved a new State-of-the-Art performance with an 81% success rate. For perspective, this surpasses not only all other autonomous agents but also the human-intervention baseline of OpenAI's Operator (76.5%).

It also completes tasks an astonishing 7x faster than the next leading alternative.

This isn't just an incremental improvement; it's a validation of our core architectural philosophy. Our performance stems from two key differentiators:

- Local-First Operation: As a Chrome Extension, rtrvr.ai operates directly within the user's browser. This eliminates the latency, bot detection and access issues that plague cloud browser agents.

- DOM-Based Interaction: Instead of relying on brittle visual parsing (CUA), our agent interacts directly with the page's HTML structure, enabling skipping clicks and resilience to pop-ups and overlays. We also can just use the latest and fastest models such as Gemini Flash for superior performance.

This leads to a critical industry insight: Cloud Browser Agents are not a viable long-term solution for reliable web automation.

Our benchmark analysis shows that over 94% of rtrvr.ai's failures were "agent errors" (fixable AI logic), while only 5% were "infrastructure errors." For cloud agents, this ratio is often inverted. You can't build a reliable agent if you can't even guarantee access to the environment.

Finally it only cost us ~$40 to run this benchmark, whereas we estimate it cost >~$1k in infra costs for each agent for Halluminate.

The future of web automation won't be fought from remote data centers. It will be run symbiotically from your browser. Our results are the first major data point proving this thesis and putting the first nail in the coffin for cloud browser agents.

Full Report: https://www.rtrvr.ai/blog/web-bench-results

Or if you just want to tune into some Agentic-SMR of a web agent doing tasks online tune into the playlist: https://www.youtube.com/watch?v=HWPZI8PjuLY&list=PL5rk1YARPB...

Try out the magic of a working web agent yourself, install at: https://chromewebstore.google.com/detail/rtrvrai-ai-web-agen...

Bring your own API Key from ai.studio and use Google's Gemini Free Tier to use our web agent for free! We literally have a button that will get our agent to open AI Studio create key and configure itself all automatically.

Comments

quarkcarbon279•7mo ago
How did you keep your costs so low? Eval costs especially with Agents can go up a lot and what did it cost for other agents?
arjunchint•7mo ago
We directly leverage the user's own browser so no cloud browser hosting or proxying costs! We averaged only $0.1/task.

The whole idea of cloud browser agents is a stupid paradigm. The agents are not only 7x slower but have the cost of hosting and proxying for that extra time!

Our own biggest cost is just LLM inference, thus we can just let our users bring their own API Key and use our service for free!