
Show HN: rtrvr.ai – New Free SOTA AI Web Agent Beats Even Operator

https://www.rtrvr.ai/blog/web-bench-results
8•arjunchint•7mo ago
We just benchmarked our agent, rtrvr.ai, on the Halluminate (YC S25) Web Bench, and it set a new state of the art with an 81% success rate. For perspective, this surpasses not only all other autonomous agents but also the human-intervention baseline of OpenAI's Operator (76.5%).

It also completes tasks an astonishing 7x faster than the next leading alternative.

This isn't just an incremental improvement; it's a validation of our core architectural philosophy. Our performance stems from two key differentiators:

- Local-First Operation: As a Chrome Extension, rtrvr.ai operates directly within the user's browser. This eliminates the latency, bot detection and access issues that plague cloud browser agents.

- DOM-Based Interaction: Instead of relying on brittle visual parsing (CUA), our agent interacts directly with the page's HTML structure, which lets it skip unnecessary clicks and stay resilient to pop-ups and overlays. We can also simply use the latest and fastest models, such as Gemini Flash, for superior performance.
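To make the DOM-based idea concrete, here is a minimal sketch in Python. It is not rtrvr.ai's implementation: the class `InteractiveElementIndex`, the helper `find_by_text`, and the sample `PAGE` markup are all hypothetical, and a real extension would read the live DOM in the browser rather than parse an HTML string. The point is only that an element can be picked by its tag, attributes, and label, so an overlay elsewhere on the page does not get in the way.

```python
from html.parser import HTMLParser


class InteractiveElementIndex(HTMLParser):
    """Hypothetical helper: collects links, buttons, and inputs from raw HTML."""

    INTERACTIVE = {"a", "button", "input"}

    def __init__(self):
        super().__init__()
        self.elements = []  # one dict per interactive element, in page order
        self._open = None   # element whose inner text is currently being collected

    def handle_starttag(self, tag, attrs):
        if tag in self.INTERACTIVE:
            entry = {"tag": tag, "attrs": dict(attrs), "text": ""}
            self.elements.append(entry)
            # <input> is a void element, so there is no inner text to collect
            self._open = None if tag == "input" else entry

    def handle_data(self, data):
        if self._open is not None:
            self._open["text"] += data.strip()

    def handle_endtag(self, tag):
        if self._open is not None and tag == self._open["tag"]:
            self._open = None


def find_by_text(html, label):
    """Return the first interactive element whose label matches, else None."""
    index = InteractiveElementIndex()
    index.feed(html)
    index.close()
    for element in index.elements:
        if element["text"].lower() == label.lower():
            return element
    return None


# Illustrative page: a pop-up overlay sits in front of the real form.
PAGE = """
<div class="overlay"><button id="dismiss">No thanks</button></div>
<form><input name="q"><button id="go" type="submit">Search</button></form>
<a href="/pricing">Pricing</a>
"""

target = find_by_text(PAGE, "Search")
print(target["attrs"]["id"])
```

Because the lookup works on structure rather than pixels, the overlay button is just another entry in the index and does not hide the "Search" button from the agent.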

This leads to a critical industry insight: Cloud Browser Agents are not a viable long-term solution for reliable web automation.

Our benchmark analysis shows that over 94% of rtrvr.ai's failures were "agent errors" (fixable AI logic), while only 5% were "infrastructure errors." For cloud agents, this ratio is often inverted. You can't build a reliable agent if you can't even guarantee access to the environment.
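As a rough worked example, the failure split can be restated as a share of all benchmark tasks. This sketch assumes the 94% and 5% figures are shares of the failed tasks (the 19% that did not succeed) and treats "over 94%" as exactly 94%; the variable names are illustrative only.

```python
SUCCESS_RATE = 0.81  # overall success rate reported in the post
AGENT_SHARE = 0.94   # share of failures attributed to agent errors ("over 94%")
INFRA_SHARE = 0.05   # share of failures attributed to infrastructure errors

failure_rate = 1 - SUCCESS_RATE

# Convert shares of failures into percentages of all tasks.
agent_pct_of_tasks = round(failure_rate * AGENT_SHARE * 100, 2)
infra_pct_of_tasks = round(failure_rate * INFRA_SHARE * 100, 2)

print(f"agent errors: {agent_pct_of_tasks}% of all tasks")
print(f"infrastructure errors: {infra_pct_of_tasks}% of all tasks")
```

Under these assumptions, infrastructure errors account for roughly 1% of all tasks, and agent errors for roughly 18%.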

Finally, it cost us only ~$40 to run this benchmark, whereas we estimate that Halluminate spent more than ~$1k in infra costs for each agent it evaluated.

The future of web automation won't be fought from remote data centers. It will be run symbiotically from your browser. Our results are the first major data point proving this thesis and putting the first nail in the coffin for cloud browser agents.

Full Report: https://www.rtrvr.ai/blog/web-bench-results

Or, if you just want some Agentic-SMR of a web agent doing tasks online, tune into the playlist: https://www.youtube.com/watch?v=HWPZI8PjuLY&list=PL5rk1YARPB...

Try out the magic of a working web agent yourself, install at: https://chromewebstore.google.com/detail/rtrvrai-ai-web-agen...

Bring your own API key from ai.studio and use Google's Gemini Free Tier to run our web agent for free! We literally have a button that gets our agent to open AI Studio, create a key, and configure itself, all automatically.

Comments

quarkcarbon279•7mo ago
How did you keep your costs so low? Eval costs, especially with agents, can go up a lot. What did it cost for the other agents?
arjunchint•7mo ago
We directly leverage the user's own browser, so there are no cloud browser hosting or proxying costs! We averaged only $0.10/task.

The whole idea of cloud browser agents is a stupid paradigm. The agents are not only 7x slower but also incur hosting and proxying costs for that extra time!

Our own biggest cost is just LLM inference, so we can let our users bring their own API key and use our service for free!