frontpage.

I often use LLMs to automate different workflows, some of which include browsing the web and gathering data. At some point I started noticing a few things that bothered me: the browser interactions were clunky, as if the agent was struggling to "see" and understand the page, and as a result, many tokens were wasted. Same for knowing when the page is actually ready or not.

I started digging deeper and at some point I just bluntly asked in the Cursor chat the following question: "I ask you, as an LLM that uses these headless browsers, what do you wish people would build to make your work easier?"

And it worked because I expanded the "Thinking" section and I saw: "The user is asking me a really interesting meta-question ..." and after that it just listed top 10 most painful issues related to the agent<->browser interaction.

So I started building a browser API that returns what LLMs actually need, not what browsers return.

Fast forward a few weeks and here we are. A REST API built specifically to help LLMs interact with real browsers.

Instead of reading raw HTML, you get markdown, page map, short refs (e1, e2) for clicking instead of CSS selectors, a stable flag when the page is ready, diffs after each step, the list of all interactive elements (links, buttons, inputs), automatic blocker dismissal and a small extract step that returns structured JSON from a schema you describe.

Official SDKs for Python, TypeScript, Ruby. MCP server for Cursor and Claude Desktop.

Would appreciate any feedback, especially on the API design.

BCG's Data Warehouse Hacked – 3.17T Rows, Zero Authentication

Has GitLab Felt into the Enshittification?

North Korea-Nexus Threat Actor Compromises Widely Used Axios NPM Package

Dark Code

Ask questions on the Claude Code codebase

Software Pipelining for GPU Kernels: Part 1 – The Pipeline Problem

Agent skills for desktop automation and video recording

Show HN: Amoxide – The right aliases, at the right time

Kagi: April 1, 1996

Reductio Ad Absurdum

The Oil Crisis Is About to Get Physical

Wonder View Tower, a Century-Old Landmark on Colorado's Eastern Plains

Show HN: AI-Native NAACP

Prompt Engineering for Humans

Show HN: MCP server that generates macOS tools via Open Scripting Architecture

Show HN: Claude Code rewritten as a bash script

Show HN: Initialize an Agent Harness with Forge CLI

RL Meets Adaptive Speculative Training

Write Once, Run Anywhere (For Real This Time) – Why WASI Won't Get Us There

The Road to NfSensei

The plan to make IPOs great again

Show HN: Strudel.ai – Figma for org design [video]

Dating apps are boring so we created a chess dating app

Israel passes law to give death penalty to Palestinians convicted lethal attacks

Options for Preloading Images with JavaScript

Reviews for The Super Mario Galaxy film are what you'd expect

Thoughts on LLMs' effect on work

Signals, the push-pull based algorithm

Born in the USA? China targeted in Trump's birthright citizenship fight

Improving on Sandi Metz's Gear Class from Poodr

Show HN: Browserbeam – a browser API built for AI agents