Even with:

- accessibility snapshots
- element references (E1, E2)
- semantic locators
- session isolation

they still feel fundamentally fragile.
LLMs reason over DOM trees step by step. It works, but barely: small UI changes break everything.
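For contrast, this is roughly what the brittle status quo looks like when scripted directly, sketched here with Playwright (the URL and selectors are made up):

```ts
import { chromium } from "playwright";

// The brittle status quo: every step depends on markup details
// the site never promised to keep stable.
async function addToCartViaDom() {
  const browser = await chromium.launch();
  const page = await browser.newPage();
  await page.goto("https://shop.example.com/item/123"); // hypothetical URL

  // Guessed selector: breaks if the class name, nesting, or text changes.
  await page.locator("div.product-actions > button.add-to-cart").click();

  // "Reading state" means re-parsing the DOM and hoping the structure held.
  const cartCount = await page.locator("#cart-badge").innerText();
  console.log("Cart badge now reads:", cartCount);

  await browser.close();
}

addToCartViaDom();
```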
It feels like we’re missing an abstraction layer.
What if, instead of agents operating on markup, websites exposed structured “interaction surfaces”: something closer to tools or world models than to raw DOM nodes?
Instead of:

- parse DOM
- guess selector
- click element

it would be:

- request action
- receive structured state
- operate over stable semantic primitives
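Concretely, such a surface might look something like the TypeScript sketch below. Everything in it is hypothetical (the `InteractionSurface` interface, the action names, the mock site); it only illustrates the shape of "request action, receive structured state", not a real protocol.

```ts
// Hypothetical sketch only: names like InteractionSurface and
// "add_to_cart" are invented for illustration; no such standard exists.

// A stable semantic primitive the site commits to, independent of markup.
interface SemanticAction {
  name: string;        // e.g. "add_to_cart"
  description: string; // natural-language summary for the agent
}

// Structured state returned after every action: no DOM parsing needed.
interface ActionResult {
  ok: boolean;
  state: Record<string, unknown>;     // e.g. { cart: { items: 3 } }
  availableActions: SemanticAction[]; // what the agent can do next
  error?: string;
}

// What a site could expose (say, at a well-known endpoint).
interface InteractionSurface {
  describe(): Promise<SemanticAction[]>;
  invoke(action: string, params: Record<string, unknown>): Promise<ActionResult>;
}

// Minimal in-memory implementation so the sketch runs end to end.
const actions: SemanticAction[] = [
  { name: "add_to_cart", description: "Add an item to the shopping cart" },
];

const mockSurface: InteractionSurface = {
  async describe() {
    return actions;
  },
  async invoke(action, params) {
    if (action !== "add_to_cart") {
      return { ok: false, state: {}, availableActions: actions, error: `unknown action: ${action}` };
    }
    return {
      ok: true,
      state: { cart: { items: 1, lastSku: params.sku } },
      availableActions: actions,
    };
  },
};

// Agent side: request an action, receive structured state.
async function demo(surface: InteractionSurface) {
  const offered = await surface.describe();
  console.log("Site offers:", offered.map((a) => a.name));

  const result = await surface.invoke("add_to_cart", { sku: "ABC-123" });
  console.log(result.ok ? result.state : result.error);
}

demo(mockSurface);
```

The point of the shape: the site, not the agent, owns the mapping from semantic actions to markup, so the DOM can change freely without breaking agents.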
Is this already being explored somewhere beyond MCP experiments? Or is everyone still stuck in DOM-land?
Curious whether others see the same limitation, and whether a middleware “site-agent” layer makes sense.
Would love to hear your thoughts
andsoitis•1h ago
For instance, see “Computer use” in the recent Sonnet 4.6 announcement: https://www.anthropic.com/news/claude-sonnet-4-6
AS_YC•1h ago