We just shipped a Chat AI Agent that lives inside every device session. It sees the live screen and answers questions like "what's the XPath for that button?" or "give me the UIAutomator2 selector for row 2" — with code output in Java, Python, Swift, Kotlin, or WebDriverIO.
The implementation: we grab a frame from the WebRTC stream at query time and pass it to a vision LLM with a structured prompt. The model returns locators in all applicable formats (XPath, CSS, UIAutomator2, XCUITest, Accessibility ID). We parse and render with syntax highlighting.
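The pipeline can be sketched roughly like this: encode the captured frame and build a vision-LLM request that pins the reply to a JSON locator map. This is a minimal sketch, not the shipped code — the OpenAI-style message shape, `build_locator_prompt`, and `LOCATOR_FORMATS` are assumptions; adapt to your provider's API.

```python
import base64
import json

# Formats mentioned in the post; the model is told to omit inapplicable ones.
LOCATOR_FORMATS = ["xpath", "css", "uiautomator2", "xcuitest", "accessibility_id"]

def build_locator_prompt(frame_png: bytes, question: str, platform: str) -> list:
    """Build a vision-LLM message list (hypothetical OpenAI-style content
    parts). The system text constrains the reply to a JSON object so the
    response is machine-parseable rather than prose."""
    system = (
        "You are a UI locator assistant. Given a screenshot of a "
        f"{platform} app and a question, reply with ONLY a JSON object: "
        '{"locators": {<format>: <selector string>}} using formats from '
        + json.dumps(LOCATOR_FORMATS)
        + ". Omit formats that do not apply to this platform."
    )
    # Inline the frame as a base64 data URL, the common shape for vision APIs.
    image_url = "data:image/png;base64," + base64.b64encode(frame_png).decode()
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": [
            {"type": "text", "text": question},
            {"type": "image_url", "image_url": {"url": image_url}},
        ]},
    ]
```

Grabbing the frame at query time (instead of streaming every frame to the model) keeps token cost to one image per question.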
The main challenge was prompt engineering: getting the model to emit clean, parseable output rather than prose with code embedded in it.
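Even with a JSON-only instruction, models sometimes wrap the payload in a markdown fence or lead-in prose, so the parse step needs a fallback. A hedged sketch of that defensive parsing (the `parse_locators` helper and the `{"locators": ...}` shape are assumptions, not the shipped code):

```python
import json
import re

def parse_locators(raw: str) -> dict:
    """Extract a {"locators": {...}} JSON object from a model reply,
    tolerating markdown fences and surrounding prose."""
    text = raw.strip()
    # Strip a ```json ... ``` fence if the whole reply is fenced.
    fence = re.match(r"```(?:json)?\s*(.*?)\s*```", text, re.DOTALL)
    if fence:
        text = fence.group(1)
    try:
        return json.loads(text)["locators"]
    except (json.JSONDecodeError, KeyError):
        # Fall back: grab the widest {...} span and try that.
        match = re.search(r"\{.*\}", text, re.DOTALL)
        if match:
            return json.loads(match.group(0)).get("locators", {})
        raise ValueError("no locator JSON found in model output")
```

In practice, tightening the prompt reduces how often the fallback path fires, but it never quite reaches zero, so the parser has to stay forgiving.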
Happy to answer questions about the implementation.