Show HN: Tine – Drive Wayland Around with Agents

4•tarboreus•1h ago

So recently Anthropic came out with tools that let you drive Windows and Mac OS around. But I don't use Windows or MacOS, and was curious about some Wayland internals stuff and whether it would be possible to build something usable on a relatively restrictive platform like Wayland. So I made Tine.

Tine is a GNOME extension and CLI that lets an agent (I have used Claude but in theory any agent that can access the CLI) drive the desktop around using SPI trees (AT-SPI2), OCR, and visual fallbacks. Agent can do work with the a11y (AT-SPI2) trees, take screenshots, zoom in on a grid, click, enter text using a uinput device, and generally bumble their way around a Wayland Linux desktop.

This project would probably have been way easier in x11 but Wayland is teh future!!!111 Thanks for any thoughts and feedback and feels good to release something here after a decade of lurking. Decade plus but who's counting / I'm not old.

Comments

aayushkumar121•1h ago

Nice approach combining AT-SPI2 + grid fallback.

Have you run into issues where the a11y tree is incomplete (e.g. Electron apps)? Wondering how often the grid/OCR path becomes the primary path.

tarboreus•59m ago

Yes. It works really well with Firefox. Chrome is basically a big blank, could expose the Chrome a11y tree but would have to turn on developer / debug mode or whatever which has downsides. It's really nice for Firefox and GTK apps and it does OCR + grid stuff for everything else.

aayushkumar121•56m ago

That makes sense. Feels like the fallback path might end up being the default for a lot of real-world apps.

Have you thought about combining weak a11y signals + OCR to build more stable refs over time, or is that too brittle in practice?

tarboreus•45m ago

Yeah. Some of the tree is a mess, could try to ocr those bits. It's on the roadmap. :)

I will say I have some feelings about Wayland and how hard it makes some stuff I do. I'm visually impaired and have a whole stack of tools. But this project has helped me port over 70-80% of those tools and it helps me bridge some gaps on Wayland temporarily so I can get infra set up. It's also great for the many sites that Claude blocks for whatever reason (Reddit, I am a sub mod but the a11y on Reddit is terrible, AmEx, LinkedIn).

Show HN: Playbach.io, browser rhythm game (desktop)

Arm comes to the Framework 13 [video]

Comparing Compression Tools

Startups Are Context Arbitrages

Coq theorem prover is now called Rocq

Printing real headline news on the Commodore 64 with The Newsroom's Wire Service

Space Force looks at moving "significant number" of launches from ULA to SpaceX

Opting out of cookies no guarantee

How Accurate Are Google's A.I. Overviews?

Lowdefy v5: The Config Webstack

I made Agentation for vanilla JavaScript

Project Glasswing Has a Blind Spot. It's You

Users lose $9.5M to fake Ledger wallet app on the Apple App Store

Jane Street Signs $6B AI Cloud Agreement with CoreWeave

Linux 7.1 Is a Big Win for Intel Panther Lake with Fred Now Enabled by Default

Keyword Scout

ChatGPT, Is This Real?

Show HN: EmbedIQ – Claude Code Compliance Config for HIPAA/PCI-DSS/SOC2

We Built Hanker in 14 Days with Claude

Fiverr Denies Report of Data Leak

AI papers published in 2026 worth reading

Claude Cowork found me a flat to rent in London in just 5 days

Project Maven Put A.I. Into the Kill Chain

How China is wooing Paraguay's political class away from longtime ally Taiwan

The Courage to Stop

Anthropic's rise is giving some OpenAI investors second thoughts

Study of the cosmos proves we still can't explain how the universe is expanding

AAUP does not want you to share your syllabus

Show HN: Horizontally Scale Localhost

Before he wrote AI 2027, he predicted the world in 2026. How did he do?