frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Pilo – open-source agentic web automation engine by Mozilla

13•MrTravisB•2h ago
Hello HN,

We are the team behind Tabstack (https://tabstack.ai) - part of Mozilla. We just open sourced Pilo (pronounce PIE-low), the core engine that powers our automation platform. You can check it out on Github at https://github.com/mozilla/pilo.

Pilo is an agentic web automation library. Instead of writing rigid scripts with CSS selectors, you give it a natural language goal (e.g., "Find the best pizza in Seattle and extract the ratings") and it autonomously navigates the browser to achieve it.

We built this because we were struggling to make reliable agents for our own /automate endpoint. Existing tools were either too brittle (breaking on minor DOM changes) or too heavy (feeding raw HTML to LLMs, blowing up context windows).

Here is how Pilo solves those problems:

- Accessibility Tree over HTML: Instead of parsing raw HTML "soup," Pilo captures the browser's accessibility tree (via Playwright's _snapshotForAI). This gives the LLM a semantic, stable view of the page (buttons, links, inputs) rather than div hell.

- Context Compression: We pipe that tree through a compression engine. We map verbose tags (like listitem -> li), shorten reference IDs, and deduplicate repetitive text. This reduces token usage by 60-80% without losing interactive elements, allowing for much longer agent loops.

- Layered Error Handling: The web is flaky. Pilo treats navigation failures as distinct from interaction failures. It uses timeout escalation for network issues (doubling wait times) and will automatically restart the browser instance if it detects a "stuck" state or DNS failure.

- Agentic Loop: It follows a strict Plan -> Observe -> Act -> Validate loop. It even includes a separate validation step where a second LLM "grades" the final output against the original success criteria before returning it.

The "Cool" Part (Browser Extension) Since the core logic is decoupled from the runtime, we packaged it into a browser extension. You can install it, type a prompt, and literally watch the agent drive your local browser tab in real-time. It’s a great way to debug how the LLM "sees" the page.

Why Open Source? We sell the managed infrastructure (scaling browsers, persistent sessions, etc.) at Tabstack. But the execution engine itself, the thing that decides "click here" or "scroll there", should be open. You can run Pilo entirely on your own machine with your own API keys without paying us a dime.

You can read more about it on our blog https://tabstack.ai/blog/introducing-pilo-browser-automation.

Or check out the repo, install it, and give it a try - https://github.com/mozilla/pilo

We’d love to hear your feedback on the compression pipeline or how you’re handling agent state in your own projects.

Happy to answer any questions!

Comments

verdverm•1h ago
The main issue I see with everyone and their brother making specialized agentic frameworks is

1. I now have to understand N frameworks, their quirks and handles, their prompts and tools. I certainly don't want to be locked into their strict loop definition.

2. Most of them could be extensions, even just a skill, within other frameworks

I prefer to remain a minimalist for now and use projects like this for inspiration

MrTravisB•1h ago
We completely agree. Framework fatigue is real, and getting locked into a rigid loop is frustrating.

Choices are great, and our goal is to let you piece together a setup to your own liking. We want Pilo to work with your existing tools, not against them. If you just want to rip out our accessibility tree compression pipeline and use it as a standalone skill in your own custom framework, we consider that a massive win.

That is exactly why we are open sourcing it. We want to see what others can do with it.

If there is a framework or tool this could work with but does not currently, we would love to hear about it.

verdverm•19m ago
I use ADK which has many points 3rd parties plug in. I'm also involved in the development (from the outside). I will look more into Pilo and how this could work. Would save me a bunch of effort!

I'll open an issue for tracking

---

said issue: https://github.com/mozilla/pilo/issues/318

My lobster lost $450k this weekend

https://pashpashpash.substack.com/p/my-lobster-lost-450000-this-weekend
1•__cayenne__•1m ago•0 comments

The Longest Line of Sight

https://tombh.co.uk/longest-line-of-sight
1•giraffe_lady•1m ago•0 comments

Ductape – One SDK for any backend integration

https://www.ductape.app/?hnlaunch=1
1•snifideezy•1m ago•1 comments

You Can't Optimize What You Can't See. AI Cost Observability

https://www.edgee.ai/blog/posts/2026-02-23-ai-cost-observability-missing-layer
1•Gillesray•1m ago•0 comments

Show HN: Fastdedup – Rust dataset deduplication (2:55 vs. 7:55 688MB vs. 22GB)

https://wapplewhite4.github.io/fastdedup/
1•wapplewhite4•2m ago•0 comments

Hegseth gives Anthropic until Friday to back down on AI safeguards

https://www.axios.com/2026/02/24/anthropic-pentagon-claude-hegseth-dario
4•rurp•4m ago•0 comments

Training my dog to vibe code B2B SaaS apps

https://dogomation.darefail.com/
2•jimhi•6m ago•1 comments

Can agentic coding raise the quality bar?

https://lpalmieri.com/posts/agentic-coding-raises-quality/
1•SatvikBeri•7m ago•0 comments

Show HN: MakLock – Free macOS App Locker with Touch ID and Apple Watch

https://github.com/dutkiewiczmaciej/MakLock
1•makmakapps•7m ago•0 comments

"SaaS is Dead" – they say

https://kudmitry.com/articles/saas-is-dead-they-say/
1•skwee357•7m ago•0 comments

Show HN: YouAM – An address, contact card, and encrypted inbox for AI agents

1•midlifedad•7m ago•0 comments

Show HN: Shelfctl – PDF/ePub library manager backed by GitHub Release

https://github.com/blackwell-systems/shelfctl
1•daynablackwell•9m ago•0 comments

Intel Formally Ends Four of Their Go Language Open-Source Projects

https://www.phoronix.com/news/Intel-Stops-Go-Projects
1•LorenDB•9m ago•0 comments

Spacydo: State machine example with own calldata for state transition rules

1•tracyspacy•10m ago•0 comments

Data vs. Hype: How Orgs Win with AI – The Pragmatic Summit [video]

https://www.youtube.com/watch?v=LOHgRw43fFk
1•cyndunlop•10m ago•0 comments

Implementing a Clear Room Z80 / ZX Spectrum Emulator with Claude Code

https://antirez.com/news/160
1•cyndunlop•10m ago•0 comments

Coding Agent, Good?

https://teetracker.medium.com/coding-agent-its-a-good-idea-1d34966c44ab
1•hasszhao•11m ago•1 comments

Steel Bank Common Lisp

https://www.sbcl.org/
2•tosh•11m ago•0 comments

Forests don't just store carbon. They keep people alive, scientists say

https://news.mongabay.com/2026/02/forests-dont-just-store-carbon-they-keep-people-alive-scientist...
1•PaulHoule•11m ago•0 comments

The Deceptively Simple Act of Writing to Disk

https://www.scylladb.com/2026/02/18/the-deceptively-simple-act-of-writing-to-disk/
1•cyndunlop•11m ago•0 comments

Inception Launches Mercury 2, the Fastest Reasoning LLM

https://www.businesswire.com/news/home/20260224034496/en/Inception-Launches-Mercury-2-the-Fastest...
1•tinco•12m ago•0 comments

OpenAI, the US government and Persona built an identity surveillance machine

https://vmfunc.re/blog/persona/
3•rzk•12m ago•0 comments

OpenAI resets spending expectations, from $1.4T to $600B

https://www.cnbc.com/2026/02/20/openai-resets-spend-expectations-targets-around-600-billion-by-20...
3•randycupertino•12m ago•0 comments

I think WebRTC is better than SSH-ing for connecting to Mac terminal from iPhone

https://macky.dev
2•Sayuj01•13m ago•1 comments

China May Grab a Lead in the Race for Military Fusion

https://www.wsj.com/opinion/china-may-grab-a-lead-in-the-race-for-military-fusion-c5ab6d2b
1•JumpCrisscross•13m ago•0 comments

An AI agent bought from our WooCommerce store. Here's what we learned

https://zologic.nl/the-next-evolution-of-conversion-why-your-store-needs-to-be-agent-ready/
1•Zologic•14m ago•1 comments

Ask HN: Share a random link from your bookmarks

1•TechSquidTV•14m ago•0 comments

Ask HN: Demand for a compliance-first deterministic context compiler?

1•PensaerWales•14m ago•1 comments

Ask HN: How to exhaustively search the scientific literature?

1•cossatot•16m ago•0 comments

Gas Town, OpenClaw and the rise of open source AI agents

https://www.axios.com/2026/02/24/agents-openclaw-moltbook-gastown
2•msolujic•16m ago•0 comments