Show HN: Smooth CLI – Token-efficient browser for AI agents

https://docs.smooth.sh/cli/overview

27•antves•23h ago

Hi HN! Smooth CLI (https://www.smooth.sh) is a browser that agents like Claude Code can use to navigate the web reliably, quickly, and affordably. It lets agents specify tasks using natural language, hiding UI complexity, and allowing them to focus on higher-level intents to carry out complex web tasks. It can also use your IP address while running browsers in the cloud, which helps a lot with roadblocks like captchas (https://docs.smooth.sh/features/use-my-ip).

Here’s a demo: https://www.youtube.com/watch?v=62jthcU705k Docs start at https://docs.smooth.sh.

Agents like Claude Code, etc are amazing but mostly restrained to the CLI, while a ton of valuable work needs a browser. This is a fundamental limitation to what these agents can do.

So far, attempts to add browsers to these agents (Claude’s built-in --chrome, Playwright MCP, agent-browser, etc.) all have interfaces that are unnatural for browsing. They expose hundreds of tools - e.g. click, type, select, etc - and the action space is too complex. (For an example, see the low-level details listed at https://github.com/vercel-labs/agent-browser). Also, they don’t handle the billion edge cases of the internet like iframes nested in iframes nested in shadow-doms and so on. The internet is super messy! Tools that rely on the accessibility tree, in particular, unfortunately do not work for a lot of websites.

We believe that these tools are at the wrong level of abstraction: they make the agent focus on UI details instead of the task to be accomplished.

Using a giant general-purpose model like Opus to click on buttons and fill out forms ends up being slow and expensive. The context window gets bogged down with details like clicks and keystrokes, and the model has to figure out how to do browser navigation each time. A smaller model in a system specifically designed for browsing can actually do this much better and at a fraction of the cost and latency.

Security matters too - probably more than people realize. When you run an agent on the web, you should treat it like an untrusted actor. It should access the web using a sandboxed machine and have minimal permissions by default. Virtual browsers are the perfect environment for that. There’s a good write up by Paul Kinlan that explains this very well (see https://aifoc.us/the-browser-is-the-sandbox and https://news.ycombinator.com/item?id=46762150). Browsers were built to interact with untrusted software safely. They’re an isolation boundary that already works.

Smooth CLI is a browser designed for agents based on what they’re good at. We expose a higher-level interface to let the agent think in terms of goals and tasks, not low-level details.

For example, instead of this:

  click(x=342, y=128)
  type("search query")
  click(x=401, y=130)
  scroll(down=500)
  click(x=220, y=340)
  ...50 more steps

Your agent just says:

  Search for flights from NYC to LA and find the cheapest option

Agents like Claude Code can use the Smooth CLI to extract hard-to-reach data, fill-in forms, download files, interact with dynamic content, handle authentication, vibe-test apps, and a lot more.

Smooth enables agents to launch as many browsers and tasks as they want, autonomously, and on-demand. If the agent is carrying out work on someone’s behalf, the agent’s browser presents itself to the web as a device on the user’s network. The need for this feature may diminish over time, but for now it’s a necessary primitive. To support this, Smooth offers a “self” proxy that creates a secure tunnel and routes all browser traffic through your machine’s IP address (https://docs.smooth.sh/features/use-my-ip). This is one of our favorite features because it makes the agent look like it’s running on your machine, while keeping all the benefits of running in the cloud.

We also take away as much security responsibility from the agent as possible. The agent should not be aware of authentication details or be responsible for handling malicious behavior such as prompt injections. While some security responsibility will always remain with the agent, the browser should minimize this burden as much as possible.

We’re biased of course, but in our tests, running Claude with Smooth CLI has been 20x faster and 5x cheaper than Claude Code with the --chrome flag (https://www.smooth.sh/images/comparison.gif). Happy to explain further how we’ve tested this and to answer any questions about it!

Instructions to install: https://docs.smooth.sh/cli. Plans and pricing: https://docs.smooth.sh/pricing.

It’s free to try, and we'd love to get feedback/ideas if you give it a go :)

We’d love to hear what you think, especially if you’ve tried using browsers with AI agents. Happy to answer questions, dig into tradeoffs, or explain any part of the design and implementation!

Comments

franze•1h ago

Congrats for shipping.

How does it compare to Agent Browser by Vercel?

antves•9m ago

Thanks for asking! There are a few core differences: 1. we expose a higher level interface which allows the agent to think about what to do as opposed to what to do 2. we developed a token-efficient representation of the webpages that combines both visual and textual elements, heavily optimized for what LLMs are good at. 3. because we control the agentic loop, it also means that we can do fancy things on contextual injections, compressions, asynchronous manipulations, etc which are impossible to achieve when exposing the navigation interface 4. we use a coding agent under the hood, meaning that it can express complex actions efficiently and effectively compared to the CLI interface that agent-browser exposes 5. because we control the agent, we can use small and efficient LLMs which make the system much faster, cheaper, and more reliable

Also, our service comes with batteries included: the agent can use browsers in our cloud with auto-captcha solvers, stealth mode, we can proxy your own ip, etc

waynenilsen•1h ago

Frontend QA is the final frontier, good luck, you are over the target.

The amount of manual QA I am currently subjected to is simultaneously infuriating and hilarious. The foundation models are up to the task but we need new abstractions and layers to correctly fix it. This will all go the way of the dodo in 12 months but it'll be useful in the meantime.

agent-browser helped a lot over playwright but doesn't completely close the gap.

behnamoh•1h ago

Ironically, the landing page and docs pages of Smooth aren't all that token-efficient!

liukidar•7m ago

Ahah, indeed that's true... That's why we've just released Smooth CLI (https://docs.smooth.sh/cli/overview) and the SKILL.md (smooth-sdk/skills/smooth-browser/SKILL.md) associated with it. That should contain everything your agent needs to know to use Smooth. We will definitely add a LLM-friendly reference to it in the landing page and the docs introduction.

tekacs•51m ago

This looks really interesting!

I _would_ be curious to try it, but...

My first question was whether I could use this for sensitive tasks, given that it's not running on our machines. And after poking around for a while, I didn't find a single mention of security anywhere (as far as I could tell!)

The only thing that I did find was zero data retention, which is mentioned as being 'on request' and only on the Enterprise plan.

I totally understand that you guys need to train and advance your model, but with suggested features like scraping behind login walls, it's a little hard to take seriously with neither of those two things anywhere on the site, so anything you could do to lift up those concerns would be amazing.

Again, you seem to have done some really cool stuff, so I'd love for it to be possible to use!

Update: The homepage says this in a feature box, which is... almost worst than saying nothing, because it doesn't mean anything? -> "Enterprise-grade security; End-to-end encryption, enterprise-grade standards, and zero-trust access controls keep your data protected in transit and at rest."

johnys•44m ago

Curious: what are people using as the best open source and locally hosted versions to have agents browse the web?

waynenilsen•51m ago

i can see a new token efficient mirror web possibly emerging using content type headers on the request side

forms, PRG, semantic HTML and no js needed

antves•4m ago

Totally agree! The web for agents is evolving very fast and it's still unclear what it will look like

Our take is that, while that happens, agents today need to be able to access all the web resources that we can access as humans

Also, browsers are a really special piece of software because they provide access to almost every other kind of software. This makes them arguably the single most important tool for AI agents, and that’s why we believe that a browser might be all agents need to suddenly become ten times more useful than they already are

tobyhinloopen•2m ago

Way too expensive, I'll wait for a free/open source browser optimized to be used by agents.

I now assume that all ads on Apple news are scams

Hackers (1995) Animated Experience

The rise of one-pizza engineering teams

LLMs could be, but shouldn't be compilers

Claude Opus 4.6

TikTok's 'Addictive Design' Found to Be Illegal in Europe

Invention of DNA "Page Numbers" Opens Up Possibilities for the Bioeconomy

A new bill in New York would require disclaimers on AI-generated news content

GPT-5.3-Codex

Microsoft open-sources LiteBox, a security-focused library OS

Things Unix can do atomically (2010)

My AI Adoption Journey

Solving Shrinkwrap: New Experimental Technique

Show HN: Smooth CLI – Token-efficient browser for AI agents

Wall Street just lost $285B because of 13 Markdown files

DNS Explained – How Domain Names Get Resolved

Nixie-clock using neon lamps as logic elements (2007)

Systems Thinking

Stay Away from My Trash

Plasma Effect (2016)

We tasked Opus 4.6 using agent teams to build a C Compiler

Understanding Neural Network, Visually

Recreating Epstein PDFs from raw encoded attachments

Show HN: Artifact Keeper – Open-Source Artifactory/Nexus Alternative in Rust

Coding Agents and Use Cases

Animated Knots

The time I didn't meet Jeffrey Epstein

The RCE that AMD won't fix

Unlocking high-performance PostgreSQL with key memory optimizations

How to carry more than your own bodyweight (2025)

Show HN: Smooth CLI – Token-efficient browser for AI agents

Comments

I now assume that all ads on Apple news are scams

Hackers (1995) Animated Experience

The rise of one-pizza engineering teams

LLMs could be, but shouldn't be compilers

Claude Opus 4.6

TikTok's 'Addictive Design' Found to Be Illegal in Europe

Invention of DNA "Page Numbers" Opens Up Possibilities for the Bioeconomy

A new bill in New York would require disclaimers on AI-generated news content

GPT-5.3-Codex

Microsoft open-sources LiteBox, a security-focused library OS

Things Unix can do atomically (2010)

My AI Adoption Journey

Solving Shrinkwrap: New Experimental Technique

Show HN: Smooth CLI – Token-efficient browser for AI agents

Wall Street just lost $285B because of 13 Markdown files

DNS Explained – How Domain Names Get Resolved

Nixie-clock using neon lamps as logic elements (2007)

Systems Thinking

Stay Away from My Trash

Plasma Effect (2016)

We tasked Opus 4.6 using agent teams to build a C Compiler

Understanding Neural Network, Visually

Recreating Epstein PDFs from raw encoded attachments

Show HN: Artifact Keeper – Open-Source Artifactory/Nexus Alternative in Rust

Coding Agents and Use Cases

Animated Knots

The time I didn't meet Jeffrey Epstein

The RCE that AMD won't fix

Unlocking high-performance PostgreSQL with key memory optimizations

How to carry more than your own bodyweight (2025)