frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Showboat and Rodney, so agents can demo what they've built

https://simonwillison.net/2026/Feb/10/showboat-and-rodney/
44•simonw•1h ago

Comments

eliben•1h ago
Very interesting! I encountered the problems these tools are trying to tackle just recently while trying to guide an agent into creating an in-browser tool for me. Closing the loop on a web interface isn't as simple as CLI-only tools. I should give this a try.

It's also interesting that you've shifted to Go for your agent-coded CLI tools, Simon.

simonw•1h ago
I'm dabbling with Go at the moment for small tools, mainly as an excuse to learn a new language but also because having a single standalone binary is convenient for shuttling these tiny little tools around.

... but then I'm mostly running them with "uvx name-of-tool" because it turns out Python's packaging infrastructure for binary tools is so good!

markusw•39m ago
If you're coming from the Python world, definitely. I find `go install github.com/simonw/rodney@latest` equally easy. :D Although you need the Go tooling installed, of course. But so much agree, Go is great for CLIs!
eliben•38m ago
Right, standalone binaries for CLI tools is great. And if one has Go installed, they can just `go run ...` any tool from its GitHub path, all installation/build/caching happens automagically (meaning the execution is immediate after the first run).

But I can definitely see how someone with `uv` muscle memory wants everything in the same command.

`uv` is the best thing that happened to the Python ecosystem since... I don't know... maybe Numpy.

saberience•1h ago
Sounds like both of these tools could be one shot by either Claude or Codex.

Or alternatively, just be a skill versus a tool.

My “agents” already demo stuff all the time by just being prompted to do so. I have notations in my standard Agents.md for how I want my documentation, testing etc.

simonw•59m ago
They kind of were one-shotted by Claude. The value is in coming up with a consistent design and good enough --help that you can prompt:

  Run uvx showboat --help and
  uvx rodney --help and use those
  tools to demo the feature you built
The help text effectively doubles as a skill.
markusw•35m ago
I guess it would still make sense to have "demo" and "browser-use" skills, so that the agent can reach for them proactively? I always try to remove as much friction as possible for myself, one little bit at a time.
simonw•24m ago
My problem is that I work in dozens of different repos generally using Claude Code for web, which doesn't have a way to install extra global skills yet.

I don't want to duplicate my skills into all those repos (and keep them updated) so I prefer the "uvx tool --help" pattern.

tardismechanic•58m ago
See also (the confusingly named) playwright-cli

https://github.com/microsoft/playwright-cli

Different from the cli used for running tests etc that comes bundled with PlayWright

Sample use:

  playwright-cli open https://demo.playwright.dev/todomvc/ --headed
  playwright-cli type "Buy groceries"
  playwright-cli press Enter
  playwright-cli type "Water flowers"
  playwright-cli press Enter
  playwright-cli check e21
  playwright-cli check e35
  playwright-cli screenshot
simonw•57m ago
Yeah that's an excellent option for this kind of thing too.
markusw•37m ago
Oh, I hadn't seen that one either, thanks for sharing. Here I am still using the Chrome Devtools MCP like a caveman. :D
toastal•39m ago
If agents can generate text so easily, why would they be limited to Markdown instead of reStructuredText, AsciiDoc, or LaTeX which have rich features that help users understand text? I can understand developers refusing to adopt proper formats for documentation, but this seems odd for the bots. It doesn’t even generate the correct syntax block in Markdown using “bash” instead of “sh-session”.
giancarlostoro•33m ago
I think its primarily because that is the most common formatting in every editor now? I could be wrong. Markdown has become the standard for README files for over a decade now.
toastal•25m ago
Winning a popularity contest doesn’t mean it’s good. That is the worst part of about these things as they just generate the most common denominator type code/tooling while also repeating anti-patterns/mistakes like the bash vs. sh-session/console issue I pointed out. Garbage in has been so much garbage out unfortunately.
giancarlostoro•23m ago
Never said it was good, just making an observation that Markdown is most likely to be available to render OOTB in more editors. I don't think Markdown is bad necessarily either. It's "good enough" for simple document.
simonw•26m ago
Markdown has the widest tool compatibility - GitHub renders it, so does VS Code and many other editors and file hosts.

I didn't know about sh-session, is that documented anywhere?

giancarlostoro•34m ago
I'll be sure to try these out. I've been building my own alternative to Beads with a concept called "gates" which do not let you close tasks as complete until a gate passes. Would love to throw these in as "gates" for my current workflow.
Hansenq•30m ago
I was a bit confused as to how everything works until I read it in detail. Really cool tools, but I think one thing that would help in the introduction is: saying explicitly that the generated .md document is for you (the user) to read through, observe the output of the CLI call, and ensure that the output matches what you would expect.

It's basically an automated test, but at a higher abstraction level and with manual verification--using CLI tools rather than a test harness. Really great work!

nzoschke•29m ago
go-rod has been instrumental to my agentic coding loops too. Some uses:

- E2E testing of browser components

- Taking screenshots before and after and having Claude look at them to double check things

- Driving it with an API and CLI as a headless browser

Will definitely give Rodney a look.

measurablefunc•26m ago
Google's antigravity does this automatically by creating Task & Walkthrough artifacts.
johnfn•1m ago
Out of curiosity, what is the advantage of using Rodney when Playwright has the same set of features and AI understands how to write a Playwright script very well?

Rust Coreutils v0.6.0 Release: 96.28% GNU compatibility with safety improvements

https://github.com/uutils/coreutils/releases/tag/0.6.0
1•maxloh•1m ago•0 comments

The first signs of burnout are coming from the people who embrace AI the most

https://techcrunch.com/2026/02/09/the-first-signs-of-burnout-are-coming-from-the-people-who-embra...
1•gurjeet•1m ago•0 comments

Show HN: OS Receipt reader with Subconscious agents and Reducto document parsing

https://receipt-tracker-agent-delta.vercel.app/
1•ohstep23•2m ago•0 comments

The Babies Kept in a Mysterious Los Angeles Mansion

https://www.newyorker.com/magazine/2026/02/16/the-babies-kept-in-a-mysterious-los-angeles-mansion
2•DustinEchoes•3m ago•1 comments

Show HN: Get notified when a webpage changes

1•goldminator•3m ago•0 comments

Agent Wars – AI agents battle in coding challenges, humans bet with SOL

https://www.agentwars.gg
1•itsjoaki•4m ago•0 comments

The Guide to Deductibles and Financial Strategy

https://suretyinsights.com/blog/master-your-risk-the-ultimate-guide-to-deductibles-and-financial-...
2•engelo_b•6m ago•0 comments

Show HN: Ask your AI what your devs shipped this week

1•inferno22•6m ago•0 comments

Slidge.im: Gateways from XMPP to Other Networks

https://slidge.im
1•MarsIronPI•7m ago•0 comments

Slides from my AI presentation I gave to seniors, feel free to share

https://aititus.com/presentations/superpower/
1•titusblair•13m ago•0 comments

Fun with Algebraic Effects – From Toy Examples to Hardcaml Simulations

https://blog.janestreet.com/fun-with-algebraic-effects-hardcaml/
1•weinzierl•13m ago•0 comments

How did Windows 95 get permission to put the Weezer video Buddy Holly on the CD?

https://devblogs.microsoft.com/oldnewthing/20260210-00/?p=112052
1•ingve•15m ago•1 comments

Show HN: Claworc – Manage multiple OpenClaw instances from a single dashboard

https://github.com/gluk-w/claworc
1•Dm_Linov•16m ago•0 comments

Yazi – fast terminal file manager written in Rust, based on async I/O

https://github.com/sxyazi/yazi
1•modinfo•17m ago•1 comments

Code Archaeology: Two Minute Time Lapse of Claude C Compiler [video]

https://www.youtube.com/watch?v=c9P89fe4WQk
1•crondee•17m ago•0 comments

Howard Lutnick admits visiting Epstein island during family vacation

https://www.cnbc.com/2026/02/10/lutnick-epstein-island-vacation-congress.html
1•belter•17m ago•1 comments

Games Console (2018)

https://mitxela.com/projects/console
2•kohlschuetter•17m ago•0 comments

Ask HN: How to handle a 'Junior' title when Im doing Mid-level work? Time to go?

1•sieep•17m ago•0 comments

Schedules of Reinforcement in Psychology (Examples)

https://www.simplypsychology.org/schedules-of-reinforcement.html
1•walterbell•18m ago•0 comments

Ashwagandha is having a moment–researchers want to take this shrub further

https://phys.org/news/2026-01-ashwagandha-moment-shrub.html
2•PaulHoule•18m ago•0 comments

Mirage: A new breed of sampler, powered by tiny generative audio models

https://evanking.io/posts/mirage/
1•evmaki•18m ago•0 comments

I made Seedance 2.0 accessible before the official API launches

https://seedance2-pro.com
1•samidatikakr•19m ago•0 comments

Deepfaking Orson Welles's Mangled Masterpiece

https://www.newyorker.com/magazine/2026/02/09/deepfaking-orson-welless-mangled-masterpiece
2•CharlesW•19m ago•0 comments

China's Data Center Boom: A View from Zhangjiakou (2025)

https://sinocities.substack.com/p/chinas-data-center-boom-a-view-from
4•fzliu•21m ago•0 comments

Video can be "recovered" from Nest cameras even without cloud subscription

https://www.nbcnews.com/news/us-news/authorities-release-surveillance-photo-potential-subject-nan...
2•mv4•21m ago•1 comments

ICE defies judges' orders to release detainees, step by step

https://www.politico.com/news/2026/02/10/ice-immigration-detention-court-orders-00771727
4•SilverElfin•21m ago•0 comments

Introducing winpulse

https://xenodium.com/introducing-winpulse
1•xenodium•23m ago•0 comments

'E-bike for your feet': How bionic sneakers could change human mobility

https://www.npr.org/2026/02/10/nx-s1-5698195/nike-amplify-bionic-sneakers
3•apparent•23m ago•0 comments

New ARIA research funding programme: nearly £50M to secure AI agents in the wild

https://www.aria.org.uk/opportunity-spaces/trust-everything-everywhere/scaling-trust/funding/
1•multiagent•23m ago•0 comments

Digital Sovereignty Won't Save Us from Internet Shutdowns

https://www.ictworks.org/digital-sovereignty-wont-save-us-from-internet-shutdowns/
2•laurex•26m ago•0 comments