frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Testronaut – AI-powered mission-based browser testing

https://testronaut.app/
4•scfast•5mo ago
Hi HN,

I’ve been working on a project called *Testronaut*, an autonomous testing framework that combines AI reasoning with real browser automation. The idea is to let you define end-to-end tests as “missions” in plain English, then have an agent run them through a real browser using Playwright.

Why I built this: I’ve often found end-to-end tests to be fragile, time-consuming to maintain, and difficult to scale. Testronaut tries to reduce the maintenance burden by using AI to adapt tests to small UI changes, while still producing a deterministic report of what passed/failed.

How it works: - Missions can be written as strings or functions. - The agent uses GPT-4o with a set of tools (click, type, navigate, get_dom, etc.) to interact with the page. Support for other LLMs/Models in the works. - Browser control is handled by Playwright. - Reports are generated in both JSON and HTML, with step-by-step breakdowns (including screenshots). - It runs locally via a CLI (`npx testronaut`) and doesn’t require any hosted service. You will need to provide your own OpenAI API key, however.

Current state: - Early days: it works for simple flows and demo apps, but I’m still tuning the reliability and efficiency. - It installs with one command and comes with a sample mission. - Open source on npm/GitHub.

Links: - Docs & quickstart: https://docs.testronaut.app - GitHub: https://github.com/mission-testronaut/testronaut-cli - npm: https://www.npmjs.com/package/testronaut

I’d love feedback from the HN community on: - Where this could be most useful (CI/CD? flaky test replacement? exploratory testing?). - What concerns you’d have about using an AI-driven test runner. - Any “gotchas” I should watch out for in early adoption.

Thanks for taking a look!

Comments

scfast•5mo ago
Hi HN, I’m the maker of Testronaut.

It’s an AI-powered testing framework that runs “missions” through a real browser (via Playwright). The goal is to reduce the brittleness of end-to-end tests by letting an agent adapt to small UI changes while still producing a clear pass/fail report.

Right now it works for simple flows like login/signup, outputs step-by-step reports with screenshots, and installs locally via CLI (`npx testronaut`). Still early, so I’d love feedback on where this might (or might not) fit into your workflows.

Happy to answer questions — thanks for checking it out!

OpenAI's Latest Platform Targets Enterprise Customers

https://aibusiness.com/agentic-ai/openai-s-latest-platform-targets-enterprise-customers
1•myk-e•1m ago•0 comments

Goldman Sachs taps Anthropic's Claude to automate accounting, compliance roles

https://www.cnbc.com/2026/02/06/anthropic-goldman-sachs-ai-model-accounting.html
2•myk-e•4m ago•1 comments

Ai.com bought by Crypto.com founder for $70M in biggest-ever website name deal

https://www.ft.com/content/83488628-8dfd-4060-a7b0-71b1bb012785
1•1vuio0pswjnm7•5m ago•1 comments

Big Tech's AI Push Is Costing More Than the Moon Landing

https://www.wsj.com/tech/ai/ai-spending-tech-companies-compared-02b90046
1•1vuio0pswjnm7•7m ago•0 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
1•1vuio0pswjnm7•9m ago•0 comments

Suno, AI Music, and the Bad Future [video]

https://www.youtube.com/watch?v=U8dcFhF0Dlk
1•askl•11m ago•1 comments

Ask HN: How are researchers using AlphaFold in 2026?

1•jocho12•13m ago•0 comments

Running the "Reflections on Trusting Trust" Compiler

https://spawn-queue.acm.org/doi/10.1145/3786614
1•devooops•18m ago•0 comments

Watermark API – $0.01/image, 10x cheaper than Cloudinary

https://api-production-caa8.up.railway.app/docs
1•lembergs•20m ago•1 comments

Now send your marketing campaigns directly from ChatGPT

https://www.mail-o-mail.com/
1•avallark•23m ago•1 comments

Queueing Theory v2: DORA metrics, queue-of-queues, chi-alpha-beta-sigma notation

https://github.com/joelparkerhenderson/queueing-theory
1•jph•35m ago•0 comments

Show HN: Hibana – choreography-first protocol safety for Rust

https://hibanaworks.dev/
5•o8vm•37m ago•0 comments

Haniri: A live autonomous world where AI agents survive or collapse

https://www.haniri.com
1•donangrey•38m ago•1 comments

GPT-5.3-Codex System Card [pdf]

https://cdn.openai.com/pdf/23eca107-a9b1-4d2c-b156-7deb4fbc697c/GPT-5-3-Codex-System-Card-02.pdf
1•tosh•51m ago•0 comments

Atlas: Manage your database schema as code

https://github.com/ariga/atlas
1•quectophoton•54m ago•0 comments

Geist Pixel

https://vercel.com/blog/introducing-geist-pixel
2•helloplanets•56m ago•0 comments

Show HN: MCP to get latest dependency package and tool versions

https://github.com/MShekow/package-version-check-mcp
1•mshekow•1h ago•0 comments

The better you get at something, the harder it becomes to do

https://seekingtrust.substack.com/p/improving-at-writing-made-me-almost
2•FinnLobsien•1h ago•0 comments

Show HN: WP Float – Archive WordPress blogs to free static hosting

https://wpfloat.netlify.app/
1•zizoulegrande•1h ago•0 comments

Show HN: I Hacked My Family's Meal Planning with an App

https://mealjar.app
1•melvinzammit•1h ago•0 comments

Sony BMG copy protection rootkit scandal

https://en.wikipedia.org/wiki/Sony_BMG_copy_protection_rootkit_scandal
2•basilikum•1h ago•0 comments

The Future of Systems

https://novlabs.ai/mission/
2•tekbog•1h ago•1 comments

NASA now allowing astronauts to bring their smartphones on space missions

https://twitter.com/NASAAdmin/status/2019259382962307393
2•gbugniot•1h ago•0 comments

Claude Code Is the Inflection Point

https://newsletter.semianalysis.com/p/claude-code-is-the-inflection-point
4•throwaw12•1h ago•2 comments

Show HN: MicroClaw – Agentic AI Assistant for Telegram, Built in Rust

https://github.com/microclaw/microclaw
1•everettjf•1h ago•2 comments

Show HN: Omni-BLAS – 4x faster matrix multiplication via Monte Carlo sampling

https://github.com/AleatorAI/OMNI-BLAS
1•LowSpecEng•1h ago•1 comments

The AI-Ready Software Developer: Conclusion – Same Game, Different Dice

https://codemanship.wordpress.com/2026/01/05/the-ai-ready-software-developer-conclusion-same-game...
1•lifeisstillgood•1h ago•0 comments

AI Agent Automates Google Stock Analysis from Financial Reports

https://pardusai.org/view/54c6646b9e273bbe103b76256a91a7f30da624062a8a6eeb16febfe403efd078
1•JasonHEIN•1h ago•0 comments

Voxtral Realtime 4B Pure C Implementation

https://github.com/antirez/voxtral.c
2•andreabat•1h ago•1 comments

I Was Trapped in Chinese Mafia Crypto Slavery [video]

https://www.youtube.com/watch?v=zOcNaWmmn0A
2•mgh2•1h ago•1 comments