frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Launch HN: TesterArmy (YC P26) – Agents that test web and mobile apps

https://tester.army
23•okwasniewski•1h ago
Hey HN - we’re Oskar, Szymon, and Piotr, and we’re building TesterArmy (https://tester.army). TesterArmy is an agentic testing platform that runs end-to-end checks before deployment and in production. Instead of wasting hours on manual testing or maintaining static scripts, we let you specify your tests in natural language and handle everything in between. We've built the platform fully around agents. Our agent will reliably execute the tests, but your coding agent can manage everything in our platform, from defining tests in natural language to running them on your behalf.

Check out our demo video: https://www.youtube.com/watch?v=291IkUbPrlk.

We started TesterArmy because testing is still far too painful. AI coding tools have made it dramatically faster to write and ship code, but testing is still a bottleneck. Traditional E2E tests are slow to set up and expensive to maintain. Managing auth and test users is painful. Setting up staging environments is painful. Running tests reliably is painful.

We think most teams do not actually want to spend their time writing selectors or maintaining test infrastructure. They just want confidence that their core flows work. With TesterArmy, an engineer can sign up, give an agent our CLI, and let it handle creating tests and running them on schedule or on GitHub.

When something breaks, TesterArmy alerts your team through Slack or Discord.

Over the past few months, we scaled from 0 to 30+ teams using our product every day. We caught bugs in critical flows, including onboarding, checkout, and AI chat. We've got many of our customers migrating from already established competitors to us because of the quality and reliability of our agents.

Here are a few of the recent bugs that our agent found (there were quite a lot of them!):

1) Timezone bug that affected the booking flow in one of our clients' apps, the dashboard was very complex and hard to catch by a human. 2) Regression in agent orchestration that caused a sandboxed environment to be stuck on loading, thanks to TesterArmy, the team was able to resolve it before it hit production. 3) Incorrectly counting the order amount in a complex dashboard flow with checkout, thanks to TesterArmy, the team was able to resolve it before it affected revenue 4) Catching a regression in an AI chat flow that would result in a user not being able to retrieve their data due to broken tool calling.

And many more, mostly related to some incorrect API calls, 404s, unhandled errors, etc.

If this sounds useful, we would love your feedback at https://tester.army. We have a bunch of free test runs for you to try. And don’t worry, we won’t make you do sales calls, and we don’t have long onboarding or annoying setup. Our goal is an it-just-works experience.

If you're looking for an end-to-end testing solution, we'd love to hear your feedback!

Comments

yohguy•1h ago
Does it work of mobile native applications or expo apps that have native modules?

Pricing question, the usage on the plans seems low considering in the demo you said that you have 25 tests per pr which would mean you get only 10 PRs per month on the hobby plan?

okwasniewski•45m ago
Yes, it works for any framework. We just get the built native binary and run it in the cloud.

Regarding pricing, the self serve options are currently only for lower usage. We will add more plans further down the line. Currently the most popular one is the startup plan. If you need more usage I’m happy to discuss it on a call!

msencenb•56m ago
Have you been able to nail down a loop where your tool can take an open pr, guess the code path and do some testing?

We use cypress heavily for our core flows which has a similar ai prompt thing but it’s not quite ad hoc enough for smaller fixes which is where the bottleneck still comes in for us.

okwasniewski•16m ago
Yes! We spent quite a lot of time on this, and we are currently creating a test plan based on PR changes and sending an agent to verify it. We have some customers who are only using this feature.
dbbk•49m ago
"Traditional E2E tests are slow to set up and expensive to maintain." I don't really understand this. If I'm already using Opus to write the code, surely it would know best what E2E tests to write to be able to verify its own output? This seems like an unnecessary external step.
okwasniewski•38m ago
Unfortunately from our experience tests don’t scale as well as code. First of all static tests are very brittle, you rely on selectors, need wait times and can’t really test a lot of dynamic content (think AI chats/interactions). Then it’s all the infrastructure around it: solving captchas, handling auth, handling email OTP (each of our agents has access to its own inbox) and handling video recording and screenshots. So with the traditional testing approach you end up mocking a lot of services. I highly recommend you to give it a try!
iknownthing•17m ago
.army?
okwasniewski•15m ago
We are thinking whether to change this.. We also have testerarmy.com/.ai
rpunkfu•4m ago
Congratulations on launch, I’ve been tracking your progress since you’ve been accepted for spring batch.

Always happy to see cool products from Poland! :)

Ask HN: Is there a way to stop the animated Google Doodles?

1•arnejenssen•2m ago•0 comments

SRAM Wall Art

https://old.reddit.com/r/chipdesign/comments/1u99eio/256x8_sram_wall_art/
1•random__duck•2m ago•0 comments

LLMs Put Style over Substance, You Should Put Substance over Style

https://www.felixhaba.com/writing/llms-put-style-over-substance/
1•feliixh•2m ago•0 comments

The Harajuku Moment

https://tim.blog/2024/02/09/harajuku-moment/
1•abhaynayar•2m ago•0 comments

Show HN: We ran 74 popular MCP servers in microVMs to see what breaks

https://usethrone.dev/registry
1•imtaimoorkhan•3m ago•0 comments

What We Know About Billionaire Peter Thiel's 'Dialog' Society

https://www.forbes.com/sites/maryroeloffs/2026/06/18/what-we-know-about-billionaire-peter-thiels-...
1•sreekanth850•4m ago•3 comments

Show HN: Emacs log-mode (cheap copy of Logseq)

https://github.com/luqtas/log-mode
1•luqtas•4m ago•0 comments

Ellf: Virtual NLP Engineer

https://beta.ellf.ai/
1•paffdragon•6m ago•0 comments

Global Freedom and Democracy Indices

https://www.amos.design/the-civic-atlas
1•bookofjoe•7m ago•0 comments

LLM biased against accessible code (Claude Code issue #56079)

https://www.aaron-gustafson.com/notebook/2026-06-17-llm-biased-against-accessible-code/
1•robin_reala•7m ago•0 comments

JetBrains IDE Expertise, Now on LinkedIn

https://blog.jetbrains.com/blog/2026/06/17/your-jetbrains-ide-expertise-now-on-linkedin/
1•WhiteDawn•8m ago•0 comments

SQLite Cloud SQLite database with real-time synchronization

https://www.sqlite.ai
1•Asfand3099•9m ago•0 comments

GLM-5.2 is probably the most powerful text-only open weights LLM

https://simonwillison.net/2026/Jun/17/glm-52/
5•Brajeshwar•10m ago•0 comments

Double Entry Programming

https://www.0xsid.com/blog/double-entry-programming
2•ssiddharth•12m ago•0 comments

What is the best Duolingo for X thing to build?

2•Lil-Finance-Bro•12m ago•1 comments

Windows 93

http://windows93.net/
2•xg15•14m ago•0 comments

Show HN: StartupWiki, a free alternative to crunchbase/pitchbook

https://startupwiki.tech/
1•shpran•15m ago•0 comments

AI-Native Firms

https://twitter.com/orgRem/status/2067318661669372196
1•jeffreyrogers•15m ago•0 comments

Cultivating Interests in Undergrad

https://bcmullins.github.io/favorite-books-from-undergrad/
1•wannabebarista•16m ago•0 comments

EOL – Find an audience from public posts, talk to it as one

https://www.earthonlines.com/product
1•BonanKou•16m ago•0 comments

Trump administration Backs Off Plan to End Ocean Monitoring

https://www.nytimes.com/2026/06/18/climate/trump-ocean-observatories-initiative.html
5•burkaman•16m ago•1 comments

The Makings of a Good Bioweapon

https://www.owlposting.com/p/the-makings-of-a-good-bioweapon
1•crescit_eundo•17m ago•0 comments

Coinbase AI Adviser

https://www.coindesk.com/business/2026/06/16/coinbase-intoduces-ai-advisor-stock-options-and-pre-...
1•AnhTho_FR•17m ago•0 comments

Apple's Tim Cook Says Price Increases Are 'Unavoidable'

https://www.cnet.com/tech/mobile/apples-tim-cook-says-price-increases-are-unavoidable/
1•speckx•20m ago•0 comments

Prof. Sarah Paine – Round Up of Grand Strategy and Geopolitics

https://www.youtube.com/watch?v=OS1NZLgKM2c
1•lifeisstillgood•22m ago•0 comments

September 2025 NPM Attack Hit 2.6B Weekly Downloads. Most Found Out on Twitter

https://datanexusmcp.com/blog/npm-supply-chain-attack-2025/
2•jsmudda•22m ago•1 comments

Show HN: Motion-contact-sheet – give a coding agent eyes for motion

https://github.com/Kallin/motion-contact-sheet
1•kal9000•23m ago•0 comments

Show HN: Jsonl-tools – secure paste bin for agent run traces

https://jsonl-tools.dev/
2•vierliam•23m ago•0 comments

Show HN: One hundred LLMs Generating a HTML/CSS Solar System

https://aibenchy.com/showcase/solar-system-animation/
2•XCSme•24m ago•0 comments

Show HN: Asili – open-source, privacy-first in-browser DNA PGS scoring

https://github.com/techninja/asili
2•techninja42•26m ago•1 comments