Ask HN: Why aren't AIs being used as app beta testers yet?

6•amichail•4h ago

For example, why don't beta testing services such as TestFlight have ChatGPT as a possible beta tester along with the human testers?

Comments

duxup•4h ago

I'm going to throw out my own ignorant theory.

AIs that I find useful are still just LLMs and LLMs power comes from having a massive amount of text to work with to string together word math and come up with something ok. That's a lot of data that comes together to get things ... kinda right... sometimes.

I don't think there's that data set for "use an app" yet.

We've seen from "AI plays games" efforts that there have been some pretty spectacular failures. It seems like "use app" is a different problem.

cheevly•53m ago

LLMs have literally won Pokemon. Im pretty sure that using an app is 10x simpler.

Vilian•33m ago

A lot simpler to run pokemon than test an app, the game play by itself sometimes

v5v3•4h ago

Are llm testers doing anything traditional scripts with for loops can't?

postalrat•1h ago

llm testers have for loops so they can do everything traditional scripts with for loops can plus more.

afrederico•4h ago

They should totally be able to. If there's "vibe coding" there should be "vibe testing." We're working on just such a product (https://actory.ai); right now it only does websites but just imagine when we turn it on mobile/apps, etc. How cool would that be?

aristofun•3h ago

Because for meaningful tests of an app (assuming b2c or b2b for end users) you are supposed to be or imitate a human being.

Current AI is not even designed to do that. It is just a very sophisticated auto-complete.

It is sophisticated enough to fool some VCs that you can chop your round peg into square hole. But there is no ground to expect a scalable solution.

drakonka•3h ago

They are; we're working on agents for web application testing over at qa.tech.

HeyLaughingBoy•1h ago

Anecdotally, I know someone who tried to have ChatGPT generate unit tests and it was an abject failure.

cheevly•54m ago

I know someone that generated unit tests successfully.

whoknowsidont•27m ago

And I know exactly which one of these is an enterprise B2B app/platform.

danbrooks•1h ago

I worked with a team that did this for the Facebook app.

https://engineering.fb.com/2018/05/02/developer-tools/sapien...

Getting Started with Nvidia CuOpt

Supreme Court limits nationwide injunctions in birthright citizenship order

Helsinki turns to AI to spot e-scooter crashes before they happen

I'm analyzing 1000 indie hackers landing pages

Gentle gripper gives leaves a 'shot' of sensors and genes for smart farming

Demystifying AI 'Computer Use': Building GUI Automation with AI Workflows

Axios’ Sara Fischer in conversation with Cloudflare’s Matthew Prince [video]

LangChain vs. Langfuse: Key Differences and Their Role in LLM App Development

New Vulnerabilities Expose Brother Printers to Hacking

SymbolicAI: A neuro-symbolic perspective on LLMs

Rust 1.88

Notes on Epistemic Collapse

Colour e-paper weather display

OpenAI, Microsoft Rift Hinges on How Smart AI Can Get

Google begins rolling out AI search in YouTube

nimbme – Nim bare-metal environment

New Process Uses Microbes to Create Valuable Materials from Urine

Ask HN: Why don't OSes automatically download apps/features that you might like?

Vibe Coding Is Not an Advantage

Ask HN: Why doesn't HN have notifications for replies?

Show HN: Open-Source International Space Station Tracker ESP32/Arduino for $20

Show HN: IssuePay – Get paid for open-source contributions

15 AI Coding Agents evaluated with the same prompt

AI is ruining houseplant communities online

Show HN: Super simple, automated email marketing tool for YouTubers

The software engineering "squeeze"

How did China come to dominate the world of electric cars?

How Chinese Carmakers Doubled Their Share of the European Market

Dia by the Browser Company

Blind spots on American cars are expanding