Open-Source Agentic QA Harness with Memory

https://github.com/vostride/agent-qa
50•pranshuchittora•51m ago

Comments

pranshuchittora•50m ago
Hey, I am the creator of agent-qa.

Coding agents have accelerated software development, allowing folks to ship features at lightning speed, but whether the feature works in production without breaking existing behavior is still questionable.

Conventionally, either a software engineer or a QA engineer converts user stories / feature PRDs into composable end-to-end tests, allowing teams to catch regressions.

But with AI writing code, tests become the bottleneck. Though you can ask the coding agent to write tests, and it does write tests with reasonable correctness, AI greedily chases passing tests and sometimes bends the rules. Also, having access to the code allows it to write tests with shortcuts that might not mimic real user behavior.

With agent-qa, you can write tests in plain English (natural language). It is built upon battle-tested testing frameworks (Playwright for web and Appium for mobile). Playwright and Appium work as a kernel executing the planned actions, while AI runs in the harness doing observation -> planning -> executing planned actions (via kernel) -> self-healing (in case a planned action fails) -> verification.
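To make the loop above concrete, here is a minimal sketch of observe -> plan -> execute -> self-heal -> verify. It is not agent-qa's actual code: `FakeKernel`, `plan`, `heal`, and `run_step` are hypothetical names, the kernel is an in-memory stand-in rather than Playwright or Appium, and the planner returns a hard-coded stale selector where a real harness would presumably call an LLM.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class FakeKernel:
    """Hypothetical stand-in for the execution kernel (Playwright/Appium in the post)."""
    page: Dict[str, str] = field(default_factory=lambda: {"#login-btn": "Log in"})
    log: List[str] = field(default_factory=list)

    def observe(self) -> Dict[str, str]:
        # Return a snapshot of selector -> visible text.
        return dict(self.page)

    def click(self, selector: str) -> None:
        if selector not in self.page:
            raise LookupError(f"no element matches {selector}")
        self.log.append(f"click {selector}")
        # Simulate the app reacting to the click.
        self.page["#welcome"] = "Welcome back"


def plan(step: str, observation: Dict[str, str]) -> str:
    """Hypothetical planner; returns a stale selector on purpose to trigger healing."""
    return "#signin-btn"


def heal(step: str, observation: Dict[str, str]) -> str:
    """Hypothetical self-healing: pick the element whose text appears in the step."""
    for selector, text in observation.items():
        if text.lower() in step.lower():
            return selector
    raise LookupError("could not find a replacement element")


def run_step(kernel: FakeKernel, step: str, expect_text: str) -> bool:
    observation = kernel.observe()            # observation
    selector = plan(step, observation)        # planning
    try:
        kernel.click(selector)                # execution via the kernel
    except LookupError:
        selector = heal(step, observation)    # self-healing after a failed action
        kernel.click(selector)
    return expect_text in kernel.observe().values()  # verification
```

Calling `run_step(FakeKernel(), "Click the Log in button", "Welcome back")` fails the first click, heals to `#login-btn`, and then verifies against what is visible on the page rather than against the application's source.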

The agent also evolves with every test run. It generates learning & product memories from each run, improving itself over time.
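One way such per-run memory could work is sketched below. This is an assumption for illustration only, not how agent-qa stores or uses its memories: the JSON file layout and the names `load_memory`, `record_learning`, and `build_prompt` are all hypothetical.

```python
import json
from pathlib import Path
from typing import Dict, List


def load_memory(path: Path) -> Dict[str, List[str]]:
    """Read learnings from disk, or start empty on the first run."""
    if path.exists():
        return json.loads(path.read_text())
    return {"learnings": []}


def record_learning(path: Path, note: str) -> Dict[str, List[str]]:
    """Append a learning from the current run, skipping exact duplicates."""
    memory = load_memory(path)
    if note not in memory["learnings"]:
        memory["learnings"].append(note)
    path.write_text(json.dumps(memory, indent=2))
    return memory


def build_prompt(step: str, memory: Dict[str, List[str]]) -> str:
    """Prefix the next step with everything learned in earlier runs."""
    lines = [f"- {note}" for note in memory["learnings"]]
    return "Known learnings:\n" + "\n".join(lines) + f"\n\nStep: {step}"
```

Each run would call `record_learning` when it discovers something (for example, a healed selector), and later runs would feed `build_prompt` output to the planner.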

This is in an early stage, and I’m looking forward to your feedback.

Thanks!

Live Demo - https://vostride.com/demo/agent-qa
GitHub - https://github.com/vostride/agent-qa (consider giving it a star)

Good day!

mkdsf01•29m ago
That looks interesting
pranshuchittora•2m ago
Do give it a try https://vostride.com/docs/agent-qa/quickstart
willowwd9•27m ago
What's the need for this? I run Codex in a loop and it writes and runs the Playwright tests without any intervention.
pranshuchittora•29s ago
This is what teams are doing today. But LLMs have a tendency to greedily write tests, which leads to hacky tricks to make the test succeed.

agent-qa is a harness where Playwright works as the execution kernel and the LLM works as an observer, planner, and verifier.

ofdgdfkg9034•27m ago
Can I use it with claude code?
pranshuchittora•2m ago
Yes, https://vostride.com/docs/agent-qa/configuration/global-conf...

AI is likely to widen the gap between corporate giants and everyone else

https://finance.yahoo.com/markets/article/history-says-ai-is-likely-to-widen-the-gap-between-corp...
1•wslh•23s ago•0 comments

Ebola outbreak: WHO declares emergency, US restricts travel, American infected

https://arstechnica.com/health/2026/05/ebola-outbreak-who-declares-emergency-us-restricts-travel-...
1•Bender•31s ago•0 comments

Millions of merchants speak UCP

https://twitter.com/igrigorik/status/2056417991693312370
1•doppp•3m ago•0 comments

Googolplex Written Out

https://www.googolplexwrittenout.com/
1•syx•4m ago•0 comments

Gemini is in danger of going full Copilot

https://www.theverge.com/tech/931752/google-io-2026-gemini-icon-docs-workspace
1•ilreb•5m ago•0 comments

Gaussian Splat of a Strawberry

https://superspl.at/scene/84df8849
1•danybittel•5m ago•0 comments

Drug Development Failure: How GLP-1 Development Was Abandoned in 1990

https://muse.jhu.edu/verify?url=%2Farticle%2F936213&r=479323
1•Anon84•6m ago•0 comments

NosDAV: Nostr-native Solid storage server. Powered by JSS

https://nosdav.com/server/
1•sigalor•6m ago•0 comments

AdminForth – Open-source admin framework with a built-in AI agent [video]

https://www.youtube.com/watch?v=4tB8uzY__uk
1•nixy71•8m ago•0 comments

An open question about how AI agent skills should be distributed

https://github.com/hymhub/skill-indexer
1•1749207188•8m ago•0 comments

Trump admin creates $1.7B fund for allies of the president

https://www.cnn.com/2026/05/18/politics/trump-irs-lawsuit-fund-for-allies
2•tdeck•8m ago•0 comments

All the Bugs They Found

https://andreapivetta.com/posts/all-the-bugs-they-found.html
1•ziggy42•10m ago•0 comments

Show HN: Barstool, a Prettier macOS Menubar

https://barstool.lotl.dev/
1•thecommieaxo•11m ago•0 comments

Show HN: ShakeToFocus – blur everything except your active window on macOS

https://beinlife.gumroad.com/l/shake-to-focus
1•BeInLife•11m ago•0 comments

The million-dollar math problem hardly anyone is trying to solve

https://www.scientificamerican.com/article/the-riemann-hypothesis-is-a-million-dollar-math-proble...
1•baruchel•12m ago•0 comments

Before Making It Configurable

https://rugu.dev/en/blog/before-making-it-configurable/
1•kugurerdem•13m ago•0 comments

Adobe Lightroom CC on Linux via Wine

https://github.com/sander110419/lightroom-cc-on-linux
1•sdoering•16m ago•0 comments

Automate shitty tasks with dog agents

https://handinger.com
4•masylum•19m ago•0 comments

The founder's playbook: Building an AI-native startup

https://claude.com/blog/the-founders-playbook
3•nsoonhui•25m ago•0 comments

An uptime monitor that knows the difference between a blip and an outage

https://monitrova.com/
1•SourceCodeES•25m ago•0 comments

OpenAI, Microsoft and Friends Build a Better, More Scalable Ethernet

https://www.nextplatform.com/connect/2026/05/12/openai-microsoft-and-friends-build-a-better-more-...
2•rbanffy•26m ago•0 comments

Ascetic Computing

https://ratfactor.com/ascetic-computing
1•ingve•26m ago•0 comments

Growing bread queues in Gaza as Israel restricts fuel, flour imports

https://www.aljazeera.com/news/2026/5/18/growing-bread-lines-gaza-israel-restricts-fuel-flour
1•hebelehubele•29m ago•0 comments

How Socialism Could Work

https://confrontingcapitalism.substack.com/p/how-socialism-could-really-work
2•avidphantasm•32m ago•2 comments

Iran invites CNN to show "a call to arms", arming and training 7-8 year olds

https://www.cnn.com/2026/05/18/world/video/iran-tehran-call-to-arms-chance-intldsk
3•spwa4•35m ago•0 comments

Rust: Project Goals Update

https://blog.rust-lang.org/2026/05/18/project-goals-2026-04/
1•f311a•36m ago•0 comments

Bito's AI Architect Boosts Claude Opus's task success rate by 35%

https://bito.ai/benchmarks/swe-bench-pro-evaluation/
2•Sushrutkm•42m ago•0 comments

All the World's a Trading Zone, and All the Languages Merely Pidgins

https://everythingstudies.com/2017/04/29/all-the-worlds-a-trading-zone/
2•paraschopra•43m ago•0 comments

How to Lose a Fight (skillfully) (2011)

https://expertboxing.com/how-to-lose-a-fight-skillfully
2•nephihaha•44m ago•0 comments

Hunting orphan objects: 45% off our ClickHouse storage bill and a near data-loss

https://www.tinybird.co/blog/how-we-deal-with-cloud-orphan-objects
1•gnzjgo•45m ago•0 comments