Stop Writing Brittle Playwright Tests: Why YAML-Based Testing Is the Future

https://medium.com/@oxtiger/stop-writing-brittle-playwright-tests-why-yaml-based-testing-is-the-future-5cc90a81bfa2

6•suchuanyi•5h ago

Comments

shove•3h ago

So the answer to “how are we going to verify that vibe-coded application code does what we think it does” is “we’re going to vibe-code the tests too”?

meepmorp•3h ago

Don’t bother, man - it’s vibes all the way down.

ohdeargodno•2h ago

Replace your flaky UI tests with flaky LLM-based tests. At least when one inevitably fails, you can spend 45 minutes hunting for just the right prompt so that the LLM doesn't also try to click something unrelated!

Most of the existing tools (are plain awful|work only on browsers|do magic behind the scenes that makes them non-repeatable|force best effort, hiding any validation). These tests are barely better than doing them by hand; at least nobody is burning their mind on a 250 test-case list for half a day.

Your primary UI testing tool should be accessibility. If your accessibility elements/descriptions aren't enough to test things, _then you aren't accessible enough_.

(Although I do agree that pure code-based tests mooost likely should go away, whether that's Playwright, Espresso, or any other tool. Maestro strikes the right balance between expressive YAML and openness to scripting when needed.)

bananapub•2h ago

Just in case you were thinking of wasting time on reading it, they put a helpful summary at the top:

> How a simple YAML configuration built for Claude Code and Playwright MCP transformed our testing workflow and made automation accessible to everyone on the team

Side note: in what order did it happen? Did Medium go from “one of the nicest publishing platforms on the web” to “pop-up-infested, search-engine-spamming garbage” before or after all the garbage blog spammers started using it?

moomin•2h ago

We used LLMs to reinvent Cucumber but worse.

Playwright tests are fine, but you need to think about the design or you end up with a mess. Using a steps file is one way to do it, but just employing coding discipline is another. Don’t expect to be able to slap 1000 lines of scripting code together and ignore everything you’ve already learned about structuring code.
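A minimal sketch of the "steps file" structure mentioned above, assuming a login flow. Every name here (`PageLike`, `LoginSteps`, `RecordingPage`, the `/login` URL, the field labels) is hypothetical and chosen only for illustration. `PageLike` stands in for a real browser page object so the sketch runs without a browser; it is not Playwright's API.

```typescript
// Hypothetical minimal page interface; a real suite would wrap its browser driver here.
interface PageLike {
  goto(url: string): Promise<void>;
  fill(label: string, value: string): Promise<void>;
  click(name: string): Promise<void>;
}

// Steps class: the only place that knows URLs, field labels, and button names.
class LoginSteps {
  private readonly page: PageLike;

  constructor(page: PageLike) {
    this.page = page;
  }

  async open(): Promise<void> {
    await this.page.goto("/login");
  }

  async signIn(user: string, password: string): Promise<void> {
    await this.page.fill("Username", user);
    await this.page.fill("Password", password);
    await this.page.click("Sign in");
  }
}

// In-memory fake that records each call in order, so the sketch needs no browser.
class RecordingPage implements PageLike {
  readonly log: string[] = [];

  async goto(url: string): Promise<void> {
    this.log.push(`goto ${url}`);
  }

  async fill(label: string, value: string): Promise<void> {
    this.log.push(`fill ${label}=${value}`);
  }

  async click(name: string): Promise<void> {
    this.log.push(`click ${name}`);
  }
}

// The scenario reads as intent; UI details stay inside LoginSteps.
async function loginScenario(page: PageLike): Promise<void> {
  const login = new LoginSteps(page);
  await login.open();
  await login.signIn("alice", "s3cret");
}
```

If a label or button name changes, only `LoginSteps` needs editing; scenarios that call `signIn` stay untouched, which is the kind of structure that keeps a large suite from turning into a mess.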

latsu•31m ago

AI slop article about using AI to write tests in a format that's worse than Cucumber...

Why would I bother to read the slop you couldn't even be bothered to write?
