frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

We built automated testing for vibe-coded apps

2•MatveyF•4h ago
Hi HN! We built buffalos.ai because we got tired of users finding bugs we missed.

The problem: AI tools like Cursor made us 10x faster at shipping, but manual testing is still slow.

Buffalo spawns browser agents that click everything users would click, in ways you didn't test. They find the bugs before your users do.

How it works:

1. Paste your staging URL 2. Agents systematically test all interaction paths 3. Get a detailed report with scoring for different category

Would love your feedback.

Free during beta: buffalos.ai

Comments

codingdave•4h ago
Writing a crawler to hit all possible links and interactions is not the tricky part. Actually understanding the expected behavior, which is not always what the code says it should be, is the tricky part. Without someone actually creating specific assertions and acceptance criteria, this seems like a flawed concept, as it might catch some bone-headed "Oh, clicking here breaks it" mistakes, but those are not the bugs that most teams fight.
MatveyF•2h ago
Yes, you’re right! Our roadmap and tools are focused on helping non-technical users who build apps with platforms like Lovable, v0, etc. The “bug” isn’t just about clicking a link and it not working. For example, we’re also working on things like: 1. Branding analysis and design drift detection (this happens a lot in vibe-coded designs — e.g., one page uses a purple gradient, another uses a completely different style, which breaks consistency). 2. checking how a new user would actually interact with your app, to see if the flow makes sense.

A crawler alone is just a static check. Our north star is trying to fix the last step from vibe coding to the production, like spawning a browser user agent that actually navigates through the app like a first-time user. Another roadmap idea is to build a browser agent that connects with coding AIs (like Cursor or Claude) to provide automated testing right after a feature is implemented. For example, when we use Claude to build something, we constantly have to flip between the terminal and browser to keep design consistent. The thought is: what if Claude’s code had “eyes” to test what it just built in real time?

Overall, thanks so much for your response. honestly I didn’t expect anyone to reply. This is my first time on HN, and it’s been an awesome interaction!

Ask HN: What's a good 3D Printer for sub $1000?

213•lucideng•3d ago•273 comments

Ask HN: Walled garden dwellers: What keeps you there?

6•FlyingAvatar•1h ago•4 comments

Ask HN: How were graphics card drivers programmed back in the 90s?

4•ferguess_k•6h ago•7 comments

We built automated testing for vibe-coded apps

2•MatveyF•4h ago•2 comments

Tell HN: Apple Broke Fitts' Law in Tahoe

30•dmd•11h ago•19 comments

Ask HN: LLM Prompt Engineering

3•Scotrix•10h ago•3 comments

I launched a Mac utility; now there are 5 clones on the App Store using my story

127•tTarnMhrkm•2d ago•132 comments

Ask HN: What Are You Reading?

9•ImPleadThe5th•1d ago•32 comments

Ask HN: What Terminal apps (via homebrew) support 24 bit color on macOS Tahoe?

4•amichail•1d ago•8 comments

Paid $2400 to Cloudflare, support refuses to help

142•thekonqueror•3d ago•29 comments

Ask HN: How can we reliably determine if text was written by AI?

4•denis_dolya•7h ago•6 comments

Ask HN: Generalists, when do you say "I know enough" about any particular topic?

32•AbstractH24•2d ago•85 comments

Ask HN: Dark Mode for HN?

43•todotask2•6h ago•41 comments

Ask HN: How to be ambitious/hungry again?

9•Poomba•20h ago•15 comments

Is the era of personal software portfolios over?

10•justanotherunit•1d ago•9 comments

Ask HN: How to deal with fake job applicants?

16•rswerve•1d ago•24 comments

Ask HN: Is it immoral not to correct someone else's grammar on social media?

2•amichail•1d ago•27 comments

Ask HN: Why isn't capability-based security more common?

12•killerstorm•2d ago•21 comments

Ask HN: Is Claude Code less useful in recent weeks for you?

9•vintagedave•1d ago•11 comments

Advertising in Microsoft Excel

12•BLKNSLVR•1d ago•8 comments

You've reached the end!