frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: How do you test AI-generated code?

2•df003•2h ago
When AI generates code, I first instruct the model to find, fix, and verify any issues. After that, I start the server and test whether it actually works from the user’s perspective.

What I’m looking for is a workflow where issues are received, fixed, tested, and deployed—but it seems that current AI agents aren’t very good at performing browser tests from the user’s perspective.

I’ve tried using the built-in browsers in Codex and Cursor, but they often only checked whether the page loaded. In the end, I had to instruct them step by step on what to do, and it turned out to be cheaper and faster for me to test it myself.

So I’m curious to know how you’ve set up test automation. Are there any services that do this (for individuals, not just enterprises)? If you’re using a harness like Codex, I’d like to know what instructions and skills are needed to get it to perform tests from the user’s perspective.

Comments

dutchcode•1h ago
Blinkof.ai can fix a lot of it for you, it will signup/login, walk through pages and return issues with a prompt to fix it.
rishabhpoddar•1h ago
I ask it to write unit test with close to 100% coverage. This has worked well for me so far

Ranked: Countries Spending the Most on Research and Development

https://www.visualcapitalist.com/ranked-countries-spending-most-on-r-and-d/
1•theanonymousone•3m ago•0 comments

Smart Hotel Management Software for Hotels, Resorts and Vacation Rentals

https://app.notion.com/p/Smart-Hotel-Management-Software-for-Hotels-Resorts-Vacation-Rentals-de44...
1•jackarnold•6m ago•0 comments

"Start with a Monolith" Was Good Advice. AI Is Changing That

https://medium.com/@pivotfakie/start-with-a-monolith-was-good-advice-ai-is-changing-that-a2181b8e...
1•feeblefakie•7m ago•0 comments

How to Apply Google's Open Knowledge Format (OKF) on Enterprise Level

https://community.obsidian.md/plugins/vault-operator
1•pssah4•9m ago•1 comments

Full Metal Jacket. Copper Edition – Vollebak

https://vollebak.com/en-us/products/full-metal-jacket-copper-edition
1•evo_9•10m ago•0 comments

OpenAI Codex bombards SSDs with needless write operations, costing millions

https://www.theregister.com/ai-and-ml/2026/06/23/openai-codex-bombards-ssds-with-needless-write-o...
1•jonbaer•12m ago•0 comments

The Digital Sovereignty Trap

https://statedept.substack.com/p/the-digital-sovereignty-trap
1•ryzvonusef•13m ago•0 comments

PixelSmash – FFmpeg's MagicYUV decoder vuln leads to RCE via media file

https://jfrog.com/blog/pixelsmash-critical-ffmpeg-vulnerability-turns-media-files-into-weapons/
1•n0on3•14m ago•0 comments

AI Steps Off the Screen

https://epics.tech/posts/2026-06-23-ai-steps-off-the-screen/
2•epicsagas•15m ago•0 comments

Benchmark object storage in objects/s, not GB/s

https://fractalbits.com/blog/objects-per-second/
7•zzsheng•26m ago•0 comments

Dietary guidelines do not yield sufficient flavanol for cardiovascular benefit

https://pubs.rsc.org/en/content/articlehtml/2026/fo/d6fo00867d
2•littlexsparkee•27m ago•0 comments

AxLLM

https://axllm.dev/
2•handfuloflight•29m ago•0 comments

RIP Fable

https://fable.rip
2•opndragoon•31m ago•0 comments

Lucid to lay off roughly 18% of U.S. workforce, COO Marc Winterhoff leaves

https://www.cnbc.com/2026/06/22/lucid-layoffs-evs.html
2•mgh2•37m ago•0 comments

Clean sweep for Mamdani-backed candidates in New York's Democratic primary

https://www.bbc.com/news/articles/clye652m41po
2•mikhael•47m ago•0 comments

2026 vs. 1996 Chevrolet Blazer IIHS crash test

https://www.youtube.com/watch?v=4U8Ero-3GxI
3•plun9•48m ago•2 comments

VoltanaLLM: Energy-Efficient LLM Serving

https://supercomputing-system-ai-lab.github.io/projects/voltana/
2•matt_d•51m ago•0 comments

2003-era DDR2 memory prices jump up to 60%

https://www.tomshardware.com/pc-components/dram/ddr2-memory-prices-jump-up-to-60-percent
2•pkaeding•52m ago•1 comments

Sakana Fugu Technical Report

https://www.chapterpal.com/s/7ff4f6ba/sakana-fugu-technical-report
1•theanonymousone•52m ago•1 comments

Show HN: Deploy to Vercel, Netlify, Railway, Render, Cloudflare in 1 Command

https://xiaohou2503687-design.github.io/shipfast-oss/
1•shipfastai•52m ago•0 comments

Intel shareholder sues to void deal giving U.S. gov $11B in stock for free

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6985440
4•de6u99er•55m ago•1 comments

Sakana Fugu Ultra promises to deliver "the best frontier-level performance"

https://www.theverge.com/ai-artificial-intelligence/953904/sakana-fugu-ai
1•theanonymousone•1h ago•1 comments

TSMC: 36.1 A 32Gb/s 10.5Tb/s/mm 0.6pJ/b UCIe-Compliant Low-Latency Interface 3nm

https://ieeexplore.ieee.org/document/10904767
3•Alien1Being•1h ago•0 comments

Trump Gets Negative Reviews Internationally as Fewer Say US Is Reliable Partner

https://www.pewresearch.org/global/2026/06/23/trump-gets-negative-reviews-internationally-as-fewe...
4•Bondi_Blue•1h ago•0 comments

OpenAI spending hit $34B last year ahead of planned IPO

https://www.ft.com/content/e15b0d7e-ff6b-4f16-ba7a-4068feddb828
2•1vuio0pswjnm7•1h ago•1 comments

The Junior Developer Problem Is Becoming a Senior Developer Problem

https://www.vincentschmalbach.com/the-junior-developer-problem-is-becoming-a-senior-developer-pro...
4•vincent_s•1h ago•0 comments

Show HN: Fork.ai – branch any AI answer into a mind map instead of a chat log

https://forkai.in
1•gokulmc•1h ago•0 comments

Conspiracy Theories, Spontaneous Orders, and Global Politics [pdf]

https://isonomiaquarterly.com/wp-content/uploads/2025/11/massimino-pfwo.pdf
2•brandonlc•1h ago•1 comments

Lippmann Color Plates

https://www.eastman.org/event/workshops/lippmann-color-plates
1•andsoitis•1h ago•0 comments

Statement from Five Eyes agencies on cyber risk

https://www.ncsc.gov.uk/news/the-ai-shift-in-cyber-risk-why-leaders-must-act-now
2•reasonableklout•1h ago•1 comments