frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: How are teams validating AI-generated tests today?

2•sriramgonella•2h ago
With the rise of AI-assisted development, many tools generate tests automatically.

But validating whether those tests actually cover meaningful edge cases seems harder.

Curious how teams here handle this in real workflows.

Comments

david_iqlabs•2h ago
One thing I've noticed with AI generated tests is they can look very convincing even when they're wrong. The output reads confidently but there's not always anything grounding it in real signals.

I've found it works better when the AI is just explaining results that come from deterministic metrics rather than inventing the analysis itself.

Curious how other teams are dealing with that.

sriramgonella•1h ago
really good observation. The confidence of the output can sometimes mask the lack of grounding behind it. It almost feels like the emerging pattern is, let AI assist with generation and explanation, but keep the verification layer deterministic and measurable. Curious if you’ve seen teams building internal tooling around that, or if people are mostly relying on existing CI/testing framew
itigges22•1h ago
For security vunerability testing on websites I have been making for clients- I almost always hire a senior developer to look over the work and or tests that were created. AI can pass a test, and it can make something that passes a test, but there almost ALWAYS are problems that the senior dev finds with the tests, or with the code that was being tested. Sometimes AI will adjust the code entirely to pass the test or adjust the test to pass failing code.

Another counter-measure I have is to simply lock code before testing. Look over test files, and ensure its not following the happy path.

sriramgonella•1h ago
can we even depend on End to end Testing on this AI Tools? but how far these founders can able to rely on that with confidence. I totaly agree for VAPT it will be better

Alpine Linux on RISC-V virtual machine running in the browser via WebAssembly

https://github.com/edubart/webcm
1•lioeters•1m ago•0 comments

Show HN: Find Engineering Manager Jobs Efficiently

https://rolebeaver.com/
1•oah•1m ago•0 comments

Data centres affect the grid, but differently

https://switchgear-magazine.com/magazine/vol-03-issue-1/editorial-message-7/
1•taubek•2m ago•0 comments

The Pete Hegseth Exception

https://www.theatlantic.com/magazine/2026/04/signalgate-consequences-national-security/686056/
1•JumpCrisscross•3m ago•0 comments

I wrote a tutorial for the new Google Workspace CLI

https://nodeops.network/createos/docs/Integrations/Integration-Google-Workspace-CLI
1•julianapeace•3m ago•0 comments

You Bought the AI Licenses. Why Is Only One Developer Getting 10x Results?

https://skills.new/post/you-bought-the-ai-licenses-why-is-only-one-developer-getting-10x-results/
3•detkin•3m ago•0 comments

Datacenters are becoming a target in warfare for the first time

https://www.theguardian.com/technology/2026/mar/10/datacenters-target-warfare-iran
2•tomek_zemla•5m ago•1 comments

When Professional Looking Becomes Evidence

https://medium.com/@diamondjana/when-professional-looking-becomes-evidence-bd70ab204a49
1•janadiamond•7m ago•0 comments

Kazakhstan central bank to invest up to $350M in crypto asset markets

https://www.coindesk.com/business/2026/03/06/kazakhstan-central-bank-to-invest-usd350-million-wor...
1•janandonly•8m ago•0 comments

Exotic form of ice just got weirder

https://www6.slac.stanford.edu/news/2026-01-08-exotic-form-ice-just-got-weirder
1•whicks•8m ago•0 comments

Payphone Radio Stream

https://payphoneradio.com/
1•TigerUniversity•8m ago•0 comments

Experimental Ollama Reserach project for small LLMs

https://github.com/Infinibay/researcher
1•angaroshi•10m ago•1 comments

The U.S.‑Israel war with Iran could shatter the United Nations‑led global order

https://theconversation.com/the-u-s-israel-war-with-iran-could-shatter-the-united-nations-led-glo...
2•hkhn•11m ago•0 comments

The private alternative to Google and Apple Pay

https://walt.is
1•mngnt•11m ago•0 comments

Show HN: Inbox – An API and MCP server for managing DMs programmatically

https://docs.inboxapp.com
1•kevinpicchi•12m ago•1 comments

OpenAI on Surveillance and Autonomous Killings: You're Going to Have to Trust Us

https://theintercept.com/2026/03/08/openai-anthropic-military-contract-ethics-surveillance/
1•AndrewKemendo•13m ago•0 comments

Prediction market firms could be making $10B in yearly revenue by 2030

https://www.coindesk.com/markets/2026/02/24/from-niche-to-usd3-billion-run-rate-prediction-market...
1•PaulHoule•15m ago•0 comments

Multi-agent system for solopreneur ops (real-world architecture)

https://bleavens-hue.github.io/ai-agent-playbook/
1•agentplaybooks•15m ago•0 comments

Things I keep reminding myself about while working with AI Agents

2•wek•15m ago•1 comments

How to stop your AI agent from gaming its own KPI

https://sderosiaux.substack.com/p/how-to-stop-your-ai-agent-from-gaming
1•chtefi•16m ago•1 comments

Why do people participate in similar online communities?

https://mako.cc/copyrighteous/why-do-people-participate-in-similar-online-communities
1•jruohonen•17m ago•0 comments

It is time for the world to move on without the United States

https://www.aljazeera.com/opinions/2026/3/9/it-is-time-for-the-world-to-move-on-without-the-unite...
4•hkhn•17m ago•0 comments

I tried the top Linux terminal emulators so you don't have to

https://www.howtogeek.com/tried-top-linux-terminal-emulators/
1•losgehts•17m ago•0 comments

I'd rather be uncool than be a nihilist

https://12gramsofcarbon.com/p/id-rather-be-uncool-than-be-a-nihilist
2•theahura•17m ago•0 comments

The Unpredicted vs. the Over-Expected

https://kevinkelly.substack.com/p/the-unpredicted-vs-the-over-expected
1•surprisetalk•18m ago•0 comments

Being Maximally Useful Whilst Commuting

https://chillphysicsenjoyer.substack.com/p/being-maximally-useful-whilst-commuting
1•surprisetalk•18m ago•0 comments

You gotta think outside the hypercube

https://lcamtuf.substack.com/p/you-gotta-think-outside-the-hypercube
1•surprisetalk•18m ago•0 comments

Are Americans Getting Richer? New Data Might Surprise You

https://newsletter.humanprogress.org/p/are-americans-getting-richer-new
1•surprisetalk•18m ago•0 comments

Tech Worker Simulator

https://tech-worker-simulator.vercel.app/
1•joshcsimmons•18m ago•0 comments

Show HN: Get AI to write code that it can read

https://github.com/ELI7VH/wavelang
2•elijahlucian•19m ago•0 comments