frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Improving AI-generated tests using mutation testing

https://blog.senko.net/improving-ai-generated-tests-using-mutation-testing
1•senko•2h ago

Comments

derrak•1h ago
First, I have some advice if you are open to it. Apply a spell/grammar checker to your post before sharing it with other people. Your post has several typos and many people will stop reading after the first or second typo. “The author didn’t care enough to proofread, why should I bother reading.”

> Remember, this is important: do not look at the tests. If you let them into your context

This is a bad idea. You are trusting the LLM’s ability to follow instructions. Worse, depending on your harness the LLM might not even be able to follow these instructions. The harness may indiscriminately place code into the context in a way that is uncontrolled by the LLM.

A better idea is to modify your harness so that certain files are excluded from the context.

> I asked another AI to carefully review the tests and identify those that don't make sense.

Test validation is an entire area of research and I’m yet to be convinced that this is a task for LLMs.

senko•53m ago
> Apply a spell/grammar checker to your post before sharing it with other people.

Hey, at least you know it isn't LLM generated! :) Thank you - I usually do, here I obviously didn't. Appreciate the callout.

>> do not look at the tests. If you let them into your context

> This is a bad idea. You are trusting the LLM’s ability to follow instructions

Oh I'm bot, I manually checked what it was doing. I might set up the ignore explicitly if I turn this into a repeatable procedure.

A Bold New Test of Gravity: Roger Penrose, et al.

https://www.youtube.com/watch?v=CfjnTJos_no
1•mudil•1m ago•0 comments

Battle of U.S. rail barons: Merger is setting the industry on a collision course

https://www.thecanadianpressnews.ca/business/battle-of-the-rail-barons-how-a-merger-is-setting-th...
1•cf100clunk•3m ago•0 comments

Can It Resolve Doom? Game Engine in 2k DNS Records

https://blog.rice.is/post/doom-over-dns/
2•darccio•6m ago•0 comments

Facing US oil blockade, Cuban man powers car with charcoal

https://reuters.com/business/energy/facing-us-oil-blockade-cuban-man-powers-car-with-charcoal-202...
2•1e1a•7m ago•1 comments

Jeditek.net

https://jeditek.com.au
1•stephenorazi•9m ago•1 comments

Streaks

https://lopespm.com/notes/2026/03/22/streaks.html
1•lopespm•9m ago•0 comments

OpenClaw Is a Security Nightmare Dressed Up as a Daydream

https://composio.dev/content/openclaw-security-and-vulnerabilities
1•fs_software•10m ago•0 comments

March, 19-21: God is a comedian

https://no01.substack.com/p/march-19-21-god-is-a-comedian
1•robin_reala•10m ago•0 comments

Microsoft rolls back some of its Copilot AI bloat on Windows

https://techcrunch.com/2026/03/20/microsoft-rolls-back-some-of-its-copilot-ai-bloat-on-windows/
2•cratermoon•11m ago•1 comments

The U.S. Ammo Shortage Is Worse Than You Think

https://www.wsj.com/opinion/the-u-s-ammo-shortage-is-worse-than-you-think-97096193
3•Teever•11m ago•0 comments

China is wrestling with a novel phenomenon: inherited wealth

https://www.google.com/url?q=https://www.economist.com/briefing/2026/03/12/china-is-wrestling-wit...
2•mooreds•14m ago•1 comments

How to Not Get Hacked Through File Uploads

https://www.eliranturgeman.com/2026/03/14/uploads-attack-surface/
1•fs_software•14m ago•0 comments

Prosecutor notes explicitly state Jean Luc Brunel offered to cooperate

https://www.wsj.com/us-news/epstein-accomplice-brunel-evidence-6693cb70
2•Jimmc414•15m ago•0 comments

Genome modelling and design across all domains of life with Evo 2

https://www.nature.com/articles/s41586-026-10176-5
1•tiborsaas•16m ago•0 comments

Even an AI Needs a Diary

https://adventuresinclaude.ai/posts/even-an-ai-needs-a-diary
1•mooreds•16m ago•0 comments

How cross-thread double free detection could work in glibc malloc

https://kallus.org/blog_tcache_key.html
1•bkallus•16m ago•0 comments

Style Is a Consistent Constraint

https://stephango.com/style
1•rdegges•16m ago•0 comments

New development in PRNG of wyhash: w1rand

https://github.com/wangyi-fudan/wyhash
1•jinyu2026•17m ago•1 comments

US Job Market Visualizer

https://github.com/karpathy/jobs
2•lopespm•17m ago•0 comments

Show HN: AI Prompts for DPC Practice Ops – one found $18.6K/mo in billing leaks

https://altmaniac4.gumroad.com/l/uzyimr
1•LabSageMD•20m ago•0 comments

Show HN: Threejs 3D wireframe stylizing tool – Generate infinite variations

https://github.com/Lywald/Wireframed.js
1•ycosynot•20m ago•0 comments

Microsoft considers legal action over $50B Amazon-OpenAI cloud deal

https://www.reuters.com/technology/microsoft-weighs-legal-action-over-50-billion-amazon-openai-cl...
3•indigodaddy•23m ago•0 comments

Iran will close strait of Hormuz if Trump acts on 48 hour infrastructure threat

https://www.theguardian.com/world/live/2026/mar/22/middle-east-crisis-live-iran-war-trump-ultimat...
10•Jimmc414•24m ago•0 comments

80% of School Is a Waste of Time – Will AI Change It?

https://www.zappable.com/p/the-case-against-education-podcast
2•paulpauper•24m ago•0 comments

The Program That's Turning Schools Around

https://www.theatlantic.com/education/2026/01/texas-education-community-schools/685703/
1•paulpauper•25m ago•0 comments

Inside Maven, Palantir's Military Brain Built on Claude

https://www.linkedin.com/pulse/inside-maven-palantirs-military-brain-built-claude-anthony-maio-bd6ee
1•geox•27m ago•0 comments

CEOs Don't Steer (2017)

https://www.ribbonfarm.com/2017/11/09/ceos-dont-steer/
1•krrishd•28m ago•0 comments

Why I love NixOS

https://www.birkey.co/2026-03-22-why-i-love-nixos.html
19•birkey•29m ago•7 comments

Is Argon2 better than Bcrypt?

https://pilcrowonpaper.com/blog/14
2•mooreds•30m ago•0 comments

Vibe Coding Is a Security Disaster That Is About to Happen

https://medium.com/@jostfaganel/vibe-coding-is-a-security-disaster-that-is-about-to-happen-9f72f3...
3•jfaganel99•31m ago•1 comments