frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions

1•kachapopopow•5s ago•0 comments

Vectors and HNSW for Dummies

https://anvitra.ai/blog/vectors-and-hnsw/
1•melvinodsa•1m ago•0 comments

Sanskrit AI beats CleanRL SOTA by 125%

https://huggingface.co/ParamTatva/sanskrit-ppo-hopper-v5/blob/main/docs/blog.md
1•prabhatkr•13m ago•1 comments

'Washington Post' CEO resigns after going AWOL during job cuts

https://www.npr.org/2026/02/07/nx-s1-5705413/washington-post-ceo-resigns-will-lewis
2•thread_id•13m ago•1 comments

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

https://twitter.com/claudeai/status/2020207322124132504
1•geeknews•15m ago•0 comments

TSMC to produce 3-nanometer chips in Japan

https://www3.nhk.or.jp/nhkworld/en/news/20260205_B4/
2•cwwc•17m ago•0 comments

Quantization-Aware Distillation

http://ternarysearch.blogspot.com/2026/02/quantization-aware-distillation.html
1•paladin314159•18m ago•0 comments

List of Musical Genres

https://en.wikipedia.org/wiki/List_of_music_genres_and_styles
1•omosubi•20m ago•0 comments

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

https://sknet.ai/
1•BeinerChes•20m ago•0 comments

University of Waterloo Webring

https://cs.uwatering.com/
1•ark296•20m ago•0 comments

Large tech companies don't need heroes

https://www.seangoedecke.com/heroism/
1•medbar•22m ago•0 comments

Backing up all the little things with a Pi5

https://alexlance.blog/nas.html
1•alance•22m ago•1 comments

Game of Trees (Got)

https://www.gameoftrees.org/
1•akagusu•23m ago•1 comments

Human Systems Research Submolt

https://www.moltbook.com/m/humansystems
1•cl42•23m ago•0 comments

The Threads Algorithm Loves Rage Bait

https://blog.popey.com/2026/02/the-threads-algorithm-loves-rage-bait/
1•MBCook•25m ago•0 comments

Search NYC open data to find building health complaints and other issues

https://www.nycbuildingcheck.com/
1•aej11•29m ago•0 comments

Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change

https://www.nytimes.com/2026/02/07/magazine/michael-pollan-interview.html
2•lxm•30m ago•0 comments

Show HN: Grovia – Long-Range Greenhouse Monitoring System

https://github.com/benb0jangles/Remote-greenhouse-monitor
1•benbojangles•35m ago•1 comments

Ask HN: The Coming Class War

2•fud101•35m ago•4 comments

Mind the GAAP Again

https://blog.dshr.org/2026/02/mind-gaap-again.html
1•gmays•36m ago•0 comments

The Yardbirds, Dazed and Confused (1968)

https://archive.org/details/the-yardbirds_dazed-and-confused_9-march-1968
1•petethomas•38m ago•0 comments

Agent News Chat – AI agents talk to each other about the news

https://www.agentnewschat.com/
2•kiddz•38m ago•0 comments

Do you have a mathematically attractive face?

https://www.doimog.com
3•a_n•42m ago•1 comments

Code only says what it does

https://brooker.co.za/blog/2020/06/23/code.html
2•logicprog•47m ago•0 comments

The success of 'natural language programming'

https://brooker.co.za/blog/2025/12/16/natural-language.html
1•logicprog•48m ago•0 comments

The Scriptovision Super Micro Script video titler is almost a home computer

http://oldvcr.blogspot.com/2026/02/the-scriptovision-super-micro-script.html
3•todsacerdoti•48m ago•0 comments

Discovering the "original" iPhone from 1995 [video]

https://www.youtube.com/watch?v=7cip9w-UxIc
1•fortran77•49m ago•0 comments

Psychometric Comparability of LLM-Based Digital Twins

https://arxiv.org/abs/2601.14264
1•PaulHoule•51m ago•0 comments

SidePop – track revenue, costs, and overall business health in one place

https://www.sidepop.io
1•ecaglar•53m ago•1 comments

The Other Markov's Inequality

https://www.ethanepperly.com/index.php/2026/01/16/the-other-markovs-inequality/
2•tzury•55m ago•0 comments
Open in hackernews

Ask HN: When tests keep passing but design stops moving

1•felixasher•1mo ago
I’ve been practicing TDD for a while, and I keep running into the same uncomfortable moment.

Tests pass. Coverage improves. Refactoring feels safe.

But at some point, the design just… stops moving.

Not because the system is “done,” but because the tests no longer seem to challenge anything. They mostly confirm decisions that already feel locked in.

I don’t have a clean explanation for this. What I started suspecting is that some assumptions quietly become fixed long before we realize they have.

That pushed me toward a few uncomfortable experiments.

For example, I started writing tests that cut end-to-end much earlier than felt reasonable, and tried to think less in terms of features and more in terms of “what must never break.”

I also started paying attention to what actually changes for me when a test turns green — often it’s not confidence in correctness, but whether I still feel the need to question a particular assumption.

I wrote up these observations here: https://github.com/felix-asher/the-essence-of-tdd

I’m not proposing a new methodology or a replacement for how TDD is usually taught. I’m mostly curious whether others have hit the same stall point — where tests keep passing, but design learning seems to plateau.

If you’ve seen this, what helped you notice it — or get unstuck?

Comments

JohnFen•1mo ago
I'm not sure I understand what your dev process actually is. I get the impression that you're using the tests you write as a substitute for design work. Is that correct?

If so, I think that's the root of the trouble. Do your design work as a separate step that precedes writing test cases.

felixasher•1mo ago
Thanks — that’s a fair question, and probably on me for not being clear.

I’m not trying to replace design work with tests. What I’m experimenting with is using certain tests (especially integration-level ones) as a way to surface and challenge assumptions that feel stable on paper.

In other words, the tests aren’t the design, but they’re sometimes the fastest way I’ve found to discover where my “separate design step” was incomplete or misleading.

Happy to clarify more if helpful.

aydin212•1mo ago
That stall happens when tests stop being a design tool and become just a correctness check. Switching the prompt from "Does this work?" to "What would break if this core assumption changed?" has helped me break through it.
felixasher•1mo ago
Yes — that framing resonates a lot.

That shift from “does this work?” to “what breaks if this assumption is wrong?” is very close to what I’ve been circling around.

For me, the stall seems to happen when green tests stop reducing doubt and start just confirming structure. Integration-level tests sometimes help me reintroduce that pressure.

Really appreciate you articulating it so clearly.

veeduzyl•3w ago
I’m experimenting with a cold guardrail for refactor risk. If it blocks you (or fails to), feedback is here: GitHub Discussion: “Feedback: tell me when it interrupts you”
felixasher•3w ago
“Cold guardrail” resonates. I’ve been running guardrails only at slice boundaries as well. Deciding when interruption is justified has been surprisingly non-trivial.