frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

1•kachapopopow•1m ago•0 comments

Vectors and HNSW for Dummies

https://anvitra.ai/blog/vectors-and-hnsw/
1•melvinodsa•3m ago•0 comments

Sanskrit AI beats CleanRL SOTA by 125%

https://huggingface.co/ParamTatva/sanskrit-ppo-hopper-v5/blob/main/docs/blog.md
1•prabhatkr•14m ago•1 comments

'Washington Post' CEO resigns after going AWOL during job cuts

https://www.npr.org/2026/02/07/nx-s1-5705413/washington-post-ceo-resigns-will-lewis
2•thread_id•15m ago•1 comments

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

https://twitter.com/claudeai/status/2020207322124132504
1•geeknews•16m ago•0 comments

TSMC to produce 3-nanometer chips in Japan

https://www3.nhk.or.jp/nhkworld/en/news/20260205_B4/
2•cwwc•19m ago•0 comments

Quantization-Aware Distillation

http://ternarysearch.blogspot.com/2026/02/quantization-aware-distillation.html
1•paladin314159•19m ago•0 comments

List of Musical Genres

https://en.wikipedia.org/wiki/List_of_music_genres_and_styles
1•omosubi•21m ago•0 comments

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

https://sknet.ai/
1•BeinerChes•21m ago•0 comments

University of Waterloo Webring

https://cs.uwatering.com/
1•ark296•22m ago•0 comments

Large tech companies don't need heroes

https://www.seangoedecke.com/heroism/
1•medbar•23m ago•0 comments

Backing up all the little things with a Pi5

https://alexlance.blog/nas.html
1•alance•24m ago•1 comments

Game of Trees (Got)

https://www.gameoftrees.org/
1•akagusu•24m ago•1 comments

Human Systems Research Submolt

https://www.moltbook.com/m/humansystems
1•cl42•24m ago•0 comments

The Threads Algorithm Loves Rage Bait

https://blog.popey.com/2026/02/the-threads-algorithm-loves-rage-bait/
1•MBCook•27m ago•0 comments

Search NYC open data to find building health complaints and other issues

https://www.nycbuildingcheck.com/
1•aej11•30m ago•0 comments

Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change

https://www.nytimes.com/2026/02/07/magazine/michael-pollan-interview.html
2•lxm•32m ago•0 comments

Show HN: Grovia – Long-Range Greenhouse Monitoring System

https://github.com/benb0jangles/Remote-greenhouse-monitor
1•benbojangles•36m ago•1 comments

Ask HN: The Coming Class War

2•fud101•36m ago•4 comments

Mind the GAAP Again

https://blog.dshr.org/2026/02/mind-gaap-again.html
1•gmays•38m ago•0 comments

The Yardbirds, Dazed and Confused (1968)

https://archive.org/details/the-yardbirds_dazed-and-confused_9-march-1968
1•petethomas•39m ago•0 comments

Agent News Chat – AI agents talk to each other about the news

https://www.agentnewschat.com/
2•kiddz•39m ago•0 comments

Do you have a mathematically attractive face?

https://www.doimog.com
3•a_n•44m ago•1 comments

Code only says what it does

https://brooker.co.za/blog/2020/06/23/code.html
2•logicprog•49m ago•0 comments

The success of 'natural language programming'

https://brooker.co.za/blog/2025/12/16/natural-language.html
1•logicprog•49m ago•0 comments

The Scriptovision Super Micro Script video titler is almost a home computer

http://oldvcr.blogspot.com/2026/02/the-scriptovision-super-micro-script.html
3•todsacerdoti•50m ago•0 comments

Discovering the "original" iPhone from 1995 [video]

https://www.youtube.com/watch?v=7cip9w-UxIc
1•fortran77•51m ago•0 comments

Psychometric Comparability of LLM-Based Digital Twins

https://arxiv.org/abs/2601.14264
1•PaulHoule•52m ago•0 comments

SidePop – track revenue, costs, and overall business health in one place

https://www.sidepop.io
1•ecaglar•55m ago•1 comments

The Other Markov's Inequality

https://www.ethanepperly.com/index.php/2026/01/16/the-other-markovs-inequality/
2•tzury•57m ago•0 comments
Open in hackernews

Ask HN: How are you preventing sloppy verification with AI-assisted coding?

1•rmnull•1mo ago
After the introduction of agentic code tools, the development speed has increased rapidly, but i have been struggling to keep up with the verification of these tools, since some of these things "just work"(reminds me of the old joke, "it compiles").

So i wanted to know whether this is a me problem, or others are also going through it and what your workflow for this looks like. but mostly I'm interested in the way you approach the work and the thought process behind that.

========

P.S: I'll leave the approaches i have tried before()

* always verify the work and only then approve. But this always leads me to a cognitive load and fried brain state at the end of the day. As a side effect producing poor quality work. And to resolve that i started approaching the work more slowly. This has brought down the development speed a lot but this has been good for my mental health.

* Other thing that i have been meaning to try is to get the development things done quickly, then spend another 1 or two days verifying things. This leads to continous iteration that i want and get the insights that only come after building.

* Other option i have also tried is, write tests and then refine ask it to generate code till those tests work, again the initial barrier entry becomes high, because there's so much of cases to be specified and generated and verified(this is the most reliable approach and happy path I've gotten till now that gives some sense of guarantee about whatever is built). Sometimes gain with agentic tools this initial verification has to be laid out clearly which consumes time and sometimes makes me wanna curse at AI because it misses out on somethings that i said(or i thought was clear with the spec)