frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

The Rise of Spec Driven Development

https://www.dbreunig.com/2026/02/06/the-rise-of-spec-driven-development.html
1•Brajeshwar•4m ago•0 comments

The first good Raspberry Pi Laptop

https://www.jeffgeerling.com/blog/2026/the-first-good-raspberry-pi-laptop/
2•Brajeshwar•4m ago•0 comments

Seas to Rise Around the World – But Not in Greenland

https://e360.yale.edu/digest/greenland-sea-levels-fall
1•Brajeshwar•4m ago•0 comments

Will Future Generations Think We're Gross?

https://chillphysicsenjoyer.substack.com/p/will-future-generations-think-were
1•crescit_eundo•7m ago•0 comments

State Department will delete Xitter posts from before Trump returned to office

https://www.npr.org/2026/02/07/nx-s1-5704785/state-department-trump-posts-x
1•righthand•10m ago•0 comments

Show HN: Verifiable server roundtrip demo for a decision interruption system

https://github.com/veeduzyl-hue/decision-assistant-roundtrip-demo
1•veeduzyl•11m ago•0 comments

Impl Rust – Avro IDL Tool in Rust via Antlr

https://www.youtube.com/watch?v=vmKvw73V394
1•todsacerdoti•11m ago•0 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
2•vinhnx•12m ago•0 comments

minikeyvalue

https://github.com/commaai/minikeyvalue/tree/prod
3•tosh•17m ago•0 comments

Neomacs: GPU-accelerated Emacs with inline video, WebKit, and terminal via wgpu

https://github.com/eval-exec/neomacs
1•evalexec•22m ago•0 comments

Show HN: Moli P2P – An ephemeral, serverless image gallery (Rust and WebRTC)

https://moli-green.is/
2•ShinyaKoyano•26m ago•1 comments

How I grow my X presence?

https://www.reddit.com/r/GrowthHacking/s/UEc8pAl61b
2•m00dy•27m ago•0 comments

What's the cost of the most expensive Super Bowl ad slot?

https://ballparkguess.com/?id=5b98b1d3-5887-47b9-8a92-43be2ced674b
1•bkls•28m ago•0 comments

What if you just did a startup instead?

https://alexaraki.substack.com/p/what-if-you-just-did-a-startup
5•okaywriting•35m ago•0 comments

Hacking up your own shell completion (2020)

https://www.feltrac.co/environment/2020/01/18/build-your-own-shell-completion.html
2•todsacerdoti•37m ago•0 comments

Show HN: Gorse 0.5 – Open-source recommender system with visual workflow editor

https://github.com/gorse-io/gorse
1•zhenghaoz•38m ago•0 comments

GLM-OCR: Accurate × Fast × Comprehensive

https://github.com/zai-org/GLM-OCR
1•ms7892•39m ago•0 comments

Local Agent Bench: Test 11 small LLMs on tool-calling judgment, on CPU, no GPU

https://github.com/MikeVeerman/tool-calling-benchmark
1•MikeVeerman•40m ago•0 comments

Show HN: AboutMyProject – A public log for developer proof-of-work

https://aboutmyproject.com/
1•Raiplus•40m ago•0 comments

Expertise, AI and Work of Future [video]

https://www.youtube.com/watch?v=wsxWl9iT1XU
1•indiantinker•41m ago•0 comments

So Long to Cheap Books You Could Fit in Your Pocket

https://www.nytimes.com/2026/02/06/books/mass-market-paperback-books.html
3•pseudolus•41m ago•1 comments

PID Controller

https://en.wikipedia.org/wiki/Proportional%E2%80%93integral%E2%80%93derivative_controller
1•tosh•45m ago•0 comments

SpaceX Rocket Generates 100GW of Power, or 20% of US Electricity

https://twitter.com/AlecStapp/status/2019932764515234159
2•bkls•45m ago•0 comments

Kubernetes MCP Server

https://github.com/yindia/rootcause
1•yindia•46m ago•0 comments

I Built a Movie Recommendation Agent to Solve Movie Nights with My Wife

https://rokn.io/posts/building-movie-recommendation-agent
4•roknovosel•46m ago•0 comments

What were the first animals? The fierce sponge–jelly battle that just won't end

https://www.nature.com/articles/d41586-026-00238-z
2•beardyw•55m ago•0 comments

Sidestepping Evaluation Awareness and Anticipating Misalignment

https://alignment.openai.com/prod-evals/
1•taubek•55m ago•0 comments

OldMapsOnline

https://www.oldmapsonline.org/en
2•surprisetalk•57m ago•0 comments

What It's Like to Be a Worm

https://www.asimov.press/p/sentience
2•surprisetalk•57m ago•0 comments

Don't go to physics grad school and other cautionary tales

https://scottlocklin.wordpress.com/2025/12/19/dont-go-to-physics-grad-school-and-other-cautionary...
2•surprisetalk•57m ago•0 comments
Open in hackernews

AI coding tools are like that helpful but untrustworthy friend, devs say

https://www.theregister.com/2025/06/12/devs_mostly_welcome_ai_coding/
8•cliffly•8mo ago

Comments

pmarreck•8mo ago
Yep. Moments of sheer utility mixed with moments of "WTF were you 'thinking', if you can even call it that?'"

I've seen a lot of bad patterns, only some of which might be "trained out" with better training data in the future, and a lot of them revolve around testing:

1) Failure to stick to an existing valid test suite as a source of truth

2) Related: Failing to realize when the test is INvalid or has internally-inconsistent expectations (i.e., when to adjust the test)

3) Failure to run the full test suite right before declaring victory (you'd think it would be embarrassed... but it can't be embarrassed...)

4) Manually testing things instead of adding them to the test suite as test cases (which is almost always warranted)

5) When unable to solve something in a reasonable number of iterations, forcing the code to output what the test expects ("hardcoding the answer") instead of asking for help, then declaring partial victory (this one offended me the most, somehow, to the point that I was fascinated by how offended I was, like something I didn't even realize was sacred got violated)

6) Not sticking with TDD for more than 1 or 2 cycles before resorting to the above (this one is tragic because it would actually cause it to code better IMHO! Just like it would with the programmers who don't use it, creating the data it's training on! sigh)

7) not adhering to emphasized instructions (there's no way to "exclamation-point" certain directives without simply repeating them and/or using all-caps or threats etc... which is silly)

8) Writing a bunch of one-off test scripts/test data, and then not cleaning up after itself or merging those into the main test suite (if warranted)... It has ZERO sense of a "clean project directory" and that means I have to spend additional cycles of MY time either instructing it what to clean up (and hoping for the best) or manually going through everything it's produced and deciding what to keep or what to toss. Often these were artifacts that were valuable during intermediate steps, but are no longer, at least in a "this PR is wrapped up and ready for prod" sense.

In short, it knows the price of everything, but the value of nothing. As Sundar Pichai recently termed it, this is "artificial jagged intelligence (AJI)"

[shameless self-promo: I'm currently looking for interesting work, ping me, contact info in profile.]

tyleo•8mo ago
I like these tools and use them on a daily basis. That being said, the claimed benefits to productivity are way overblown. I find some folks wanting to cram them into every step of the dev process like they are some panacea.

They are a great boost but I think folks need to fit them in where they help naturally rather than cramming them into every nook and cranny.

bl4ck1e•8mo ago
I gave it a red hot try, ended up just turning off all the fancy predictive features, tried agentic mode... wasn't a fan, I still use Copilot occasionally to "rubber duck" ideas, and get some pointers on bugs...

I don't know, I think I'm missing something.

oytis•7mo ago
Well, everyone on the internet is 10x productive with these tools, so it must be on you.
matt3D•8mo ago
I think we need to start being more nuanced in the way we describe "AI Coding tools".

In the same way Claude Code is a different beast to Cursor, my own process is a different beast to Claude Code and the months I've spent building out a robust pipeline is now paying dividends.

I also think someone at The Register needs to go on a statistics course. Those figures seem to paint the picture that an overwhelming majority of those surveyed have had positive outcomes, which I don't think is represented by the slightly snarky headline.

JohnFen•7mo ago
Snark is what The Register does, and does well.

That said, their headline does say that devs find the tools helpful, so I don't think they're misrepresenting anything.