frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Ask HN: What home printer do you use/recommend?

10•niyazpk•2h ago•8 comments

How to find AI-conservative companies to work for?

7•tossitawayplz•3h ago•2 comments

Ask HN: Anthropic banned me from using Claude Code and I don't know what to do

67•ayi•17h ago•82 comments

Ask HN: Are people generally interested using LLMs for learning purposes?

4•iknownthing•5h ago•8 comments

Ask HN: Am I missing something with AI

4•vasko•10h ago•9 comments

Overfitted a 900KB Transformer to Compress a 100MB CSV into 7MB

5•spidy__•11h ago•2 comments

Ask HN: Do you have an unusual income source

45•xupybd•1d ago•33 comments

Tell HN: I never bought anything from clicking on a paid ad

18•julienreszka•22h ago•17 comments

Ask HN: Will programmers write more efficient code during the memory shortage?

152•amichail•4d ago•244 comments

Ask HN: New clean macOS install. Must-have apps? Best browser?

14•simonebrunozzi•12h ago•17 comments

Open source, global vs. proprietary but for US in US, which is fundable in SaaS?

4•avijeetsingh16•18h ago•0 comments

Ask HN: Is anyone using the A2A protocol?

94•asim•5d ago•44 comments

Ask HN: What tools are you using for AI-assisted code review?

25•agos•5d ago•26 comments

Ask HN: What is your opinion on TUI applications

8•po1nt•1d ago•9 comments

Ask HN: Are people optimistic about the future?

40•JohnDSDev•3d ago•83 comments

Ask HN: What did you find out or explore today?

5•blahaj•1d ago•5 comments

Ask HN: Are you being "529 Overloaded" by Anthropic too?

8•hmokiguess•1d ago•8 comments

Ask HN: Fda.gov Down for You?

2•jmount•1d ago•2 comments

Ask HN: How close are we to local LLMs being useful? What's the impact?

6•AbstractH24•1d ago•6 comments

Ask HN: What would justify writting an OS kernel in 2026?

5•alonsovm44•2d ago•7 comments

Ask HN: I'm lost. How can I define ICP (Ideal Customer Profile)?

8•snowhy•5d ago•6 comments

Norrin – Git/ diff control in Claude Code

4•gagewoodard•2d ago•1 comments

Ask HN: Are You a Workaholic?

5•julienreszka•2d ago•5 comments

My Opinion on RL

3•umjunsik132•1d ago•1 comments

GitHub Banned All CI for Our (OSS) Org Because of Bad Drive-By Contributors

9•BlueMatt•1d ago•4 comments

Ask HN: Has Codex gotten slower recently?

6•aurenvale•21h ago•1 comments

Ask HN: How should I convert Microsoft Word documents to Markdown?

5•lkrubner•2d ago•7 comments

Ask HN: After you ship a feature, what happens to what you learned?

10•gaggle_dk•3d ago•13 comments

Ask HN: What technique do you use to make Claude Code deterministic?

8•hbarka•3d ago•11 comments

Ask HN: What do you care about? What is your joy and purpose?

12•bix6•3d ago•22 comments
Open in hackernews

Ask HN: Am I missing something with AI

4•vasko•10h ago
I constantly hear developers around me talk about how AI has completely changed their life and how they don't even program anymore, they just prompt. But any time I've used it, the output has always been off. And when the output is off I have to go and read through everything, learn how it works and fix it, which at that point I might as well write it myself.

I just don't understand what other people are seeing, I've mainly used Claude and ChatGPT, I got a free trial for premium but it's just underwhelming, their only use so far for me has been as a search engine, but they're a search engine that's wrong 20% of the time so even that use is questionable.

Comments

dejan_kocic•10h ago
I think AI is good for creating a foundation, then branching out and adding features, you shouldn't overdo it with AI.
morisil•10h ago
I am shocked how much my experience is different from yours. I wrote Claudine, my own version of Claude Code, almost 2 years ago. This experience gave me the understanding of how the technology works. Since then I've produced maybe 300k lines of open source code, and all of it meaningful to the bones. What kind of projects are you working on, maybe it's the specificity of your domain?
JohnFen•10h ago
You might be even more shocked to learn that the author's experience isn't rare.
Festro•10h ago
We're reaching a point currently where output quality is very much determined by input quality. Previously output quality was hampered fundamentally by model knowledge, hallucinations, and model quality.

Now, we have better knowledge of prompting as people have learnt what to say, models are better, models make use of memory from other conversations, they have skills written by humans or even themselves on how to do things, access to the internet to get live info, access to project files to check info, and the built in 'thinking' to challenge their own assumptions and loop on outputs until its refined.

You're right that output is always off still, but a lot of people have reached a point where it's only 'off' by an amount that is less than the effort required to do the task themselves, and considerably so.

My example today is prompting Claude to do a technical audit of a new client site.

It has skills for UX and SEO audits. Connects to an SEO tool. Pulls client info from OneDrive. Outputs to Word from a template for our agency. I even had it drive a remote pagespeed testing tool in Chrome because they don't have an MCP server currently.

Doing that report myself is 3.5-7 hours depending on what's found. Claude did it in 0.5 hours. Now I'm sorting out the oddities and anything that feels 'off'. I know and understand the full content of the report and can get on with actioning the recommendations or prioritising them for others. I've got maybe 1 hour of review and writing to do. It's not a 10x improvement but I'm happy with it.

Although, whilst Claude did it's bit I was doing other work. So, perhaps the multiplier is higher than I give it credit for.

vasko•8h ago
The way AI is able to interact with outside resources is pretty impressive, but the quality of code it produces to me is still questionable, more so in the larger scope, and the errors it produces are sometimes hard to catch because they're not normal human errors.

Recently I tried to get Claude to write a script that produces large amounts of code so I could profile a compiler. The script ended up outputing code that uses variables outside of their scope, didn't utilize like 90% of the features of the language, and basically ended up being something that I could make by spamming copy paste.

The script itself was also written in really weird way, utilizing recursion for pretty much everything when most of what it did could be done in simple loops. It ended up being a bit of a nightmare to fix and the entire time I was asking myself "why didn't I just write this in 30 minutes instead of going through all of this".

jr_isidore•10h ago
When you join a new company, is it faster to fix a bug rewriting everything from scratch or to modify what's there? Seriously, get your head out of your ass.
ex-aws-dude•2h ago
Are you treating it like a genie to build huge things in one shot or working on small incremental changes?

I’ve found the latter works way better

Festro•8h ago
I can't speak to coding as it's not my area but certainly the pattern I've spotted is that it's best at grunt work. That's where the time savings kick in.

Browsing sites, linking up data, spotting anomalies, writing documentation, formatting documents, etc.

If a task isn't repetitive or doesn't involve ingesting data, then I think the time savings shrink rapidly and the need for oversight increases massively. I think some people are managing to set up enough automated oversight to get round that, but it's adding a layer that multiplies your token usage to do so and still has no guarantee. But certainly all these layers being added are increasing success rates.

Andrei Karpathy is speaking about barely coding now. He has a bias, a comment from him like that is marketing for Anthropic, but I believe he's found some groove with his setup to achieve that.

I think the current status quo this month in 2026 we're at a point where the best tips and tricks to get usable answers out of ChatGPT a year ago have been consolidated into what we know call memory and skills in Claude and other agent harness type systems. You might need to explore those more, in fact I think for Claude Code/Cursor there are even more layers for checking outputs that I've not even seen in Claude Desktop.

And I think your exact issue, and the experience of the vast volumes of people who share it with you, are an audience that the app makers want to better convince. The free tiers and marketing sites are going to step up their game gradually and there will be new features that lower failure rates even more.

bawis•1h ago
>> Now, we have better knowledge of prompting as people have learnt what to say

Can you back up this claim? what do you mean exactly by "better knowledge" ?