Why Vibe Coding Fails

6•10keane•1h ago
i am using claude to maintain an agent loop, which pauses to ask for the user's approval before important tool calls. while doing some bug fixes, i have identified some clear patterns and reasons why vibe coding can fail for people who don't have technical knowledge and architecture expertise.

let me describe my workflow first - this has been my workflow across hundreds of successful sessions:
1. identify bugs through dogfooding
2. ask claude code to investigate the codebase for three potential root causes
3. paste the root causes and proposed fixes into the claude project where i store all architecture docs and design decisions, for it to evaluate
4. discuss with claude in the project to write a detailed task spec (the task spec has a specified format with all sorts of tests)
5. give it back to claude code to implement the fix

in today's session, the root cause analysis was still great, but the proposed fixes were so bad that i really think this is how most vibe-coded projects lose maintainability in the long run.

here are two of the root causes and their proposed fixes:

bug: the agent asks for user approval, but sometimes the approval popup doesn't show up. i tried sending a message to unstick it. the message got silently swallowed. the agent looked dead, and i needed to restart the entire thing.

claude's evaluation: root cause 1: the approval popup is sent once over a live connection. if the user's ui isn't connected at that moment — page refresh, phone backgrounded, flaky connection — they never see it. no retry, no recovery.

this is actually true.

proposed fix: "let's save approval state to disk so it survives crashes". sounds fine, but the key point is that, by design, if things crash, the agent will cold-resume from the session log, and it won't pick up the approval state anyway. the fix just adds schema complexity and is completely useless.
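to make root cause 1 concrete, here is a minimal sketch of one alternative that stays in memory and re-sends unanswered approvals whenever a ui reconnects. this is an illustration under assumptions, not the fix used in the project: `ApprovalBroker`, `request`, `on_reconnect`, `resolve`, and the `send` callback are all hypothetical names, and the state is deliberately lost on a crash, which matches the cold-resume design described above.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical transport callback: returns True if a UI received the prompt.
Send = Callable[[str, str], bool]


@dataclass
class ApprovalBroker:
    """Keeps unanswered approval requests in memory only (illustrative)."""

    # request_id -> prompt text; nothing here is written to disk.
    pending: Dict[str, str] = field(default_factory=dict)

    def request(self, request_id: str, prompt: str, send: Send) -> None:
        # Remember the request before trying to deliver it, so a failed
        # delivery (page refresh, backgrounded phone) is not silently lost.
        self.pending[request_id] = prompt
        send(request_id, prompt)

    def on_reconnect(self, send: Send) -> List[str]:
        # Called when a UI (re)connects: re-send everything still unanswered.
        resent = []
        for request_id, prompt in self.pending.items():
            if send(request_id, prompt):
                resent.append(request_id)
        return resent

    def resolve(self, request_id: str) -> bool:
        # The user approved or denied; stop re-sending this request.
        return self.pending.pop(request_id, None) is not None
```

the point of the sketch is only that retry on reconnect addresses the disconnected-ui case without adding any persisted schema.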

root cause 2: when an approval gets interrupted (daemon crash, user restart), there's an orphan tool_call in the session history with no matching tool_result.

proposed fix: "write a synthetic tool_result to keep the session file structurally valid." sounds clean. but i asked: who actually breaks on this? the LLM API? no, it handles missing results. the session replay? no, it reads what's there. the orphan tool_call accurately represents what happened: the tool was called but never completed. that's the truth. writing a fake result to paper over it introduces a new write-coordination concern (when exactly do you write the fake result? what if the daemon crashes during the write?) to solve a problem that doesn't exist. the session file isn't "broken." it's accurate.
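as a rough sketch of the "read what's there" approach, a replay step can detect orphan tool calls at read time instead of writing anything back to the session file. the event shape here (`type`, `id`, `call_id`) and the function name `find_orphan_tool_calls` are assumptions for illustration, not the project's real session format.

```python
from typing import Dict, Iterable, List


def find_orphan_tool_calls(events: Iterable[Dict]) -> List[str]:
    """Return ids of tool calls that have no matching tool_result.

    The session log is only read, never modified: an orphan simply means
    "the tool was called but never completed".
    """
    open_calls: Dict[str, None] = {}  # dict keeps first-seen order
    for event in events:
        if event.get("type") == "tool_call":
            open_calls[event["id"]] = None
        elif event.get("type") == "tool_result":
            # A result closes its call; unknown call ids are ignored.
            open_calls.pop(event.get("call_id"), None)
    return list(open_calls)


# Hypothetical session log: the second call was interrupted by a restart.
session_log = [
    {"type": "tool_call", "id": "call_1", "name": "read_file"},
    {"type": "tool_result", "call_id": "call_1", "output": "ok"},
    {"type": "tool_call", "id": "call_2", "name": "delete_branch"},
]

print(find_orphan_tool_calls(session_log))  # prints ['call_2']
```

because the orphan is derived on every read, there is no extra write to coordinate and nothing that can be left half-written by a crash.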

claude had full architecture docs, the codebase, and over a hundred sessions of project history in context. it still reaches for the complex solution because it LOOKS like good engineering. it never asked "does it even matter after a restart?"

i have personally encountered this preference for seemingly more robust over-engineering multiple times. and i genuinely believe that this is where human operators should actually step in, instead of giving a one-sentence requirement and watching agents do all sorts of "robust" engineering.

Comments

boesboes•1h ago
> because it LOOKS like good engineering

That is the whole problem imho. I've found that I can use LLMs to do programming only if I fully understand the problem and solution. Because if I don't, it will just pretend that I'm right and happily spend hours trying to implement a broken idea.

The problem is that it's very hard to know whether my understanding of something is sufficient to have claude propose a solution and for me to know if it is going to work. If my understanding of the problem is incorrect or incomplete, the plan will look fine to me, but it will be wrong.

If I start working on something from poor understanding, I will notice and improve my understanding. An LLM will just deceive and try to do the impossible anyway.

Also, it overcooks everything: at least 50-60% of the code it generates is pointlessly verbose abstractions. again: imho, ymmv, ianal, not financial advice ;)

10keane•1h ago
exactly. vibe coding only works when you fully understand the problem and know precisely how to solve it. ai just does the dirty implementation work for you.

that is another reason why i separate product/architecture design and implementation into two agents with isolated context in my workflow. because i can always iterate with the product agent to refine my understanding and THEN ask the coding agent to implement it. by that time i already have the ability to make a proper judgement and evaluate the coding agent's output.
