frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•11mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•11mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•11mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•11mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

What do flying cars and AI innovation have in common?

https://medium.com/@groundtruthpost/everyone-is-building-flying-cars-60451bf36a91
1•thefeedbackloop•45s ago•0 comments

Worse Is Better

https://en.wikipedia.org/wiki/Worse_is_better
1•jermaustin1•1m ago•0 comments

The Bitter Lesson of Agentic Coding

https://agent-hypervisor.ai/posts/bitter-lesson-of-agentic-coding/
1•peterzat•2m ago•0 comments

The sonic anatomy of a double tap strike

https://earshotngo.substack.com/p/the-sonic-anatomy-of-a-double-tap
1•moxifly7•2m ago•0 comments

I froze a TCP connection for 10 minutes to migrate a live server

https://github.com/DongSunchao/libccmc
1•sunchaodong•2m ago•1 comments

The United States Is Repeating Its Silicon Mistake with Gallium Nitride

https://warontherocks.com/cogs-of-war/the-united-states-is-repeating-its-silicon-mistake-with-gal...
1•crescit_eundo•3m ago•0 comments

Wait Is Over – Coreboot on the AMD StarBook – Star Labs

https://it.starlabs.systems/blogs/news/coreboot-on-the-amd-starbook-finally
1•g-b-r•3m ago•1 comments

I'm Sorry, Dave. I'm Afraid I Can't De-Escalate: On (AI) Wargaming, Nuclear War

https://warontherocks.com/im-sorry-dave-im-afraid-i-cant-de-escalate-on-ai-wargaming-and-nuclear-...
2•crescit_eundo•3m ago•0 comments

GridMove for macOS: Move or snap windows by dragging from anywhere inside them

https://github.com/mirtlecn/GridMoveForMac/
1•mirtle•4m ago•0 comments

Nobel Lecture: On the possibility of progress (2019)

https://paulromer.net/prize/
1•ipnon•4m ago•0 comments

We OCR'ed 30k papers using Codex, open OCR models and Jobs

https://huggingface.co/blog/nielsr/ocr-papers-jobs
1•speckx•4m ago•0 comments

Consider the Chairmaker

https://ben.stolovitz.com/posts/consider-the-chairmaker/
1•citelao•4m ago•1 comments

The most underrated distribution channel in SaaS is hiding in browser toolbar

https://www.indiehackers.com/post/the-most-underrated-distribution-channel-in-saas-is-hiding-in-y...
1•max_flowly_run•5m ago•0 comments

Turing Award Winner - Mike Stonebraker: Postgres, Disagreeing with Google [video]

https://www.youtube.com/watch?v=YPObBOwIrHk
1•abkolan•5m ago•0 comments

Show HN: A stateless search proxy using Cloudflare Workers

https://github.com/logotam-app/stateless-search-proxy
1•vovanidze•5m ago•0 comments

The Timelessness of TUIs

https://xit-vcs.github.io/xitlog/the-timelessness-of-tuis.html
2•xeubie•6m ago•0 comments

Websites break California privacy law at 'industrial scale,' survey finds

https://calmatters.org/economy/technology/2026/04/data-privacy-opt-outs/
1•cdrnsf•7m ago•0 comments

Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return

https://techcrunch.com/2026/04/20/anthropic-takes-5b-from-amazon-and-pledges-100b-in-cloud-spendi...
3•Brajeshwar•8m ago•0 comments

AI Tool Rips Off Open Source Software Without Violating Copyright

https://www.404media.co/this-ai-tool-rips-off-open-source-software-without-violating-copyright/
1•cdrnsf•9m ago•0 comments

Adobe Unveils Agents for Businesses Amid Threat of AI Disruption

https://www.wsj.com/cio-journal/adobe-unveils-agents-for-businesses-amid-threat-of-ai-disruption-...
2•JumpCrisscross•10m ago•0 comments

Scaling Codex to Enterprises Worldwide

https://openai.com/index/scaling-codex-to-enterprises-worldwide/
3•salkahfi•10m ago•0 comments

Show HN: Mulder – Containerized MCP server for digital forensics investigations

https://github.com/calebevans/mulder
2•calebevans•10m ago•0 comments

OpenAI Is Working with Consultants to Sell Codex

https://www.wsj.com/cio-journal/openai-is-working-with-consultants-to-sell-codex-f355b1b9
3•littlexsparkee•11m ago•0 comments

Jeff Bezos' Blue Origin blasts customer's satellite into wrong place

https://nypost.com/2026/04/20/business/jeff-bezos-blue-origin-blasts-satellite-into-wrong-place/
1•1vuio0pswjnm7•12m ago•0 comments

AI-conducted FRB study finds two emission regions at 9.2σ. ApJ halted it

https://blankline.org/newsroom/ai-frb-paper-apj-halted
2•DarenWatson•13m ago•1 comments

Show HN: SimCast – Control iOS Simulators remotely with real-time interaction

https://www.simcast.dev/
2•florinmatinca•14m ago•1 comments

MakerCookS – free online profolio for verified revenue and cost only

https://makercooks.com
1•Gule•14m ago•1 comments

Tindie store under "scheduled maintenance" for days

https://www.tindie.com/
2•somemisopaste•14m ago•0 comments

How to maintain flow and keep your momentum, How to Live, time and schedules

https://sive.rs/2020-03-flow
1•theorchid•15m ago•0 comments

Boundary Work, Not Castles

https://linuxtoaster.com//blog/boundary-work-not-castles.html
1•dirk94018•16m ago•1 comments