frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•10mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•10mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•10mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•10mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Think Twice Before Buying or Using Meta's Ray-Bans

https://www.eff.org/deeplinks/2026/03/think-twice-buying-or-using-metas-ray-bans
1•hn_acker•24s ago•0 comments

Anthropic gives lesson in AI revenue hallucination

https://www.reuters.com/commentary/breakingviews/anthropic-gives-lesson-ai-revenue-hallucination-...
1•latinodev•4m ago•1 comments

Production query plans without production data

https://boringsql.com/posts/portable-stats/
1•birdculture•8m ago•0 comments

Build a deep researcher and learn DSPy Signatures and Modules

https://www.cmpnd.ai/blog/learn-dspy-deep-research.html
1•dbreunig•9m ago•0 comments

AI Is Making Libraries Obsolete

https://maho.dev/2026/03/ai-is-making-libraries-obsolete/
1•mahoivan•10m ago•0 comments

Singularity Is Around?

1•essekar•11m ago•0 comments

Do YC companies all use the top sales tools?

1•justin_cheu•12m ago•0 comments

Deleted Tweet from Energy Secretary Sends Oil Markets on Another Wild Ride

https://www.wsj.com/finance/stocks/deleted-tweet-from-energy-secretary-sends-oil-markets-on-anoth...
1•petethomas•13m ago•0 comments

Evolving the Node.js Release Schedule

https://nodejs.org/en/blog/announcements/evolving-the-nodejs-release-schedule
1•suresh70•13m ago•0 comments

DOGE employee stole Social Security data and put it on a thumb drive

https://techcrunch.com/2026/03/10/doge-employee-stole-social-security-data-and-put-it-on-a-thumb-...
8•elsewhen•17m ago•1 comments

Claude Opus 4.6 generated a YouTube poop video with a single prompt

https://twitter.com/josephdviviano/status/2031196768424132881
1•dokdev•17m ago•1 comments

Build a "Deep Data" MCP Server to Connect LLMs to Your Local Database in 10min

https://root-ai.beehiiv.com/p/build-a-deep-data-mcp-server-to-connect-llms-to-your-local-database...
1•mehdikbj•19m ago•0 comments

Aaron Swartz and the Return of Jottit

https://jottit.org/
1•shanselman•19m ago•1 comments

A Special AMD Ryzen AM5 Motherboard for Linux / Open-Source Enthusiasts

https://www.phoronix.com/review/msi-pro-b850p-wifi
3•RachelF•19m ago•0 comments

Side questions with /btw in Claude Code

https://code.claude.com/docs/en/interactive-mode
2•mfiguiere•22m ago•0 comments

Mathematics is undergoing the biggest change in its history

1•Stratoscope•23m ago•0 comments

SaaSpocalypse Now

https://hantverkskod.se/2026/03/01/saaspocalypse/
1•mosura•24m ago•0 comments

Classifying email providers of 2000 Swiss municipalities via DNS

https://mxmap.ch/
1•notmine1337•26m ago•0 comments

I Ching or Book of Changes

https://iching.r053.org/
1•tzury•27m ago•0 comments

I Got Root on Meta AI's Infrastructure Using a Chat Prompt

https://netguard24-7.com/blog/meta-ai-root
2•cybrdude•27m ago•0 comments

Chemists thought phosphorus had shown all its cards–until it surprised them

https://phys.org/news/2026-02-chemists-thought-phosphorus-shown-cards.html
2•PaulHoule•27m ago•0 comments

How to start coding with AI agents

https://www.paralect.com/academy/product-engineer/ai-agents-coding
1•igorkrasnik•28m ago•0 comments

Zero Point Energy

https://twitter.com/EagleworksSonny/status/2031128667019972616
1•Flere-Imsaho•29m ago•0 comments

Show HN: Repovex – GitHub repo health scores for your whole org

https://repovex.com
1•calminferno•35m ago•0 comments

Front End Memory Leaks: 500-Repo Static Analysis and 5-Scenario Benchmark Study

https://stackinsight.dev/blog/memory-leak-empirical-study/
1•nadis•38m ago•0 comments

Visual plasticity and exercise revisited: No evidence for a "cycling lane"

https://jov.arvojournals.org/article.aspx?articleid=2737222
2•amadeuspagel•40m ago•0 comments

Google and Tesla think we're managing the electrical grid all wrong

https://techcrunch.com/2026/03/10/google-and-tesla-think-were-managing-the-electrical-grid-all-wr...
1•jnord•40m ago•0 comments

I've no technical background, hope someone finds this interesting

https://github.com/aleflow420/rinoa
1•aleflow420•40m ago•0 comments

GLP-1 drugs push U.S. consumers toward spicy foods, lifting sauce makers

https://www.reuters.com/business/healthcare-pharmaceuticals/sauce-spice-makers-attract-deal-inter...
2•petethomas•40m ago•0 comments

Television and computer use and dementia risk in older adults

https://alz-journals.onlinelibrary.wiley.com/doi/10.1002/alz.71259
3•amadeuspagel•42m ago•0 comments