frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•1y ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•1y ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•1y ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•1y ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Why You Actually Want Machines Writing the Code for Your Next Flight

https://decodingvibes.com/blog/why-you-actually-want-machines-writing-the-code-for-your-next-flight/
1•altmanaltman•4m ago•0 comments

South Korea Exploring Using Hyundai Robots as Army Numbers Fall

https://www.bloomberg.com/news/articles/2026-05-11/south-korea-exploring-using-hyundai-robots-as-...
1•petethomas•8m ago•0 comments

Growling in a corner: Samuel Johnson's lost years

https://www.commonreader.co.uk/p/growling-in-a-corner-samuel-johnsons
1•pepys•10m ago•0 comments

Europe Is Losing Its Best Engineers – Not to Emigration, but to Management

https://andrulis.de/blog/20260429_management.html
1•taubek•12m ago•0 comments

Iran mulls taking control of all 7 cables passing through Strait of Hormuz

https://www.wionews.com/world/iran-to-take-full-control-of-all-7-undersea-internet-cables-passing...
2•jonah•15m ago•0 comments

The Trouble with Narrative History

https://thereader.mitpress.mit.edu/the-trouble-with-narrative-history/
2•Hooke•16m ago•0 comments

Geography Is Four-Dimensional

https://sive.rs/4d
1•Curiositry•16m ago•0 comments

Visual Generation Unlocks Human-Like Reasoning Through Multimodal World Models

https://arxiv.org/abs/2601.19834
2•felineflock•18m ago•0 comments

Blink – AI Assistant

https://blink-oi.vercel.app
1•Pascal1997•18m ago•0 comments

Neural Machine Perception

https://openstrate.com/
1•realitymatrixyz•20m ago•0 comments

2-mile long WAR.GOV/UFO Microfilm reel

https://hypergrid.systems/war.gov-ufo-viewer/microfilm2?frame=12404&page=12404
1•keepamovin•20m ago•0 comments

The SEC plans to end quarterly reporting

https://keepitquarterly.org/
1•froglop•28m ago•0 comments

I scanned 100 random Supabase projects. 22% leak user data anonymously

https://perufitlife.github.io/supabase-security-skill/blog/scanned-100-supabase-projects.html
1•renzom13•30m ago•0 comments

When GPT 5.5 flags your chat for possible cybersecurity risk–ask it to help you

https://martin.wojtczyk.de/2026/05/11/when-gpt-5-5-flags-your-chat-for-possible-cybersecurity-ris...
1•wojtczyk•34m ago•0 comments

The Vercel breach wasn't just a hack, it was a trust failure

https://www.inc.com/heather-wilde/the-vercel-breach-wasnt-just-a-hack-it-was-a-trust-failure/9133...
2•bobrenze•36m ago•0 comments

The future of work isn't human vs. AI, it's human with AI

https://www.inc.com/heather-wilde/the-future-of-work-isnt-human-vs-ai-its-human-with-ai/91335123
1•bobrenze•38m ago•0 comments

7 lines of code, 3 minutes: Implement a programming language (2010)

https://matt.might.net/articles/implementing-a-programming-language/
5•azhenley•40m ago•0 comments

Microbenchmark-Driven Analytical Performance Modeling Across Modern GPUs

https://arxiv.org/abs/2605.04178
1•matt_d•52m ago•0 comments

Baidu ERNIE 5.1 just dropped

https://ernie.baidu.com/
2•pretext•55m ago•0 comments

RPCS3 says "learn to code" as it bans AI agents from project

https://www.neowin.net/news/rpcs3-says-learn-to-code-as-it-bans-ai-agents-from-project/
1•bundie•1h ago•0 comments

I want to archive but DMCA stopped me

1•quanvm0501alt1•1h ago•0 comments

The eye in your pocket: digital devices are made to track you

https://aeon.co/essays/things-have-jobs-and-digital-devices-are-made-to-track-you
3•the-mitr•1h ago•0 comments

PyTorch DevLog

https://docs.pytorch.org/devlogs/
2•matt_d•1h ago•0 comments

Lie-to-Children

https://en.wikipedia.org/wiki/Lie-to-children
3•o4c•1h ago•0 comments

I reverse engineered macOS to disable built-in display

https://frankster0542.gumroad.com/l/saafi
2•fkusiapp•1h ago•0 comments

Ask HN: How much does Gemini API cost for a simple n8n workflow?

1•Meld5792•1h ago•0 comments

MCP server that gives a forensic verdict on biopharma catalyst plays

https://github.com/yesc97/biopharma-catalyst-mcp
2•yesc97•1h ago•0 comments

Addressing GitHub's recent availability issues

https://github.blog/news-insights/company-news/addressing-githubs-recent-availability-issues-2/
3•mvdtnz•2h ago•3 comments

Plenty of Hours in the Day

https://www.wsj.com/arts-culture/books/big-time-review-plenty-of-hours-in-the-day-d3744c1a
1•lxm•2h ago•0 comments

Show HN: Kheeper, a registry designed for bootable images

https://kheeper.com/
2•areed•2h ago•0 comments