frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•1y ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•1y ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•1y ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•1y ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Gaza's Children

https://gazaschildren.com/
2•abdelhousni•2m ago•1 comments

The Lost World (1925) [video]

https://archive.org/details/the.-lost.-world.-1925.1080p.-blu-ray.x-264-sadpanda
1•petethomas•4m ago•0 comments

New serious vulnerabilities spiked around release of Claude Mythos Preview

https://epoch.ai/data-insights/cve-severity-spike
1•cubefox•5m ago•0 comments

Show HN: Pulse v0.2.0

2•xerrs•6m ago•0 comments

AI inference is obviously profitable

https://www.seangoedecke.com/ai-inference-is-obviously-profitable/
1•emirb•6m ago•1 comments

Africans Are Turning to Starlink

https://www.economist.com/middle-east-and-africa/2026/07/02/africans-are-turning-to-starlink
6•bookofjoe•12m ago•1 comments

Ads in ChatGPT

https://ads.openai.com/
3•vlan121•13m ago•2 comments

RememberLI

https://github.com/KlausSchaefers/rememberli
1•klausschaefers•14m ago•0 comments

Special forces ban Volvo/Chinese electric cars over spying fears

https://www.telegraph.co.uk/news/2026/07/03/special-forces-bans-chinese-cars-spying-fears-volvo/
3•cwwc•14m ago•0 comments

Show HN: Mlx-serve – LLM inference server for Apple Silicon, written in Zig

https://mlxserve.com/
1•ddalcu•15m ago•1 comments

MiniKotlin – A Kotlin Compiler That Runs in a Browser Tab

https://minikotlin.run
1•TheWiggles•17m ago•0 comments

Show HN: ContextCodeCache in Rust

https://github.com/colwill/ccc
1•colwont•17m ago•0 comments

Show HN: Maestro – scaffold Go microservices and keep them in sync

https://github.com/Zagforge-Org/maestro
1•anzedev•18m ago•1 comments

Collabora Office 26.04 Keeps AI Optional and Refines Writer and Calc

https://itsfoss.com/news/collabora-office-26-04/
1•mmarian•19m ago•0 comments

Mistralai/Leanstral-1.5-119B-A6B

https://huggingface.co/mistralai/Leanstral-1.5-119B-A6B
1•satvikpendem•20m ago•0 comments

Meta AI chief says their coming LLM has caught up with OpenAI's flagship model

https://www.businessinsider.com/meta-ai-model-catches-up-openai-gpt-5-says-2026-7
2•maxloh•20m ago•0 comments

Sumit Rana to Step Away from Epic

https://www.healthcareittoday.com/2026/07/03/breaking-news-sumit-rana-to-step-away-from-epic/
1•Forge36•24m ago•0 comments

Ask HN: What did you fail at and what did you learn from it?

2•basilikum•24m ago•0 comments

Jj v0.43.0 Released

https://github.com/jj-vcs/jj/releases/tag/v0.43.0
1•birdculture•26m ago•0 comments

Camera with transparent display launches for the equivalent of $29

https://www.notebookcheck.net/Camera-with-transparent-display-launches-for-the-equivalent-of-29.1...
2•yread•26m ago•1 comments

Congressman says hack of his Signal account proves app is unsecure. Is it true?

https://san.com/cc/congressman-says-hack-of-his-signal-account-proves-app-is-unsecure-is-it-true/
3•devonnull•30m ago•1 comments

Show HN: How clanker are you? A reverse Turing test

https://howclankerareyou.com/
3•niklio•36m ago•1 comments

Ross Spiral Curriculum

https://spiral.ross.org/spiral/#/
1•el3ctron•38m ago•0 comments

Palantir and the NHS – things you need to know

https://theconversation.com/palantir-and-the-nhs-10-things-you-need-to-know-281165
1•abdelhousni•38m ago•1 comments

Applied Category Theory Course (2018)

https://math.ucr.edu/home/baez/act_course/index.html
2•measurablefunc•39m ago•0 comments

A Runtime Modulation Layer for Large Language Models

https://github.com/divinecanon/signalengine-EN-
1•w89780175•40m ago•0 comments

OpenCode, Pi, and Goose: Three Layers of the AI Agent Stack

https://gist.github.com/AIMOWAY/bd8007c8f834a9bc83c71e3178239d75
1•AIMOWAY•42m ago•0 comments

Espionage Against the European Parliament

https://citizenlab.ca/research/member-of-committee-investigating-spyware-hacked-with-pegasus/
26•ledoge•42m ago•0 comments

Giving a domain a hill to climb: benchmarking as data activation

https://sparsethought.com/2026/07/03/benchmarking-as-data-activation/
3•galsapir•48m ago•1 comments

AI Is Boring

2•sverp•49m ago•1 comments