frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•1y ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•1y ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•1y ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•1y ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Luck Surface Area

https://blog.danwald.me/luck-surface-area
1•danwald•1m ago•0 comments

Demesne, Zanzibar-style authz compiled to RLS

https://github.com/foir-io/demesne
1•mattblr•1m ago•0 comments

Medical diagnosis AIs can be tricked into telling whose data trained them

https://www.theregister.com/ai-and-ml/2026/06/24/medical-diagnosis-ais-can-be-tricked-into-tellin...
1•Bender•2m ago•0 comments

Show HN: Hacker News Job Salary Trends

https://hacker-job.com/trends
1•timqian•2m ago•0 comments

Mythos discovers 'Squidbleed,' a memory leak thats gone undetected since Clinton

https://www.theregister.com/security/2026/06/23/mythos-discovers-squidbleed-a-memory-leak-thats-g...
1•Bender•2m ago•0 comments

GNU C/C++ Vector Extensions

https://gcc.gnu.org/onlinedocs/gcc/Vector-Extensions.html
1•pillmillipedes•3m ago•0 comments

Open Robotics Is Maturing

https://digitalcxo.com/article/open-robotics-is-maturing/
2•CrankyBear•4m ago•0 comments

Attaky – The ultimate modular ecosystem for everyone

https://attaky.com/
2•LorenDB•4m ago•0 comments

Drastically Reduce Stress with a Work Shutdown Ritual – Cal Newport

https://calnewport.com/drastically-reduce-stress-with-a-work-shutdown-ritual/
2•ankitg12•7m ago•0 comments

The AI Data Centre Legal Case That Could Eradicate Civil Rights

https://read.misalignedmag.com/the-ai-data-centre-legal-case-that-could-eradicate-civil-rights-c2...
2•lcubw•8m ago•0 comments

Why big AI labs are hiring so many philosophers

https://www.economist.com/science-and-technology/2026/06/24/why-big-ai-labs-are-hiring-so-many-ph...
4•Brajeshwar•8m ago•0 comments

What does your eval measure?

https://shash42.substack.com/p/what-does-your-benchmark-actually
2•shash42•8m ago•0 comments

Show HN: Tuip – CLI / TUI for checking SaaS vendors' statuses

https://github.com/ikan31/tuip
2•ahme•10m ago•0 comments

Loops Burn Tokens

https://www.wheresyoured.at/cargo-culture/
3•felixdoerp•11m ago•0 comments

Show HN: Gifhub, bug hunter that shows instead of tells

https://github.com/press-pass/gifhub
2•spmartin823•12m ago•0 comments

The Bargain. Or what America forgot and Europe still keeps

https://idle.news/blog/the-forgotten-bargain/
2•umilio•12m ago•0 comments

The Xteink X4 E-Ink Reader

https://blog.omgmog.net/post/xteink-x4-e-ink-reader/
2•felixdoerp•13m ago•0 comments

Sentrup – AI Customer Support Platform

2•sentrup•14m ago•0 comments

Exploiting vulnerabilities in Johnson and Johnson web apps

https://eaton-works.com/2026/06/24/jnj-webapp-hacks/
3•EatonZ•15m ago•0 comments

Show HN: Cutlistor – Instant cut list optimizer with 3D Model and PDF Import

https://www.cutlistor.com
2•xiyan•15m ago•0 comments

I crawled 827 employers' career sites to measure ATS market share

https://resumegeni.com/research/ats-market-share-2026
3•blakec•16m ago•0 comments

Germany's Kai Havertz: 'I make runs that look pointless but I'm creating space'

https://www.theguardian.com/football/2026/jun/24/kai-havertz-germany-world-cup-2026-interview
2•bookofjoe•16m ago•0 comments

Ask HN: How much coding should beginners learn in the AI era?

3•JohnDSDev•17m ago•0 comments

Show HN: Empowering codex/Claude Code with Aswath Damodaran valuation thinking

https://github.com/stockvaluation-io/stockvaluation_io
2•pradeep1177•17m ago•0 comments

Building a LoFi Radio

https://cieslak.dev/en/blog/2026-06-24-lofi/
2•cieslak•20m ago•1 comments

Show HN: Metaspec: The DpANS3R Common Lisp Spec in S-Expr and HTML Format

https://metaspec.dev/#
3•dlowe-net•20m ago•0 comments

Show HN: Browser based tool for programming ch57x macro-pads

https://pollrobots.com/cheese-tax.html
2•pacaro•22m ago•0 comments

Create cross-platform mobile apps with Ruby

https://ruflet.dev/
3•AdamMusaAly•22m ago•0 comments

Show HN: (Spotlight/Raycast for Web Search not local) && (compare AI responses)

https://uberninja.co/
2•healersource•24m ago•0 comments

How to Measure the ROI of FDE

https://jaygoel.com/posts/building-an-fde-motion/
3•memset•25m ago•0 comments