frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•1y ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•1y ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•1y ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•1y ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Papa Johns Can Predict When Your Fridge Is Empty

https://www.adexchanger.com/tv/papa-johns-can-predict-when-your-fridge-is-empty/
1•ohjeez•1m ago•0 comments

Show HN: Follow London Trains in 3D

https://ride.nexttrain.london/
1•mgranados•1m ago•0 comments

Combined 1D and 2D Barcodes

https://shkspr.mobi/blog/2026/07/combined-1d-and-2d-barcodes/
1•Brajeshwar•1m ago•0 comments

Alexa+, the Next Generation of Alexa

https://www.aboutamazon.com/news/devices/new-alexa-generative-artificial-intelligence
1•doodlesdev•1m ago•0 comments

Are we still writing software?

https://ernestscribbler.xyz/are-we-still-writing-software.html
1•nickstinemates•2m ago•0 comments

ActiveGraph v1.2.0 is live – x30 speedup

https://activegraph.ai/blog/activegraph-v1-2-0
1•gkorland•2m ago•1 comments

The LLVM Compiler Infrastructure

https://cacm.acm.org/federal-funding-of-academic-research/the-llvm-compiler-infrastructure/
1•yarapavan•3m ago•0 comments

Understanding the Dynamics of the AI Ecosystem with Pace Layers

https://www.dbreunig.com/2026/07/03/ai-ecosytem-pace-layers.html
1•dbreunig•4m ago•0 comments

For Tailscale, good feedback is private feedback

https://doesmycode.work/posts/for-tailscale-good-feedback-is-private-feedback/
2•steveiliop56•7m ago•1 comments

Show HN: See a Random American

https://a-random-american.github.io
1•tintjosh•8m ago•0 comments

Show HN: WyrmRSS - Self-hosted RSS reader with inline YoutTube

https://github.com/kryoseu/WyrmRSS
1•kryoseu•8m ago•1 comments

Career Advice in the Age of AI

https://twitter.com/philhchen/status/2072793818945167475
1•yarapavan•10m ago•0 comments

The AI Compass

https://bambamramfan.github.io/ai-compass/
1•FLpxpyJ•11m ago•0 comments

Ask HN: America turns 250 today. What does it mean to you?

4•abixb•12m ago•0 comments

Make a website to learn Chinese and Enghlish

https://learnudot.com
1•jeyzolo•13m ago•0 comments

Agentic test processes, LLM benchmarks

https://danluu.com/ai-coding/
1•eatonphil•14m ago•0 comments

Work on multiple projects at once (ONE terminal window for everything)

https://github.com/philmard/mygrid
1•fmard•14m ago•1 comments

UPower 1.91.3 Fixes Behavior to Avoid Degrading Your Laptop Battery Faster

https://www.phoronix.com/news/UPower-1.91.3
1•Bender•16m ago•0 comments

The Polarization Trap: Gender Based Challenges to Liberal Democracy

https://richprocida.substack.com/p/gender-and-the-polarization-trap
1•RichProcida•16m ago•0 comments

4K 60 FPS USB Video Capture Becomes Less Problematic on Linux

https://www.phoronix.com/news/4K-60-FPS-USB-Video-Capture
1•Bender•16m ago•0 comments

How to Tax a Billionaire

https://www.motherjones.com/politics/2026/06/california-billionaire-tax-billionaires-wealth-gap-u...
1•mukmuk•17m ago•0 comments

Show HN: Gemma 3 inference in pure C++ with Metal acceleration

https://github.com/ybubnov/metalchat
1•ybubnov•17m ago•0 comments

Recovering outgoing reply-edges from a local X archive (with a bonus SVG)

https://github.com/responsiblparty/twitterverse
1•responsiblparty•18m ago•0 comments

Airplane Boneyards List and Map

https://airplaneboneyards.com/airplane-boneyards-list-and-map.htm
2•hyperific•19m ago•0 comments

CATL is building more than 200 battery swap stations every month

https://electrek.co/2026/07/04/catl-is-building-more-than-200-battery-swap-stations-every-month/
1•Bender•20m ago•0 comments

Show HN: SurfSkills – agent skills, each with a video of it working

https://surfskills.surf/discover
2•ephx•21m ago•0 comments

The World Cup is listed on Polymarket and Kalshi. They aren’t the same bet.

https://crosswire-api.com/
1•NicolasDJ04•22m ago•0 comments

Fable created novel 4D splat format

https://adamraudonis.github.io/splats4D/
1•adamraudonis•23m ago•1 comments

OpenSparrow – Schema-driven PHP and Postgres platform for CRUD zero dependencies

https://opensparrow.org/en/
1•tomaszwrobel•31m ago•0 comments

Reshaping the Quantum Arrow of Time

https://journals.aps.org/prx/abstract/10.1103/l18s-9vmh
1•bookofjoe•31m ago•0 comments