frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•8mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•8mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•8mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•8mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Your Dog Might Be Eavesdropping on You

https://www.scientificamerican.com/article/some-dogs-learn-new-words-just-like-toddlers-do/
1•sohkamyung•3m ago•0 comments

Novel AI Method Sharpens 3D X-ray Vision

https://www.bnl.gov/newsroom/news.php?a=222627
1•cl3misch•5m ago•0 comments

Show HN: Where do my taxes go in Berlin? A personal receipt generator

https://berlin-bill.eamag.me
1•eamag•6m ago•0 comments

39C3 – Cracking Open What Makes Apple's Low-Latency WiFi So Fast [video]

https://media.ccc.de/v/39c3-cracking-open-what-makes-apple-s-low-latency-wifi-so-fast
1•amanverasia•8m ago•0 comments

Tik-Tok (Novel)

https://en.wikipedia.org/wiki/Tik-Tok_(novel)
1•firebaze•8m ago•0 comments

Pentagon is embracing Grok AI chatbot as it draws global outcry

https://apnews.com/article/artificial-intelligence-pentagon-hegseth-musk-7f99e5f32ec70d7e39cec92d...
1•geox•11m ago•1 comments

Show HN: Nametag – open-source personal relationships manager

https://nametag.one/
1•mattogodoy•13m ago•0 comments

European firms hit hiring brakes over AI and slowing growth

https://www.dw.com/en/european-eurozone-job-labor-market-unemployment-company-hiring-practice-cov...
1•smurda•13m ago•0 comments

Rewiring Mozilla: Doing for AI what we did for the web

https://blog.mozilla.org/en/mozilla/rewiring-mozilla-ai-and-web/
1•nalinidash•15m ago•0 comments

AI, AI Everywhere

1•okokwhatever•17m ago•0 comments

Physicians see 1 in 6 patients as 'difficult,' study finds

https://www.beckershospitalreview.com/patient-experience/physicians-see-1-in-6-patients-as-diffic...
1•Growtika•18m ago•1 comments

International central bankers stand in full solidarity with Powell

https://www.ecb.europa.eu/press/pr/date/2026/html/ecb.pr260113~ec4630b9fa.en.html
2•throw0101c•18m ago•2 comments

Boundary Enforcement in Code Review

1•mthssalome•21m ago•0 comments

Owners, not renters: Mozilla's open source AI strategy

https://blog.mozilla.org/en/mozilla/mozilla-open-source-ai-strategy/
1•nalinidash•22m ago•0 comments

Show HN: Janus – Anki flashcards from PDFs, videos and notes

https://janus.cards
1•A-F-V16•22m ago•1 comments

$999 RTX 5090 GPU scam claims 42 victims

https://www.tomshardware.com/pc-components/gpus/usd999-rtx-5090-gpu-scam-claims-42-victims-fanny-...
1•croes•22m ago•0 comments

People as Harmonic Oscillators

https://dogdogfish.com/blog/2026/01/13/people-as-oscillators/
2•matthewsharpe3•29m ago•1 comments

Hit squad recruiter for Sweden's Foxtrot criminal network arrested in Iraq

https://www.thenationalnews.com/news/mena/2026/01/13/hit-squad-recruiter-for-swedens-foxtrot-crim...
1•campuscodi•30m ago•0 comments

Show HN: MakersHub.dev – A community platform for people building with AI tools

https://makershub.dev/
1•adilmoujahid•30m ago•0 comments

Show HN: FreeMarker Support for Zed Editor

https://github.com/debba/zed-freemarker
1•debba•32m ago•0 comments

Ask HN: Quantum Computation, Computers and Programming

1•rramadass•34m ago•0 comments

Show HN: Policy-governed AI system for offline deployment in expertise deserts

https://github.com/thepoorsatitagain/Tutor-to-disaster-expert
1•thepoors•35m ago•0 comments

Podshop, the hedge fund game. As seen on Bloomberg's money stuff

https://www.podshop.io
1•WiseHare•35m ago•0 comments

Could Magic Mushrooms Have 'Woken Up' Our Ancestors?

https://thesporereport.com/?p=580
1•richrichardsson•36m ago•0 comments

WebUSB Unpinner: network analysis for the masses

https://reversing.works/posts/2025/12/webusb-unpinner-network-analysis-for-the-masses/
1•chobeat•37m ago•0 comments

Show HN: High-precision mouse polling rate tester

https://mousepollingratetest.com/
1•zylics•40m ago•1 comments

User authorization just got 10x harder

https://leaddev.com/event/user-authorization-just-got-10x-harder
2•mooreds•40m ago•0 comments

Ask HN: Infrastructure teams – what's your biggest compliance headache?

1•coppinfra•43m ago•0 comments

Iran official says 2k people have been killed in unrest

https://www.reuters.com/world/china/iranian-mp-warns-greater-unrest-urging-government-address-gri...
7•JumpCrisscross•43m ago•2 comments

Dullness and Disbelief: The 2026 AI Regression

https://vibesbench.substack.com/p/dullness-and-disbelief-the-2026-ai
2•firasd•43m ago•0 comments