frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•6mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•6mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•6mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•6mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Azure APIM Cross-Tenant Signup Bypass

https://github.com/bountyyfi/Azure-APIM-Cross-Tenant-Signup-Bypass
1•chili-salsa•2m ago•1 comments

Decker is a multimedia platform for creating and sharing interactive documents

http://beyondloom.com/decker/index.html
1•TheTaytay•10m ago•0 comments

I Read the Terms of Service for My Smart TV and Now I Sleep with One Eye Open

https://jxself.org/smart-tv-tos.shtml
1•Gedxx•17m ago•0 comments

Cryptology firm cancels elections after losing encryption key

https://www.bbc.com/news/articles/c62vl05rz0ko
1•ColinWright•18m ago•0 comments

Creepy AI Toys

https://www.nytimes.com/2025/08/15/arts/ai-toys-curio-grem.html
1•I_Nidhi•21m ago•0 comments

Amazon faces FAA probe after delivery drone snaps internet cable in Texas

https://www.cnbc.com/2025/11/25/amazon-faa-probe-delivery-drone-incident-texas.html
2•jonathanzufi•21m ago•0 comments

Porn Giant Calls for Device-Based Digital ID

https://reclaimthenet.org/porn-giant-calls-for-device-based-digital-id
1•uyzstvqs•24m ago•1 comments

Secrets in unlisted GitHub gists are reported to secret scanning partners

https://github.blog/changelog/2025-11-25-secrets-in-unlisted-github-gists-are-now-reported-to-sec...
1•petercooper•24m ago•1 comments

Await Is Not a Context Switch: Understanding Python's Coroutines vs. Tasks

https://mergify.com/blog/await-is-not-a-context-switch-understanding-python-s-coroutines-vs-tasks
6•remyduthu•26m ago•0 comments

Devenv 1.11: Module changelogs and SecretSpec 0.4.0

https://devenv.sh/blog/2025/11/26/devenv-111-module-changelogs-and-secretspec-040/
1•domenkozar•27m ago•0 comments

Practical Intro to Operational Transformation

https://archive.casouri.cc/note/2025/practical-intro-ot/
1•casouri•29m ago•0 comments

Estimating AI productivity gains from Claude conversations

https://www.anthropic.com/research/estimating-productivity-gains
1•kerim-ca•42m ago•0 comments

Show HN: ConfluenceMeter Beta, live panel for crypto confluence

https://www.confluencemeter.com/mvp
2•Paugallego•44m ago•1 comments

Show HN: ~$root-dir: a command-line community for devs, builders and creators

https://www.root-dir.com
2•madsmadsdk•48m ago•0 comments

Formal Specification for Authorization: Clarity Before Implementation

https://blog.gchinis.com/posts/2025/11/formal-specification-for-authorization/
2•gchinis•48m ago•0 comments

Hamas attack victims sue Binance for allowing payments to militant group

https://www.reuters.com/legal/government/hamas-attack-victims-sue-binance-allegedly-allowing-paym...
2•barredo•49m ago•0 comments

Alphaproof paper (IMO 2024 Silver) is finally published in Nature [pdf]

https://www.nature.com/articles/s41586-025-09833-y_reference.pdf
2•zuzatm•50m ago•1 comments

Show HN: MenuPhotoAI – AI food photography that keeps dishes real

https://www.menuphotoai.com
1•redp314•52m ago•0 comments

Canva is considering porting Affinity to Linux

https://techcentral.co.za/affinity-for-linux-canvas-next-big-move-could-reshape-the-desktop-softw...
6•methuselah_in•53m ago•0 comments

Dutch public broadcaster NOS quits X over disinformation

https://www.reuters.com/business/media-telecom/dutch-public-broadcaster-nos-quits-x-over-disinfor...
7•giuliomagnifico•54m ago•2 comments

Skyscrapers engulfed in flames after fire spreads on bamboo scaffolding

https://metro.co.uk/2025/11/26/three-skyscrapers-engulfed-flames-fire-spreads-bamboo-scaffolding-...
1•perihelions•56m ago•0 comments

Coffee

https://chrispymm.co.uk/coffee
1•worez•1h ago•0 comments

Invisible Details of Interaction Design

https://rauno.me/craft/interaction-design
1•bfirsh•1h ago•0 comments

Learnings from 1 year of agents: PostHog AI

https://posthog.com/blog/8-learnings-from-1-year-of-agents-posthog-ai
1•czue•1h ago•1 comments

Show HN: An app that turns doomscrolling into learning

https://apps.apple.com/app/id6754678719
1•HamadAlmheiri•1h ago•0 comments

Show HN: NxtPitch – AI that instantly generates pitch proposals

https://nxtpitch.com
2•anmolkushwah19•1h ago•0 comments

Get us off Microsoft! Lawmakers press EU Parliament to change in-house IT

https://www.politico.eu/article/get-us-off-microsoft-eu-lawmakers-press-parliament-to-change-in-h...
3•robtherobber•1h ago•0 comments

Dell (Dell) Q3 2026 Earnings Call Transcript

https://www.theglobeandmail.com/investing/markets/stocks/DELL/pressreleases/36316186/dell-dell-q3...
1•doener•1h ago•1 comments

I don't care how well your "AI" works

https://fokus.cool/2025/11/25/i-dont-care-how-well-your-ai-works.html
24•todsacerdoti•1h ago•1 comments

Dynamic Skillset Reference Architecture

https://chatbotkit.com/examples/dynamic-skillset-reference-architecture
1•_pdp_•1h ago•1 comments