Ask HN: LLM is useless without explicit prompt

4•revskill•11mo ago
After months of playing with LLMs, here are my observations:

- An LLM is basically useless without explicit intent in your prompt.

- An LLM fails to correct itself. Once it generates bullshit, it gets stuck in an infinite loop of generating more bullshit.

The question is: without an explicit prompt, could an LLM at least apply best practices and produce maintainable code without me instructing it to?

Comments

ben_w•11mo ago
Your expectations are way too high.

> - An LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving into whatever we're self-motivated to do.

What does differ is the time scale of the feedback loop with management:

Human meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).
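The arithmetic behind those two figures can be sketched as follows. This is only a rough reconstruction: the 7-month doubling time, the 8-hour working day, and the 10-working-day sprint are assumptions that are not stated in the comment, chosen because they roughly reproduce the quoted numbers. The function name `months_until` is hypothetical.

```python
import math

# Assumption (not stated in the comment): the task time horizon doubles
# roughly every 7 months.
DOUBLING_MONTHS = 7.0


def months_until(current_minutes: float, target_minutes: float,
                 doubling_months: float = DOUBLING_MONTHS) -> float:
    """Months for the horizon to grow from current to target,
    assuming steady exponential growth."""
    doublings = math.log2(target_minutes / current_minutes)
    return doublings * doubling_months


# 50% horizon: 1 hour today, target one 8-hour working day (assumed).
standup = months_until(60, 8 * 60)

# 80% horizon: 10 minutes today, target a 2-week sprint,
# taken here as 10 working days of 8 hours (assumed).
sprint = months_until(10, 10 * 8 * 60)

print(f"daily standup at 50%: {standup:.0f} months")
print(f"2-week sprint at 80%: {sprint / 12:.1f} years")
```

Under these assumptions the first figure is three doublings (1 hour to 8 hours), i.e. 21 months, and the second is a little under nine doublings, i.e. just over 5 years. A different doubling time or definition of a working day shifts both estimates.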

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as architecture and maintainability are longer-horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•11mo ago
It's not as high as you think.

LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

My expectation is that, at least, some kind of maintainable code is generated from what it has learnt.

ben_w•11mo ago
Given your expectation:

> My expectation is that, at least, some kind of maintainable code is generated from what it has learnt.

And your observation:

> LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.
