frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•8mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•8mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•8mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•8mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Remove repo selector from charts, usage, and PRs pages

1•nishiohiroshi•2m ago•0 comments

I built a tool that forces 5 AI to debate and cross-check facts before answering

https://github.com/KeaBase/kea-research
1•Stanislaw_•4m ago•0 comments

My 2025 Bug Bounty Stories

https://joshua.hu/2025-bug-bounty-stories-fail
2•karel-3d•7m ago•1 comments

BullSheet – My "Local" Quantitative Finance Engine

https://bayramovanar.substack.com/p/why-i-built-bullsheet-part-1
1•Bayramovanar•9m ago•1 comments

A 1990s CMS That Still Ships: Exponential CMS Reaches PHP 8.5

https://vincentopar.com/
1•Vincent_Opar•10m ago•0 comments

DOGE staffers at Social Security agency may have violated Hatch Act, DOJ says

https://abcnews.go.com/US/2-doge-staffers-social-security-agency-violated-hatch/story?id=129393252
1•vaxman•10m ago•0 comments

Are 'toxic' personality traits useful test cases for AI or behavioral models?

https://github.com/FlDanyT/ai-celebrity-models
1•yakalmar2048•13m ago•1 comments

LiveContainer: Run iOS apps without installing them

https://github.com/LiveContainer/LiveContainer
1•handfuloflight•13m ago•0 comments

DragonSweeper: A minesweeper game that requires observation

https://dragonsweeper.org
1•wslh•14m ago•0 comments

WebRTC VPN Tunnel

https://github.com/Manav1011/webrtc-vpn
1•walterbell•15m ago•0 comments

DiffRatio – A One-Step Diffusion Model with SOTA quality and 50% less memory

https://www.arxiv.org/pdf/2502.08005
2•LoMoGan•17m ago•1 comments

The Issue with Special Issues: When Guest Editors Publish in Support of Self

https://arxiv.org/abs/2601.07563
1•wslh•18m ago•0 comments

Amazon Joins the Big-Box League with Its Largest-Ever Store

https://www.wsj.com/business/retail/amazon-orland-park-illinois-opening-13362c97
1•divbzero•22m ago•0 comments

When I Talk to AI About My Feelings, I Don't Want a Therapy Ad

https://www.theverge.com/news/864103/mixed-messaging
1•thor1122•23m ago•0 comments

Green vs. Blue

https://greenvblue.npeercy.com/
1•greenwallnorway•28m ago•0 comments

Sony to Transfer Home Entertainment Operations to Tcl-Led Joint Venture

https://xthe.com/news/sony-tv-business-tcl/
1•Sandhyaseo•29m ago•1 comments

Negotiating Relationships with ChatGPT

https://arxiv.org/abs/2601.13188
2•7777777phil•30m ago•0 comments

Why Submit to AI in Production: Speaking as a Tool for Better Work

https://www.r-bloggers.com/2026/01/why-submit-to-ai-in-production-speaking-as-a-tool-for-better-w...
1•7777777phil•32m ago•0 comments

Crates.io: Development Update

https://blog.rust-lang.org/2026/01/21/crates-io-development-update/
3•quapster•33m ago•0 comments

AT&T Archives: The Unix Operating System (1972) [video]

https://www.youtube.com/watch?v=tc4ROCJYbm0
1•vismit2000•34m ago•0 comments

Agentic RAG for Dummies

https://github.com/GiovanniPasq/agentic-rag-for-dummies
1•thunderbong•35m ago•0 comments

Mnemonic BTC Slots

https://coinables.github.io/mnemonic-slots/#
1•nicholasbraker•36m ago•0 comments

Welcome to Niji V7

https://nijijourney.com/blog/niji-7
1•ankitg12•38m ago•0 comments

Show HN: A Spectrum Album – Structuring AI-Generated Music with Suno

https://karbeyazalbum.replit.app/
2•ersinesen•38m ago•0 comments

Accidentally making $1000 for finding Security Bugs as a Back end Developer

https://not-afraid.medium.com/accidentally-making-1000-for-finding-security-bugs-as-a-backend-dev...
1•birdculture•39m ago•0 comments

Show HN: BSS Blue Hive Guide

https://www.bluehiveguide.com/index.html
1•andy846851797•39m ago•0 comments

Git Show

https://tonystr.net/blog/git
1•TonyStr•41m ago•0 comments

Show HN: LLM fine-tuning without infra or ML expertise

https://www.tinytune.xyz/
2•Jacques2Marais•41m ago•1 comments

Ask HN: How do you manage your morning catch-up routine?

2•Peterz_shu•44m ago•1 comments

Show HN: I built an enterprise weather intelligence platform with Lovable

https://preview--chrono-strata.lovable.app/shop
2•lavandar-admin•47m ago•0 comments