Ask HN: LLM is useless without explicit prompt

4•revskill•8mo ago
After months of playing with LLMs, here are my observations:

- LLM is basically useless without explicit intent in your prompt.

- LLM fails to correct itself. Once it generates bullshit, it gets stuck in an infinite loop of generating more bullshit.

The question is: without an explicit prompt, could an LLM apply best practices and produce maintainable code without me at least instructing it to?

Comments

ben_w•8mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Human meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).
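The arithmetic behind those two estimates can be reconstructed roughly as follows. This is only a sketch: the comment does not state a doubling time, so the 7-month figure below is an assumption chosen because it reproduces the quoted numbers, as are the 8-hour workday and the 10-workday sprint. The function name `months_until` is hypothetical.

```python
import math

# Assumption (not stated in the comment): the task time horizon doubles
# roughly every 7 months.
DOUBLING_MONTHS = 7.0

# Assumptions: an 8-hour workday and a 2-week sprint of 10 workdays.
WORKDAY_MINUTES = 8 * 60
SPRINT_MINUTES = 10 * WORKDAY_MINUTES


def months_until(current_minutes, target_minutes, doubling_months=DOUBLING_MONTHS):
    """Months until the horizon grows from current to target under steady doubling."""
    doublings = math.log2(target_minutes / current_minutes)
    return doublings * doubling_months


# 50% accuracy: from 1-hour tasks to a full workday.
daily_at_50 = months_until(60, WORKDAY_MINUTES)

# 80% accuracy: from 10-minute tasks to a 2-week sprint.
sprint_at_80 = months_until(10, SPRINT_MINUTES)

print(f"Day-long tasks at 50%: about {daily_at_50:.0f} months")
print(f"Sprint-long tasks at 80%: about {sprint_at_80 / 12:.1f} years")
```

Under these assumptions, the first case is three doublings (1 hour to 8 hours) and the second is a little under nine (10 minutes to 4,800 minutes), which lines up with "21 months" and "just over 5 years".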

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as architecture and maintainability are longer-horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•8mo ago
It's not as high as you think.

LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

My expectation is that, at least, some kind of maintainable code is generated from what it's learnt.

ben_w•8mo ago
Given your expectation:

> My expectation is that, at least, some kind of maintainable code is generated from what it's learnt.

And your observation:

> LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.
