frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•9mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•9mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•9mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•9mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

Does Intermittent Fasting Live Up to the Hype?

https://www.nytimes.com/2026/01/26/well/eat/intermittent-fasting.html
1•doener•3m ago•0 comments

2 days left to comment on DOT's plan to hike US fuel costs by $23B

https://electrek.co/2026/02/02/2-days-left-to-comment-on-dots-plan-to-hike-us-fuel-costs-by-23b/
1•Bender•6m ago•0 comments

Released: Ace-Step 1.5: Pushing the Boundaries of Open-Source Music Generation

https://ace-step.github.io/ace-step-v1.5.github.io/
1•4chandaily•6m ago•1 comments

Dual N-Back

https://gwern.net/dnb-faq
1•aggrrrh•7m ago•0 comments

FERC: Renewables made up 88% of new US power generating capacity to Nov 2025

https://electrek.co/2026/02/02/ferc-renewables-power-generating-capacity-to-nov-2025/
1•Bender•7m ago•0 comments

Four theories about the SpaceX – xAI merger

https://garymarcus.substack.com/p/four-theories-about-the-spacex-xai
2•headalgorithm•9m ago•0 comments

Bruce Schneier: AI and the scaling of betrayal

https://www.schneier.com/blog/archives/2023/12/ai-and-trust.html
4•insuranceguru•9m ago•1 comments

Pornhub shuts off access to new UK users, citing age verification constraints

https://www.cnn.com/2026/02/02/uk/uk-pornography-restricted-access-intl
1•Bender•9m ago•0 comments

BYD's next-gen megawatt charger leaks: 1,500 kW vs. 1k kW first gen

https://carnewschina.com/2026/02/02/byds-next-gen-megawatt-charger-leaks-1500-kw-power-1500-a-cur...
2•jampa•10m ago•0 comments

In Under 500 Words, a Judge Weaponized Wit to Free the Child Detained by ICE

https://www.nytimes.com/interactive/2026/02/03/books/judge-ruling-liam-conejo-ramos-analysis.html
3•petethomas•10m ago•0 comments

Postgres managed by ClickHouse

https://clickhouse.com/cloud/postgres
2•tosh•11m ago•0 comments

Life without good internet is boring

https://blog.usmanity.com/posts/life-without-good-internet-is-boring
1•speckx•11m ago•0 comments

Where the Work Goes When Agents Arrive

https://dreamiurg.net/2026/02/02/where-the-work-goes-when-agents-arrive.html
1•dreamiurg•11m ago•1 comments

Mad Rust: The JVM Developer's Journey. Kotlin/Java Developer's Road to Valhalla

https://sobolev.substack.com/p/mad-rust-escape-from-the-jvm-citadel
1•alexsobolev•12m ago•0 comments

Show HN: Tenuo – Capability-Based Authorization (Macaroons for AI Agents)

2•niyikiza•13m ago•0 comments

Show HN: Real-world speedrun timer that auto-ticks via vision on smart glasses

https://github.com/RealComputer/GlassKit/tree/main/examples/rokid-rfdetr
2•tash_2s•13m ago•1 comments

My deep thoughts and considered opinions on AI

https://skryblans.com/my-very-deep-thoughts-and-considered-opinions-on-ai/
2•milkcircle•13m ago•0 comments

Why are Spain and Portugal growing twice as fast as the Eurozone?

https://www.euronews.com/business/2026/01/30/why-are-spain-and-portugal-growing-twice-as-fast-as-...
1•belter•14m ago•0 comments

Elon Musk is taking SpaceX's minority shareholders for a ride

https://www.theguardian.com/business/nils-pratley-on-finance/2026/feb/03/elon-musk-is-taking-spac...
2•6LLvveMx2koXfwn•15m ago•1 comments

PayPal Appoints Enrique Lores as Chief Executive Officer

https://investor.pypl.com/news-and-events/news-details/2026/PayPal-Appoints-Enrique-Lores-as-Chie...
1•zatkin•15m ago•0 comments

Elevated error rates for ChatGPT users – OpenAI Status

https://status.openai.com/incidents/01KGJK9Q6PDB3C3VX6MPCY6106
8•rossant•17m ago•1 comments

5M installs, $1M Open Source Grant program, and the story of how we got here

https://cline.bot/blog/5m-installs-1m-open-source-grant-program
2•raybb•17m ago•0 comments

Ruptures in China's Leadership Could Be Due to Paranoia and Power Plays

https://www.nytimes.com/2026/02/03/us/politics/china-xi-military-purge.html
2•JumpCrisscross•17m ago•0 comments

Nava Acquires Beam to Raise the Bar for Public Service IT

https://www.govtech.com/biz/nava-acquires-beam-to-raise-the-bar-for-public-service-it
1•stephenhuey•17m ago•1 comments

Epstein Backed Coinbase in Crypto Exchange's Early Years

https://www.bloomberg.com/news/articles/2026-02-03/epstein-backed-coinbase-in-crypto-exchange-s-e...
6•wslh•19m ago•1 comments

Rules_Claude: Hermetic Bazel toolchain and rules for Claude Code

https://github.com/buildbuddy-io/rules_claude
4•siggi•20m ago•1 comments

Future home might be framed with printed plastic

https://news.mit.edu/2026/your-future-home-might-be-framed-with-printed-plastic-0203
1•gnabgib•22m ago•0 comments

Fintech CEO and Forbes 30 Under 30 alum has been charged for alleged fraud

https://techcrunch.com/2026/02/02/fintech-ceo-and-forbes-30-under-30-alum-has-been-charged-for-al...
1•wslh•23m ago•1 comments

Net Neutrality for AI

https://vanderbiltpolicyaccelerator.substack.com/p/net-neutrality-for-ai
1•geox•25m ago•0 comments

When Vibe Coded Consumer Agents Go Rogue

https://nearfuturelaboratory.com/editorial/when-vibe-coded-consumer-agents-go-rogue/
2•cyanbane•27m ago•0 comments