frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•11mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•11mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•11mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•11mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

From Hierarchy to Intelligence

https://block.xyz/inside/from-hierarchy-to-intelligence
1•cebert•1m ago•0 comments

54kb Minecraft clone made without a graphics library [video]

https://www.youtube.com/watch?v=JO8EnzNfEk4
1•PaulHoule•2m ago•0 comments

Show HN: Infer – Pipe friendly Agent Harness with one tool: Bash

https://github.com/turlockmike/infer
1•turlockmike•3m ago•0 comments

I built a tool to let you export your X bookmarks and categorize them

https://x-archive.netlify.app/
1•xarchive•3m ago•1 comments

Google backs London AI Campus with Camden partners to expand AI skills provision

https://www.edtechinnovationhub.com/news/google-backs-london-ai-campus-with-camden-partners-to-ex...
1•smurda•4m ago•0 comments

Automatic registration for US Military draft to begin in December

https://thehill.com/policy/defense/5822914-automatic-registration-military-draft/
2•c420•5m ago•0 comments

Boring Is Good (2024)

https://www.coryd.dev/posts/2024/boring-is-good
1•cdrnsf•6m ago•0 comments

Russian spooks hack Wi-Fi routers to spy on West

https://www.politico.eu/article/russias-gru-hacked-hundreds-of-wi-fi-routers-world-wide/
1•c420•7m ago•0 comments

Show HN: Free-AI-ad-credits.md – $12K+ in free AI ad credits

https://github.com/darwin-studios/agents/blob/main/free-ai-ad-credits.md
1•jason-festa•8m ago•0 comments

Claude Managed Agents Overview

https://platform.claude.com/docs/en/managed-agents/overview
1•NicoJuicy•9m ago•0 comments

Industrial Policy for the Intelligence Age

https://openai.com/index/industrial-policy-for-the-intelligence-age/
1•geox•11m ago•0 comments

Show HN: Composer – Diagram Your Codebase with MCP

https://www.usecomposer.com/
1•olivergrabner•13m ago•0 comments

The Cost of Scrolling

https://azariak.github.io/CostOfScrolling/
3•HerbManic•13m ago•1 comments

Have Codex review Claude's work

https://gist.github.com/anchpop/56688d3526e1d4019db7c4fc1bcb2b2c
2•ChadNauseam•15m ago•1 comments

The Political Disaster That Is California [video][15min]

https://www.youtube.com/watch?v=inOci0iH4Q8
1•Bender•17m ago•1 comments

Amazon Stopping Support for Old Kindles

https://www.independent.co.uk/news/world/americas/amazon-kindle-ending-support-old-e-readers-b295...
3•oj2828•20m ago•2 comments

Show HN: Prefab – A generative UI framework for Python

https://prefab.prefect.io/docs/welcome
2•jlowin•20m ago•0 comments

Local SEO Analyst Agent – PDF Report Generation

https://github.com/jeffjbowie/Local-SEO-Analyst-Agent
1•Veritaco•22m ago•0 comments

Are We in the Chinese Room?

https://thefriendlyghost.nl/chinese-room-ai/
1•cvanelteren•26m ago•0 comments

The Hormuz chokehold affects AI funding too

https://highabsolutevalue.substack.com/p/the-hormuz-chokehold-affects-ai-funding
3•preetnation•26m ago•0 comments

I Don't Care What the Haters Are Saying, I'm Having a Blast

https://systemdrift.neocities.org/blog/i-dont-care-what-the-haters-are-saying-im-having-a-blast
2•myrrhman•27m ago•1 comments

Waymo's Robot Car Testing Ends in NYC After Permits Expire

https://www.thecity.nyc/2026/04/06/waymo-driverless-cars-testing-roads-autonomous-vehicle/
4•xnx•28m ago•0 comments

Digital Assets Rules Need Clarity

https://www.wsj.com/opinion/digital-assets-rules-need-clarity-6dfcab70
1•petethomas•32m ago•0 comments

U.S. Made a Deal That Gives Us Nothing We Wanted

https://www.theatlantic.com/national-security/2026/04/iran-strait-hormuz-us-trump-nuclear-weapons...
3•JumpCrisscross•33m ago•0 comments

WireGuard VPN developer's Microsoft account locked

https://twitter.com/EdgeSecurity/status/2041872931576299888
3•worik•33m ago•2 comments

I built a canvas where AI agents work together as a design team

https://designagents.app/blog/what-if-your-design-team-was-made-of-ai-agents
1•aliparnan•34m ago•1 comments

All You Need Is Not All You Need

https://www.researchsquare.com/article/rs-8399522/v1
2•cfcfcf•35m ago•0 comments

I built an agent agnostic, locally run, open-source observability product

https://github.com/Metabuilder-Labs/openclawwatch
2•anil-metabldr•36m ago•0 comments

Wells Fargo whistleblower award slashed by Wall Street watchdog

https://www.ft.com/content/b18fff7e-0797-4dba-bec6-4f8843cffd13
1•petethomas•37m ago•0 comments

I built CLI tool that analyzes logs and explains incidents

https://github.com/sydes-ai/ai-autopsy
1•naiks1214•39m ago•1 comments