Ask HN: LLM is useless without explicit prompt

4•revskill•11mo ago
After months of playing with LLMs, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLMs fail to correct themselves. If one generates bullshit, it gets stuck in an infinite loop of generating more bullshit.

The question is: without an explicit prompt, could an LLM leverage all the best practices to provide maintainable code without me at least instructing it to?

Comments

ben_w•11mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Human meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).
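The two figures above can be reproduced with a short back-of-the-envelope calculation. This is only a sketch: the comment does not state the doubling time behind "the currently observed trend", so the 7-month value below is an assumption chosen because it reproduces the quoted numbers. The 8-hour working day and the 10-working-day sprint are likewise assumptions, and `months_until` is a hypothetical helper name.

```python
import math

# Assumption: the time horizon doubles roughly every 7 months.
# The comment does not state this figure; it is chosen here because
# it reproduces the "21 months" and "just over 5 years" estimates.
DOUBLING_MONTHS = 7.0

# Assumptions: an 8-hour working day and a 10-working-day sprint.
WORKDAY_MINUTES = 8 * 60
SPRINT_MINUTES = 10 * WORKDAY_MINUTES


def months_until(current_minutes, target_minutes, doubling_months=DOUBLING_MONTHS):
    """Months until a horizon grows from current to target at a fixed doubling time."""
    doublings = math.log2(target_minutes / current_minutes)
    return doublings * doubling_months


# 50% success horizon: from a 1-hour task to a full working day.
standup_months = months_until(60, WORKDAY_MINUTES)

# 80% success horizon: from a 10-minute task to a 2-week sprint.
sprint_months = months_until(10, SPRINT_MINUTES)

print(f"50% horizon reaches one working day in {standup_months:.0f} months")
print(f"80% horizon reaches a 2-week sprint in {sprint_months / 12:.1f} years")
```

Under these assumptions the first horizon needs 3 doublings (21 months) and the second needs about 8.9 doublings (roughly 5.2 years). A different doubling time or working-day length shifts both estimates proportionally.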

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as architecture and maintainability are longer-horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•11mo ago
It's not as high as you think.

LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

My expectation is that, at least, some kind of maintainable code is generated from what it's learnt.

ben_w•11mo ago
Given your expectation:

> My expectation is that, at least, some kind of maintainable code is generated from what it's learnt.

And your observation:

> LLMs fail at the most basic things related to maintainable code. Their code is basically a hacky mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.
