frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•7mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•7mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•7mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•7mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.

A faster is_leap_year function (full-range, C++)

https://www.benjoffe.com/fast-date
1•benjoffe•55s ago•1 comments

NYT Connections LLM Benchmark

https://github.com/lechmazur/nyt-connections
1•cainxinth•3m ago•0 comments

Execute AI Agents with Markdown

https://github.com/johnlindquist/mdflow
1•handfuloflight•4m ago•0 comments

Microsoft finally realizes the threat SteamOS poses

https://www.techradar.com/computing/windows/microsoft-finally-realizes-the-threat-steamos-poses-b...
1•thewebguyd•5m ago•0 comments

Show HN: SetLocale – Syntax-safe localization for developers (JSON, PHP, XML)

https://setlocale.xyz/
1•byshako•5m ago•0 comments

What's the Difference Between an ACID and a BASE Database

https://aws.amazon.com/compare/the-difference-between-acid-and-base-database/
1•teleforce•6m ago•0 comments

A VC's 2026 Crystal Ball

https://medium.com/bread-and-butter-ventures/a-vcs-2026-crystal-ball-that-is-assuredly-going-to-p...
1•azhenley•9m ago•0 comments

What Is Comprehensible Input?

https://www.dreaming.com/blog-posts/what-is-comprehensible-input
1•carabiner•12m ago•0 comments

Intercom launches free AI Startup Pack with $100k+ in credits / value

https://fin.ai/startup-pack
3•chenchenhuo•12m ago•1 comments

China launches aerial drone carrier in show of prowess

https://www.japantimes.co.jp/news/2025/12/11/asia-pacific/china-aerial-drone-carrier/
1•ianrahman•13m ago•0 comments

Building a RAG Server with PostgreSQL – Part 3: Deploying Your RAG API

https://www.pgedge.com/blog/building-a-rag-server-with-postgresql-part-3-deploying-your-rag-api
2•pgedge_postgres•15m ago•0 comments

You can turn a cluster of Macs into an AI supercomputer in macOS Tahoe 26.2

https://www.engadget.com/ai/you-can-turn-a-cluster-of-macs-into-an-ai-supercomputer-in-macos-taho...
2•doener•17m ago•0 comments

The Art of Amiga Lettering

https://damieng.com/blog/2025/12/04/art-of-amiga-lettering/
1•freediver•17m ago•0 comments

I've built a website called UpVote a bit like this but more modern

https://www.upvote.social/
1•upvotenow•18m ago•0 comments

Universal UI

https://llmparty.pixeletes.com/experiments/universal_ui
1•victornomad•18m ago•0 comments

Sam Altman's World (formerly Worldcoin) unveiled a new "super app"

https://world.org/blog/announcements/the-new-world-app-secure-chat-global-payments-and-mini-apps-...
1•transfunct•19m ago•0 comments

Show HN: Sunset Compass – Privacy-first sun/moon tracker in 50 languages

https://apps.apple.com/au/app/golden-hour-compass/id6755098089
1•BenjaminHarris•25m ago•1 comments

Show HN: TrackSplit – Remove vocals and instruments from any song, offline

https://tracksplit.co/
1•jomargon•30m ago•0 comments

Layoutz – Make Simple, Beautiful CLI Output, No Component-Library Limitations

https://github.com/mattlianje/layoutz
2•MrJulia•33m ago•0 comments

The Trump Tracker

https://docs.google.com/spreadsheets/d/1VNPGRB5ZcrxxIk_27Mmbe10nxc5wyCuHJPpD4ZGSvEU/edit?gid=1528...
1•josh_carterPDX•34m ago•0 comments

The Code That Revolutionized Orbital Simulation [video]

https://www.youtube.com/watch?v=nCg3aXn5F3M
1•todsacerdoti•35m ago•0 comments

Show HN: Data Axolotl – Monitor analytic data without writing individual tests

https://github.com/thorntale/data-axolotl
2•johnstimac111•38m ago•0 comments

VMware kills vSphere Foundation in parts of EMEA

https://www.theregister.com/2025/12/11/vmware_kills_vsphere_foundation_parts_emea/
1•Bender•38m ago•0 comments

NASA internship prototyping radiation-tolerant Framework Laptop 16 mainboard

https://stemgateway.nasa.gov/s/course-offering/a0BSJ000004rBsf2AE/radiationtolerant-crew-laptop
6•Lammy•39m ago•0 comments

Bit Twiddling Hacks

https://graphics.stanford.edu/~seander/bithacks.html
1•gurjeet•40m ago•0 comments

Cursor Launches an AI Coding Tool for Designers

https://www.wired.com/story/cursor-launches-pro-design-tools-figma/
1•rmason•41m ago•0 comments

IETF 124 post-meeting survey

https://www.ietf.org/blog/ietf124-post-meeting-survey/
1•mooreds•41m ago•0 comments

AI coding is sexy, but accounting is the real low-hanging automation target

3•bmadduma•44m ago•0 comments

Ask HN: Why doesn't HN ask to confirm hiding posts?

1•ludamn•45m ago•1 comments

Uber pulls back from electric cars, slashing incentives for drivers

https://financialpost.com/commodities/energy/electric-vehicles/uber-pulls-back-electric-cars
2•smurda•46m ago•0 comments