Eric S. Raymond: why is there such a huge variance in results from using LLMs?

https://twitter.com/esrtweet/status/2016849708254179501
4•dist-epoch•1h ago

Comments

sara_builds•1h ago
The variance mostly comes down to prompt craft and context management.

People who get consistently good results have usually internalized a few things: (1) being explicit about constraints and output format, (2) providing relevant context without noise, (3) matching the model to the task (reasoning-heavy vs creative vs code), and (4) iterating on the prompt when something fails rather than assuming the model is broken.

I've seen the same person get wildly different results depending on whether they ask "write code to do X" vs "I need a function that takes A, returns B, handles edge case C, and should be optimized for D. Here's the existing code it needs to integrate with: [context]."

The gap between those two approaches can be a 10x difference in usefulness. Most of the "LLMs are useless" crowd and the "LLMs are magic" crowd are just working with very different prompt habits.
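
To make the second style of request concrete, here is a minimal sketch in Python of assembling such a prompt from explicit fields. Everything in it is hypothetical and for illustration only: the `FunctionRequest` class, its field names, and `build_prompt` are not from the thread or from any library, and the wording of the generated prompt is just one possible phrasing of the "takes A, returns B, handles edge case C, optimized for D" pattern.

```python
from dataclasses import dataclass, field


@dataclass
class FunctionRequest:
    """Hypothetical container for the details an explicit prompt spells out."""
    task: str                     # what the function should do
    inputs: str                   # "takes A"
    returns: str                  # "returns B"
    edge_cases: list[str] = field(default_factory=list)  # "handles edge case C"
    optimize_for: str = ""        # "optimized for D"
    context: str = ""             # existing code to integrate with


def build_prompt(req: FunctionRequest) -> str:
    """Assemble an explicit prompt; empty optional fields are skipped."""
    lines = [
        f"I need a function that {req.task}.",
        f"Input: {req.inputs}",
        f"Output: {req.returns}",
    ]
    if req.edge_cases:
        lines.append("Edge cases to handle:")
        lines.extend(f"- {case}" for case in req.edge_cases)
    if req.optimize_for:
        lines.append(f"Optimize for: {req.optimize_for}")
    if req.context:
        lines.append("Existing code it needs to integrate with:")
        lines.append(req.context)
    return "\n".join(lines)


if __name__ == "__main__":
    request = FunctionRequest(
        task="parses ISO 8601 dates",
        inputs="a string",
        returns="a datetime.date",
        edge_cases=["empty string", "trailing whitespace"],
        optimize_for="readability",
    )
    print(build_prompt(request))
```

The point of the structure is only that each missing field becomes visible before the prompt is sent; it says nothing about which model to use or how the result will turn out.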

armchairhacker•1h ago
What are your exact prompts (including project context) and generated code?

And for those who are struggling with LLMs, what are their prompts and code?

FrankWilhoit•45m ago
He thinks they ought to converge. What does he think they ought to converge upon? How will he know that thing when he sees it? If he will know it when he sees it, why does he need help making it?

The answer to all of these, of course, is that convergence is not expected and correctness is not a priority. The use of an LLM is a boasting point, full stop. It is a performance. It is "look, Ma, no coders!". And it is only relevant, or possible, because although the LLM code is not right, the pre-LLM code wasn't right either. The right answer is not part of the bargain. The customer doesn't care whether the numbers are right. They care how the technology portfolio looks. Is it currently fashionable? Are the auditors happy with it? The auditors don't care whether the numbers are right: what they care about is whether their people have to go to any -- any -- trouble to access or interpret the numbers.

Amazon's Promotion of 'Melania' Has Critics Questioning Its Motives

https://www.nytimes.com/2026/01/28/business/media/amazon-melania-trump-film-critics.html
1•JumpCrisscross•1m ago•0 comments

Pakistan becomes latest Asian country to introduce checks for deadly Nipah virus

https://www.reuters.com/business/healthcare-pharmaceuticals/pakistan-becomes-latest-asian-country...
1•JumpCrisscross•4m ago•0 comments

GNU Gettext Reaches Version 1.0 After 30 Years in Development

https://www.phoronix.com/news/GNU-gettext-1.0
2•marcodiego•6m ago•0 comments

Drop your URL. I will Analyze and give FREE SEO tips

1•itsjoaki•7m ago•1 comments

Game jam for high schoolers in 200 cities

https://campfire.hackclub.com/
1•sadeshmukh•7m ago•0 comments

Show HN: Cueso – iPhone app for location-based grocery reminders

1•riqbal4•8m ago•1 comments

Mugabo Rongin

https://github.com/Ronny12345-art/MRcutter
1•Davidbombal•8m ago•1 comments

The New Shadowbanning Panic

https://www.theatlantic.com/technology/2026/01/tiktok-shadowbanning-trump/685798/
1•JumpCrisscross•9m ago•1 comments

How British Queues Got Out of Hand

https://timharford.com/2026/01/how-british-queues-got-out-of-hand/
3•asplake•10m ago•0 comments

Ask HN: Bookmarking service with snapshots and context based (local LLM?) search

1•haunter•10m ago•0 comments

Email Security: Where We Are and What the Future Holds

https://www.privacyguides.org/articles/2025/11/15/email-security/
1•evolve2k•10m ago•0 comments

Google Co-Founder Seeds Billionaire Political Effort Amid Wealth Tax Debate

https://www.nytimes.com/2026/01/28/us/politics/california-billionaires-sergey-brin-campaign.html
1•mitchbob•10m ago•1 comments

Why California is keeping this unusual solar plant running

https://www.latimes.com/environment/story/2026-01-11/trump-biden-both-want-this-california-solar-...
1•PaulHoule•11m ago•0 comments

When Cloud Came to Stay at the Village Bed and Breakfast

https://www.robpanico.com/articles/display/?entry_short=when-cloud-came-to-stay-at-the-village-be...
1•retrocog•11m ago•1 comments

How Norway Accomplished a Near-Total EV Transition

https://spectrum.ieee.org/norway-ev-policy-electric-vehicles
1•pseudolus•12m ago•0 comments

CW/Morse Code Trainer Inspired by G4FON

https://github.com/jhnhnsn/headcopycw
1•jhnhnsn•12m ago•1 comments

How to Bring Back the American Dream

https://www.nytimes.com/2026/01/28/opinion/american-dream-poverty.html
1•mitchbob•13m ago•1 comments

Developmental convergence and divergence in human stem cell models of autism

https://www.nature.com/articles/s41586-025-10047-5
1•bookofjoe•14m ago•0 comments

Show HN: Built a way to validate ideas with AI personas and Simulated Community

https://www.nichesim.com/
1•justincxa•14m ago•0 comments

A fresh take on offline data collection

https://tommaso-girotto.co/blog/a-fresh-take-on-offline-data-collection
1•tgirotto•15m ago•0 comments

Types Of ML Jobs In 2026 [video]

https://www.youtube.com/watch?v=6tD07TvN73o
1•ssunboyy•17m ago•0 comments

One-Click Clawdbot/Moltbot on Security-Hardened DigitalOcean Droplets

https://www.digitalocean.com/blog/moltbot-on-digitalocean
3•makaimc•18m ago•0 comments

DanceJump for YouTube – Rhythm Dance Game – v0.3.3 Released for Edge

https://microsoftedge.microsoft.com/addons/detail/dancejump-for-youtube-r/kjcikodgaapodnjkhhmaobb...
1•maaydin•18m ago•1 comments

A practical primer on confidential computing

https://github.com/lunal-dev/home/tree/main/docs/confidential-computing-primer
12•grun•20m ago•0 comments

Codex Daily Benchmarks for Degradation Tracking (Marginlab.ai)

https://marginlab.ai/trackers/codex/
1•wendgeabos•21m ago•0 comments

XCCache: Faster Swift builds, less waiting

https://xccache.trinhngocthuyen.com
1•wahnfrieden•21m ago•0 comments

What I found reading Claude's leaked 57K-word system prompts

2•jbetala7•21m ago•3 comments

Show HN: KnowledgeForAI – remote MCP for various data sources

https://knowledgeforai.com/
1•winchester6788•22m ago•0 comments

Tell HN: Beeper deletes inactive accounts without notice

1•kldx•22m ago•0 comments

Patients Are Often More Honest with AI Than Clinicians [video]

https://www.youtube.com/watch?v=97HLETD7CGY
1•vitlyoshin•23m ago•1 comments