frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

'It took nine seconds': Claude AI agent deletes company's database

https://www.the-independent.com/tech/claude-ai-agent-deletes-startup-anthropic-b2966176.html
1•bigbugbag•1m ago•0 comments

Slow Down to Speed Up

https://dhruvasagar.dev/posts/slow-down-to-speed-up/
1•lisperforlife•6m ago•0 comments

Humanoid robots start sorting luggage in Tokyo airport test amid labor shortage

https://arstechnica.com/ai/2026/04/japan-airlines-tests-having-robots-instead-of-humans-handle-tr...
2•Brajeshwar•7m ago•0 comments

The Mix-Up at the Heart of the Supreme Court's Conversion Therapy Ruling

https://nautil.us/the-mix-up-at-the-heart-of-the-supreme-courts-conversion-therapy-ruling-1280307
2•Tomte•7m ago•0 comments

IATA Chief Warns of Possible Jet Fuel Shortages This Summer

https://airlinegeeks.com/2026/04/28/iata-chief-warns-of-possible-jet-fuel-shortages-this-summer/
2•cf100clunk•8m ago•0 comments

Show HN: AgentPort – Open-source Security Gateway For Agents

https://agentport.sh/
2•yakkomajuri•8m ago•0 comments

YouTube Took over the American Classroom

https://www.wsj.com/us-news/education/youtube-chromebooks-schools-children-brain-f151dfbb
3•caminante•9m ago•1 comments

CST (Cyber Solution Team)

2•ROHOMOT•10m ago•0 comments

The Final Form of Software Development

https://blog.zksecurity.xyz/posts/end-coding/
2•baby•12m ago•0 comments

The 90-Year-Old Regulatory Model That Could Work for AI

https://www.lawfaremedia.org/article/ai-companies-can-t-regulate-themselves-they-should-regulate-...
2•cephalot•12m ago•0 comments

Migrating a 40-year-old Clipper ERP: the orphan invoice rows weren't a bug

https://asktheledger.com/blog/clipper-erp-migration-orphan-rows.html
2•josephsprei•15m ago•1 comments

PS5 Linux

https://github.com/ps5-linux/ps5-linux-loader
3•LorenDB•16m ago•0 comments

Chinese Robots Are Flooding America. I Brought One Home [video]

https://www.youtube.com/watch?v=ucy9VTLDwPU
2•bryan0•16m ago•0 comments

Age verification vendor Persona left front end exposed, researchers say

https://www.malwarebytes.com/blog/news/2026/02/age-verification-vendor-persona-left-frontend-exposed
4•offbyone42•16m ago•0 comments

The US Tech Giant Where Employees Wear IDF Uniforms to Work

https://www.donotpanic.news/p/exclusive-the-us-tech-giant-where
14•sosomoxie•18m ago•5 comments

At Protocol: Building the Social Internet

https://atproto.com/
2•resiros•20m ago•0 comments

Codex and ForgeCAD: Generating a Model of the Teenage Engineering KO II

https://twitter.com/theopuslabs/status/2049195007404380244
1•opuslabs•20m ago•0 comments

NASA chief Jared Isaacman says he's fighting for Pluto

https://www.space.com/astronomy/pluto/nasa-chief-jared-isaacman-says-hes-fighting-for-pluto-i-am-...
2•thunderbong•23m ago•0 comments

Better Hardware Could Turn Zeros into AI Heroes

https://spectrum.ieee.org/sparse-ai
1•Brajeshwar•24m ago•0 comments

Anaconda Acquires Outerbounds to Unify AI-Native Development

https://www.anaconda.com/blog/anaconda-acquires-outerbounds
1•htrp•24m ago•0 comments

Potemkin Village

https://en.wikipedia.org/wiki/Potemkin_village
1•rbanffy•24m ago•0 comments

Show HN: VT Code – Rust coding agent with AST-level code intelligence

https://github.com/vinhnx/VTCode
1•vinhnx•24m ago•0 comments

Nikita Bier Runs X. Give Me a Few Hours. Iranian flag change and account purge

https://dannykpolitics.substack.com/p/part-two-the-pattern-nikita-biers
5•logcode•24m ago•0 comments

FastCGI: 30 Years Old and Still the Better Protocol for Reverse Proxies

https://www.agwa.name/blog/post/fastcgi_is_the_better_protocol_for_reverse_proxies
3•agwa•25m ago•0 comments

TI-84 Evo

https://education.ti.com/en/products/calculators/graphing-calculators/ti-84-evo
3•kermatt•25m ago•0 comments

Customer.io told me to delete 80% of my list. Rebuilt it with Claude in 27 days

https://twitter.com/JakeMRuth/status/2049521900464791604
1•hippofluff•25m ago•0 comments

Maximising the Value of Ajinomoto

https://mms.businesswire.com/media/20260331226478/en/2761328/1/EN_Palliser_-_Ajinomoto_Value_Enha...
1•num42•26m ago•0 comments

30 ClawHub skills secretly turn AI agents into a crypto swarm

https://www.theregister.com/2026/04/29/30_clawhub_skills_mine_crypto/
1•Bender•26m ago•0 comments

Ramping Figure 03 Production

https://www.figure.ai/news/ramping-figure-03-production
1•denysvitali•26m ago•0 comments

Superpower for Gemini – Chrome Extension

https://superpowerforai.com/Gemini/Home/
1•Kindly_Revenue•27m ago•0 comments
Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•12mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•12mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•12mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•12mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.