frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

A GPU-rendered terminal emulator with inline 3D graphics

https://github.com/orhun/ratty
1•LelouBil•45s ago•0 comments

Chitin-rich crab shell by-products modulate the marine lifetime of PHBV films

https://linkinghub.elsevier.com/retrieve/pii/S0141391026001564
1•PaulHoule•1m ago•0 comments

Shai Hulud attack ships signed malicious TanStack, Mistral NPM packages

https://www.bleepingcomputer.com/news/security/shai-hulud-attack-ships-signed-malicious-tanstack-...
1•Brajeshwar•2m ago•0 comments

Show HN: Tiny long-memory benchmark with Harbor running across Islo sandboxes

https://zozo123.github.io/longmem-mini-on-islo/
1•zozo123-IB•4m ago•0 comments

A Russian ship may have been carrying submarine nuclear reactors to North Korea

https://www.cnn.com/2026/05/12/world/a-russian-ship-sank-in-mysterious-circumstances-it-may-have-...
2•vinnyglennon•7m ago•0 comments

In Trump administration battle over AI, U.S. spy agencies seek more power

https://www.washingtonpost.com/politics/2026/05/11/trump-ai-regulation-commerce-intelligence/
1•delichon•8m ago•0 comments

Dusklight: A reverse-engineered reimplementation of Twilight Princess

https://github.com/TwilitRealm/dusk
1•klaussilveira•9m ago•0 comments

Show HN: Nimbalyst open source Obsidian, Codex app, and Linear for coding agents

https://github.com/nimbalyst/nimbalyst
3•wek•10m ago•0 comments

Dusk brings Twilight Princess to PC and mobile platforms

https://twilitrealm.dev/
1•transportheap•11m ago•0 comments

/Goal: The Six-Hour Codex Run That Survived a Five-Hour Pause

https://tectontide.com/en/blog/codex-goal-six-hour-run/
2•gmays•11m ago•0 comments

My offline AI-assisted Linux development machine

https://deepu.tech/my-fully-offline-ai-assisted-linux-development-machine/
1•deepu105•11m ago•0 comments

Down the memory lane with OS/2 (2020)

https://jmmv.dev/2020/08/os2-memory-lane.html
2•jmmv•11m ago•0 comments

Before you move to San Francisco, read this

https://junglegym.substack.com/p/before-you-move-to-san-francisco
4•ndewilde•12m ago•0 comments

Cameras Are Easy – Fingertips Aren't – Atoms to Algo

https://atomsfrontier.substack.com/p/cameras-are-easy-fingertips-arent
1•jpatel3•12m ago•0 comments

GameStop's $55.5B bid for eBay rejected as 'neither credible nor attractive'

https://www.theguardian.com/business/2026/may/12/gamestop-bid-for-ebay-rejected-as-neither-credib...
1•andsoitis•13m ago•0 comments

Show HN: Automatic Git snapshots while AI agents edit your repo

https://github.com/Baukaalm/safesandbox
1•baursha•15m ago•0 comments

World Passkey Day: Advancing passwordless authentication

https://www.microsoft.com/en-us/security/blog/2026/05/07/world-passkey-day-advancing-passwordless...
1•vdelitz•16m ago•1 comments

Snack giant makes mono(chrome) move due to Middle East conflict

https://www.printweek.com/content/news/snack-giant-makes-mono-move-due-to-middle-east-conflict
1•Kaibeezy•16m ago•0 comments

Why passkey sign-in success rate is much lower than many companies pretend

https://www.corbado.com/passkey-benchmark-2026/passkey-authentication-success-rate
2•vdelitz•17m ago•0 comments

AI coders are carrying half-open laptops through airports, offices, & ice rinks

https://www.businessinsider.com/coders-keep-laptops-open-in-public-ai-agent-2026-5
2•littlexsparkee•18m ago•0 comments

As agentic dev tools boom, workflow auditability becomes the constraint

https://thenewstack.io/agentic-cicd-audit-compliance-gap/
1•Brajeshwar•18m ago•0 comments

Quant Operator's Log

1•popcuptea•18m ago•0 comments

Six years of Planning Poker Online: from a 2020 Hacker News post to 147K users

https://weagileyou.com/blog/six-years-making-an-app-used-by-147000-people/
1•MiquelLHC•19m ago•0 comments

What could Functional Architecture mean? [video]

https://www.youtube.com/watch?v=x8jLOtZoOOU
1•surprisetalk•19m ago•0 comments

Yarbo says it will remove the intentional backdoor from its robot lawn mower

https://www.theverge.com/tech/928289/yarbo-remove-robot-lawn-mower-backdoor
1•SpyCoder77•19m ago•0 comments

Show HN: Browser-native replay of an AI agent governance incident

https://slashlife.ai/agent-gate/demo
1•ossughychen•20m ago•0 comments

Finlynq – Open-source personal finance app with a first-party MCP server

https://github.com/finlynq/finlynq
1•hhalawi•21m ago•0 comments

Has Merge Join's Time Gone?

https://buttondown.com/jaffray/archive/has-merge-joins-time-gone/
1•ibobev•22m ago•0 comments

Pediatricians say schools need to make time for recess for kids of all ages [video]

https://apnews.com/video/pediatricians-say-schools-need-to-make-time-for-recess-for-kids-of-all-a...
1•rawgabbit•22m ago•0 comments

Rendering the Sky, Sunsets, and Planets

https://blog.maximeheckel.com/posts/on-rendering-the-sky-sunsets-and-planets/
2•ibobev•22m ago•0 comments
Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•1y ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•1y ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•1y ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•1y ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.