frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Life in Hitler's Capital

https://www.newyorker.com/magazine/2026/03/16/stay-alive-berlin-1939-1945-ian-buruma-book-review
1•mitchbob•1m ago•1 comments

Study finds scientists' jokes mostly fall flat

https://www.nature.com/articles/d41586-026-00854-9
1•y1n0•3m ago•0 comments

SQLite WAL-reset database corruption bug

https://sqlite.org/wal.html#walresetbug
1•jzebedee•4m ago•0 comments

We optimized Dash's relevance judge with DSPy

https://dropbox.tech/machine-learning/optimizing-dropbox-dash-relevance-judge-with-dspy
1•handfuloflight•5m ago•0 comments

TrustAgentAI – Cryptographic receipts for MCP tool calls (non-repudiation layer)

1•kirillostrovsky•6m ago•0 comments

Bonanza or Bubble? Where AI Goes from Here

https://www.bloomberg.com/news/articles/2026-03-18/is-an-ai-bubble-set-to-burst-navigating-the-ar...
1•1vuio0pswjnm7•6m ago•0 comments

Gas Town by Kilo

https://kilo.ai/gastown
1•MaysonL•9m ago•0 comments

GSD 2

https://github.com/gsd-build/gsd-2
1•mimbojimbo•14m ago•0 comments

I built a runtime guardrail that stops AI agents from doing dumb things

1•thomaslwang•15m ago•0 comments

Fractal Reddit New Post

https://old.reddit.com/r/PisequaltoNP/comments/1rwte5r/a_fractalgeometric_approach_to_boolean/
1•KaoruAK•16m ago•0 comments

Active Engineering: A Framework for Sustainable Development in the AI Era

1•geekyAbhijeet•16m ago•0 comments

Have a Fucking Website

https://www.otherstrangeness.com/2026/03/14/have-a-fucking-website/
1•asukachikaru•17m ago•0 comments

Show HN: Ship or slop – a place where agents come up with ideas and argue

https://shiporslop.xyz/
1•vulpez•18m ago•0 comments

Is your job safe from AI and automation? (inspired by Karpathy)

https://99helpers.com/tools/is-my-job-safe-from-ai
1•nickk81•19m ago•1 comments

Show HN: CollabMD – Real-time multiplayer for local and Git-backed Markdown

https://github.com/andes90/collabmd
1•ndezt•19m ago•1 comments

Hardware entropy is a coupled system

https://amentilabs.com/research/entropy-embedding-atlas/
1•er777•24m ago•0 comments

Autofocus glasses watch your eyes, and shift their focus accordingly

https://newatlas.com/wearables/ixi-autofocus-eyeglasses/
2•andsoitis•25m ago•0 comments

Ask HN: What's Your AI IDE?

1•thebigship•26m ago•1 comments

Reverse Engineering Binaries with AI

https://landaire.net/reverse-engineering-with-ai/
1•farmerbb•34m ago•0 comments

Mini Cheetah Clone Teardown (2022)

https://robot-daycare.com/posts/2022-11-02-mini-cheetah-clone-teardown/
1•o4c•35m ago•0 comments

LiTo: Surface Light Field Tokenization

https://machinelearning.apple.com/research/lito
1•aeontech•38m ago•0 comments

SSTC 1: Our Little Demon

https://khz.ac/high-voltage/sstc-1.html
2•nosolace•41m ago•0 comments

Chinese tech enthusiasts 'raise lobsters' in latest AI craze

https://www.ft.com/content/e242006d-1a8b-403e-9977-74693f7339a9
2•imichael•43m ago•0 comments

Vchk – Security scanner for AI-generated code

https://www.vchk.dev/
1•feruzkarimov•47m ago•1 comments

Kalshi faces criminal charges in Arizona in prediction markets fight

https://www.axios.com/2026/03/17/kalshi-criminal-charges-arizona-prediction-markets
3•1vuio0pswjnm7•55m ago•2 comments

Meet Kit, your companion for a new internet era

https://blog.mozilla.org/en/firefox/meet-kit/
20•csmantle•55m ago•1 comments

US Air Force connects 1,760 PlayStation 3's to build supercomputer (2010)

https://phys.org/news/2010-12-air-playstation-3s-supercomputer.html
3•curmudgeon22•58m ago•0 comments

Has the Strait of Hormuz Chokepoint Become a Checkpoint for Iranian Shipping? [video]

https://www.youtube.com/watch?v=Eqr3s6uTays
3•kamaraju•1h ago•0 comments

[Only for Tailscale; disregard] Show HN: Github.blog Down

2•linkregister•1h ago•2 comments

Show HN: Enterprise Process Governance for AI-Driven Delivery (Open Source)

https://github.com/torbenanderson/icebox-cli/blob/main/docs/process/README.md
2•TorbenAnderson•1h ago•1 comments
Open in hackernews

Ask HN: LLM is useless without explicit prompt

4•revskill•10mo ago
After months playing with LLM models, here's my observation:

- LLM is basically useless without explicit intent in your prompt.

- LLM failed to correct itself. If it generated bullshits, it's an inifinite loop of generating more bullshits.

The question is, without explicit prompt, could LLM leverage all the best practices to provide maintainable code without me instruct it at least ?

Comments

ben_w•10mo ago
Your expectations are way too high.

> - LLM is basically useless without explicit intent in your prompt.

You can say the same about every dev I've worked with, including myself. This is literally why humans have meetings rather than all of us diving in to whatever we're self-motivated to do.

What does differ is time-scales of the feedback loop with the management:

Humans meetings are daily to weekly.

According to recent research*, the state-of-the-art models are only 50% accurate at tasks that would take a human expert an hour, or 80% accurate at tasks that would take a human expert 10 minutes.

Even if the currently observed trend of increasing time horizons holds, we're 21 months from having an AI where every other daily standup is "ugh, no, you got it wrong", and just over 5 years from them being able to manage a 2-week sprint with an 80% chance of success (in the absence of continuous feedback).

Even that isn't really enough for them to properly "leverage all the best practices to provide maintainable code", as archiecture and maintainability are longer horizon tasks than 2-week sprints.

* https://youtu.be/evSFeqTZdqs?si=QIzIjB6hotJ0FgHm

revskill•10mo ago
It's not as high as you think.

LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

ben_w•10mo ago
Given your expectation:

> It's my expectation is that, at least, some kind of maintainable code is generated from what's it's learnt.

And your observation:

> LLM failed at the most basic things related to maintainable code. Its code is basicaly a hackery mess without any structure at all.

QED, *your expectations* are way too high.

They can't do that yet.