frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Deterministic Programming with LLMs

https://www.mcherm.com/deterministic-programming-with-llms.html
17•todsacerdoti•3d ago

Comments

andyfilms1•1h ago
At what point does this just wrap all the way back around to being genetic algorithms?

I'm also reminded of the old software called Formulize, which could take in a set of arbitrary data and find a function that described it. http://nutonian.wikidot.com/

xyzzy_plugh•38m ago
If you extend this line of thinking a lot, given we traditionally author the software, everything kind of boils down to a genetic algorithm.
4b11b4•1h ago
soon
dataviz1000•48m ago
> The Solution is Code-Checking Code

I'm finding code falls into two categories. Code that produces known results and code that produces results that are not known. For example, creating a table with a pagination component with a backend that loads the first 30 rows ordered by date descending from the database on page 1 and the second set of 30 rows on page 2. We know what the code is supposed to output, we know what the result looks like. On the other hand, there is code that does statistical analysis on the 30 rows of data. This is different because we don't know what the result is.

The known result code is easy to use an LLM with. I have a skill that will iterate with an OODA loop — observe, act, and validate. It will in the validate step take screenshots and even without telling it, it will query the database from the CLI, compare the rendered row data to the database data. It will more surprisingly make sure that all the components are responsive and render beautifully on mobile. I'm orders of magnitude past linting here which is solved with Biome.

The statistical analysis is different. The only way I can know for sure of the result is by writing the code painstakingly by hand. The LLM will always produce specious lies. It will fabricate and show me what I want to see, not the truth. This is because until it is written manually by hand, there is no ground truth. In this case, there is no code checking code.

nemo1618•34m ago
> But like humans — and unlike computer programs — they do not produce the exact same results every time they are used. This is fundamental to the way that LLMs operate: based on the "weights" derived from their training data, they calculate the likelihood of possible next words to output, then randomly select one (in proportion to its likelihood).

This is emphatically not fundamental to LLMs! Yes, the next token is selected randomly; but "randomly" could mean "chosen using an RNG with a fixed seed." Indeed, many APIs used to support a "temperature" parameter that, when set to 0, would result in fully deterministic output. These parameters were slowly removed or made non-functional, though, and the reason has never been entirely clear to me. My current guess is that it is some combination of A) 99% of users don't care, B) perfect determinism would require not just a seeded RNG, but also fixing a bunch of data races that are currently benign, and C) deterministic output might be exploitable in undesirable ways, or lead to bad PR somehow.

pavpanchekha•21m ago
Deterministic output is incompatible with batching, which in turn is critical to high utilization on GPUs, which in turn is necessary to keep costs low.
computersuck•5m ago
this is a long article that doesn't say much at all. likely generated by AI?

it goes on for ages just to reach the point of "write the tests first"

We do not think Anthropic should be designated as a supply chain risk

https://twitter.com/OpenAI/status/2027846016423321831
222•golfer•4h ago•89 comments

The Windows 95 user interface: A case study in usability engineering (1996)

https://dl.acm.org/doi/fullHtml/10.1145/238386.238611
131•ksec•3h ago•68 comments

MinIO Is Dead, Long Live MinIO

https://blog.vonng.com/en/db/minio-resurrect/
200•zufallsheld•4h ago•72 comments

Obsidian Sync now has a headless client

https://help.obsidian.md/sync/headless
385•adilmoujahid•9h ago•136 comments

The happiest I've ever been

https://ben-mini.com/2026/the-happiest-ive-ever-been
319•bewal416•2d ago•146 comments

Show HN: Xmloxide – an agent made rust replacement for libxml2

https://github.com/jonwiggins/xmloxide
20•jawiggins•2h ago•8 comments

Block the “Upgrade to Tahoe” Alerts

https://robservatory.com/block-the-upgrade-to-tahoe-alerts-and-system-settings-indicator/
128•todsacerdoti•6h ago•57 comments

Addressing Antigravity Bans and Reinstating Access

https://github.com/google-gemini/gemini-cli/discussions/20632
199•RyanShook•12h ago•171 comments

Building a Minimal Transformer for 10-digit Addition

https://alexlitzenberger.com/blog/post.html?post=/building_a_minimal_transformer_for_10_digit_add...
33•kelseyfrog•3h ago•4 comments

Woxi: Wolfram Mathematica Reimplementation in Rust

https://github.com/ad-si/Woxi
244•adamnemecek•3d ago•103 comments

Verified Spec-Driven Development (VSDD)

https://gist.github.com/dollspace-gay/d8d3bc3ecf4188df049d7a4726bb2a00
144•todsacerdoti•8h ago•69 comments

H-Bomb: A Frank Lloyd Wright Typographic Mystery

https://www.inconspicuous.info/p/h-bomb-a-frank-lloyd-wright-typographic
5•mrngm•2d ago•1 comments

Qwen3.5 122B and 35B models offer Sonnet 4.5 performance on local computers

https://venturebeat.com/technology/alibabas-new-open-source-qwen3-5-medium-models-offer-sonnet-4-...
213•lostmsu•5h ago•135 comments

Deterministic Programming with LLMs

https://www.mcherm.com/deterministic-programming-with-llms.html
17•todsacerdoti•3d ago•8 comments

Show HN: Now I Get It – Translate scientific papers into interactive webpages

https://nowigetit.us
185•jbdamask•12h ago•99 comments

Werner Herzog Between Fact and Fiction

https://www.thenation.com/article/culture/werner-herzog-future-truth/
63•Hooke•1d ago•14 comments

New evidence that Cantor plagiarized Dedekind?

https://www.quantamagazine.org/the-man-who-stole-infinity-20260225/
108•rbanffy•3d ago•69 comments

The whole thing was a scam

https://garymarcus.substack.com/p/the-whole-thing-was-scam
601•guilamu•9h ago•163 comments

MCP server that reduces Claude Code context consumption by 98%

https://mksg.lu/blog/context-mode
228•mksglu•15h ago•55 comments

The archivist preserving decaying floppy disks

https://www.popsci.com/technology/floppy-disk-archivist-project/
46•Brajeshwar•3d ago•4 comments

747s and Coding Agents

https://carlkolon.com/2026/02/27/engineering-747-coding-agents/
125•cckolon•1d ago•58 comments

Ghosts'n Goblins – “Worse danger is ahead”

https://superchartisland.com/ghostsn-goblins/
62•elvis70•3d ago•24 comments

From Noise to Image – interactive guide to diffusion

https://lighthousesoftware.co.uk/projects/from-noise-to-image/
104•simedw•2d ago•15 comments

Pentagon chief blocks officers from Ivy League schools and top universities

https://fortune.com/2026/02/28/pentagon-officer-education-ivy-league-schools-universities-partner...
50•geox•2h ago•17 comments

The Eternal Promise: A History of Attempts to Eliminate Programmers

https://www.ivanturkovic.com/2026/01/22/history-software-simplification-cobol-ai-hype/
232•dinvlad•3d ago•165 comments

Unsloth Dynamic 2.0 GGUFs

https://unsloth.ai/docs/basics/unsloth-dynamic-2.0-ggufs
197•tosh•16h ago•52 comments

Our Agreement with the Department of War

https://openai.com/index/our-agreement-with-the-department-of-war
212•surprisetalk•5h ago•190 comments

What I learned while trying to build a production-ready nearest neighbor system

https://github.com/thatipamula-jashwanth/smart-knn
18•Jashwanth01•3d ago•10 comments

The United States and Israel have launched a major attack on Iran

https://www.cnn.com/2026/02/28/middleeast/israel-attack-iran-intl-hnk
1045•lavp•19h ago•2277 comments

The Future of AI

https://lucijagregov.com/2026/02/26/the-future-of-ai/
117•BerislavLopac•15h ago•94 comments