frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Taming LLMs: Using Executable Oracles to Prevent Bad Code

https://john.regehr.org/writing/zero_dof_programming.html
18•mad44•3h ago

Comments

dktoao•53m ago
"Our goal should be to give an LLM coding agent zero degrees of freedom"

Wouldn't that just be called inventing a new language with all the overhead of the languages we already have? Are we getting to the point where getting LLMs to be productive and also write good code is going to require so much overhead and additional procedures and tools that we might as well write the code ourselves. Hmmm...

seanw444•20m ago
Yeah, precision LLM coding is kind of an oxymoron. English language -> codebase is essentially lossily-compressed logic by definition. The less lossy the compression becomes, the more you probably approach re-inventing programming languages. Which then means that in order to use LLMs to code, you're accepting some degree of imprecision.
virgilp•14m ago
Actually, no. We always needed good checks - that's why you have techniques like automated canary analysis, extensive testing, checking for coverage - these are forms of "executable oracles". If you wanted to be able to do continuous deployment - you had to be very thorough in your validation.

LLMs just take this to the extreme. You can no longer rely on human code reviews (well you can but you give away all the LLM advantages) so then if you take out "human judgement" *from validation*[1], you have to resort to very sophisticated automated validation. This is it - it's not about "inventing a new language", it's about being much more thorough (and innovative, and efficient) in the validation process.

[1] never from design, or specification - you shouldn't outsource that to AI, I don't think we're close to an AI that can do that even moderately effective without human help.

voxaai•29m ago
ran into this with creative generation. for code, formal constraints work great. but when the quality criteria cant be typed (feels right for this audience, sounds like infrastructure not a toy) constraints made things worse. what worked was competing generators with different objectives, then rank against the brief. the variance from competition was more useful than the precision from constraints.

We haven't seen the worst of what gambling and prediction markets will do

https://www.derekthompson.org/p/we-havent-seen-the-worst-of-what
199•mmcclure•1h ago•104 comments

CERN to host Europe's flagship open access publishing platform

https://home.cern/news/news/cern/cern-host-europes-flagship-open-access-publishing-platform
77•JohnHammersley•1h ago•6 comments

Why so many control rooms were seafoam green (2025)

https://bethmathews.substack.com/p/why-so-many-control-rooms-were-seafoam
336•Amorymeltzer•1d ago•57 comments

John Bradley, author of xv, has passed away

https://voxday.net/2026/03/25/rip-john-bradley/
105•linsomniac•2h ago•39 comments

My minute-by-minute response to the LiteLLM malware attack

https://futuresearch.ai/blog/litellm-attack-transcript/
211•Fibonar•5h ago•97 comments

Doom entirely from DNS records

https://github.com/resumex/doom-over-dns
120•Venn1•3d ago•29 comments

How much precision can you squeeze out of a table?

https://www.johndcook.com/blog/2026/03/26/table-precision/
19•nomemory•1h ago•2 comments

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

https://github.com/russellromney/turbolite
53•russellthehippo•1h ago•14 comments

Colibri – chat platform built on the AT Protocol for communities big and small

https://colibri.social/
84•todotask2•3h ago•38 comments

Fermented foods shaped human biology

https://press.asimov.com/articles/culture-shift
62•mailyk•6d ago•29 comments

Moving from GitHub to Codeberg, for lazy people

https://unterwaditzer.net/2025/codeberg.html
443•jslakro•7h ago•228 comments

OpenTelemetry profiles enters public alpha

https://opentelemetry.io/blog/2026/profiles-alpha/
107•tanelpoder•4h ago•12 comments

HyperAgents: Self-referential self-improving agents

https://github.com/facebookresearch/hyperagents
72•andyg_blog•2d ago•25 comments

Personal Encyclopedias

https://whoami.wiki/blog/personal-encyclopedias
754•jrmyphlmn•1d ago•152 comments

Stripe Projects: Provision and manage services from the CLI

https://projects.dev/
75•piinbinary•4h ago•18 comments

From zero to a RAG system: successes and failures

https://en.andros.dev/blog/aa31d744/from-zero-to-a-rag-system-successes-and-failures/
247•andros•2d ago•77 comments

Building a Blog with Elixir and Phoenix

https://jola.dev/posts/building-a-blog-with-elixir-and-phoenix
52•shintoist•3h ago•3 comments

Fast regex search: indexing text for agent tools

https://cursor.com/blog/fast-regex-search
5•jxmorris12•2d ago•0 comments

Running Tesla Model 3's computer on my desk using parts from crashed cars

https://bugs.xdavidhu.me/tesla/2026/03/23/running-tesla-model-3s-computer-on-my-desk-using-parts-...
825•driesdep•23h ago•288 comments

My home network observes bedtime with OpenBSD and pf

https://ratfactor.com/openbsd/pf-gateway-bedtime
87•ibobev•3d ago•27 comments

Taming LLMs: Using Executable Oracles to Prevent Bad Code

https://john.regehr.org/writing/zero_dof_programming.html
18•mad44•3h ago•4 comments

End of "Chat Control": EU parliament stops mass surveillance

https://www.patrick-breyer.de/en/end-of-chat-control-eu-parliament-stops-mass-surveillance-in-vot...
473•amarcheschi•8h ago•245 comments

The Oxford Comma – Why and Why Not

https://www.deborahcourtbooks.com/post/the-oxford-comma-why-and-why-not
19•taubek•3h ago•25 comments

Interoperability Can Save the Open Web (2023)

https://spectrum.ieee.org/doctorow-interoperability
154•janandonly•6h ago•47 comments

Obsolete Sounds

https://citiesandmemory.com/obsolete-sounds/
200•benbreen•16h ago•35 comments

Light on Glass: Why do you start making a game engine?

https://analogdreamdev.substack.com/p/light-on-glass
38•atan2•3d ago•22 comments

Olympic Committee bars transgender athletes from women’s events

https://www.nytimes.com/2026/03/26/world/olympics/ioc-transgender-athletes-ban.html
146•RestlessMind•6h ago•322 comments

Shell Tricks That Make Life Easier (and Save Your Sanity)

https://blog.hofstede.it/shell-tricks-that-actually-make-life-easier-and-save-your-sanity/
456•zdw•20h ago•219 comments

Show HN: Orloj – agent infrastructure as code (YAML and GitOps)

https://github.com/OrlojHQ/orloj
13•An0n_Jon•15h ago•9 comments

New York City hospitals drop Palantir as controversial AI firm expands in UK

https://www.theguardian.com/technology/2026/mar/26/new-york-hospitals-palantir-ai
7•chrisjj•17m ago•0 comments