frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

I automated opt-outs for 500 data broker sites (open source)

https://github.com/stephenlthorn/auto-identity-remove
195•stephenlthorn•2h ago•60 comments

The foundations of a provably secure operating system (PSOS) (1979) [pdf]

http://www.csl.sri.com/users/neumann/psos.pdf
72•rurban•4h ago•34 comments

Benedict Evans: AI eats the world (Spring 26) [pdf]

https://static1.squarespace.com/static/50363cf324ac8e905e7df861/t/6a0af5d0484fbf5fe9a7743e/177910...
51•topherjaynes•1h ago•18 comments

Linux security mailing list 'almost unmanageable'

https://www.theregister.com/security/2026/05/18/linus-torvalds-says-ai-powered-bug-hunters-have-m...
62•jonbaer•1h ago•16 comments

Math Jokes in Alice in Wonderland

https://storica.club/blog/alice-is-math-jokes/
46•yekbun123•4d ago•12 comments

GenCAD

https://gencad.github.io/
379•dagenix•16h ago•102 comments

Writing Z80 assembly, 4 decades later:-)

https://github.com/ttsiodras/3D-on-a-ZX-Spectrum-48K/
30•ttsiodras•1d ago•2 comments

'We mould trees to grow into the shape of chairs'

https://www.bbc.co.uk/news/articles/cvg0yy3gp71o
30•bauc•48m ago•10 comments

It is time to give up the dualism introduced by the debate on consciousness

https://www.noemamag.com/there-is-no-hard-problem-of-consciousness/
175•ahalbert4•10h ago•431 comments

Enough with the AI FOMO, go slow-mo, says Domo CDO

https://www.theregister.com/ai-ml/2026/05/17/enough-with-the-ai-fomo-go-slow-mo-says-domo-cdo/524...
37•Bender•59m ago•14 comments

Utah lawmakers form united front in push to ban prediction markets

https://www.theguardian.com/us-news/2026/may/18/you-can-bet-on-it-utah-lawmakers-form-united-fron...
47•thm•2h ago•17 comments

Crystals found inside wreckage from the first nuclear bomb test

https://www.scientificamerican.com/article/strange-crystals-found-inside-wreckage-from-the-first-...
128•jumploops•2d ago•49 comments

I turned a $80 RK3562 Android tablet into a Debian Linux workstation

https://github.com/tech4bot/rk3562deb
402•tech4bot•1d ago•196 comments

Ask an Astronaut: 333 hours of Q&A footage with astronauts

https://askanastronaut.issinrealtime.org/
170•gaws•2d ago•22 comments

Jank now has its own custom IR

https://jank-lang.org/blog/2026-05-08-optimization/
180•DASD•2d ago•36 comments

Hershey Bets on Agentic AI to Rethink $2B in Marketing Spend

https://www.adweek.com/brand-marketing/exclusive-hershey-bets-on-ai-agents-to-fix-its-2-billion-m...
12•mooreds•1h ago•21 comments

Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

https://github.com/MinishLab/semble
384•Bibabomas•22h ago•127 comments

Build a Radio Wave Detector with Balls of Aluminum Foil

https://www.wired.com/story/build-a-radio-wave-detector-with-balls-of-aluminum-foil/
33•Brajeshwar•2d ago•9 comments

Graphing Scientific Calculator Based on the ESP32

https://github.com/El-EnderJ/NeoCalculator
23•uticus•3d ago•6 comments

WriteUp: 16 Bytes of x86 that turn Matrix rain into sound

https://hellmood.111mb.de//wake_up_16b_writeup.html
168•HellMood•14h ago•31 comments

Prolog Coding Horror

https://www.metalevel.at/prolog/horror
167•RohanAdwankar•16h ago•65 comments

Where Are the Vibecoded Photoshops?

https://indiepixel.de/blog/posts/where-are-the-vibecoded-photoshops/
196•gizmo64k•4h ago•241 comments

Why is Google Maps back to showing old satellite images of Altadena?

https://www.reddit.com/r/pasadena/s/94BHlkE84r
64•tgrowazay•5h ago•29 comments

Profunctor Equipment in Haskell

https://bartoszmilewski.com/2026/05/16/profunctor-equipment-in-haskell/
33•g0xA52A2A•1d ago•10 comments

Eric Schmidt speech about AI booed during graduation

https://www.nbcnews.com/tech/tech-news/former-google-ceo-booed-graduation-speech-ai-rcna345585
238•nothrowaways•2h ago•215 comments

Magical Realism: “Northern Exposure” 25 Years Later (2015)

https://www.rogerebert.com/streaming/magical-realism-nothern-exposure-25-years-later
128•walterbell•2d ago•62 comments

Two EA-18 fighter jets collide at Mountain Home airshow, pilots ejected safely

https://idahonews.com/news/local/two-f-18-fighter-jets-have-crashed-during-an-airshow-at-mountain...
223•ChrisArchitect•16h ago•224 comments

A Good Lemma Is Worth a Thousand Theorems (2007)

https://sites.math.rutgers.edu/~zeilberg/Opinion82.html
75•susam•2d ago•13 comments

CUDA Books

https://github.com/alternbits/awesome-cuda-books
212•dariubs•1d ago•52 comments

kharp – k version 3 Language Interpreter in C#

https://github.com/ERufian/ksharp
45•tosh•1d ago•12 comments
Open in hackernews

LLMs as Unbiased Oracles

https://jazzberry.ai/blog/test-generation-as-the-foundation
34•MarcoDewey•1y ago

Comments

Jensson•1y ago
> An LLM, specifically trained for test generation, consumes this specification. Its objective is to generate a diverse and comprehensive test suite that probes the specified behavior from an external perspective.

If one of these tests are wrong though it will ruin the whole thing. And LLM are much more likely to make a math error (which would result in a faulty test) than to implement a math function the wrong way, so this probably wont make it better at generating code.

MarcoDewey•1y ago
I think this is a seriously excellent point.

The bet that I am making is that the system reduces its error rate by splitting a broad task into two more focused tasks.

However, it is possible that generating meaningful test cases is a harder problem (with a higher error rate) than producing code. If this is the case, then this idea I am presenting would compound the error rate.

satisfice•1y ago
If your premises and assumptions are sufficiently corrupted, you can come to any conclusion and believe you are being rational. Like those dreams where you walk around without pants on and you are more worried about not having pants than you are about how it could have come to be that your pants kept going missing. Your brain is not present enough to find the root of the problem.

An LLM is not unbiased, and you would know that if you tested LLMs.

Apart from biases, an LLM is not a reliable oracle, you would know that if you tested LLMs.

The reliabilities and unreliabilities of LLMs vary in discontinuous and unpredictable ways from task to task, model to model, and within the same model over time. You would know this if you tested LLMs. I have. Why haven’t you?

Ideas like this are promoted by people who don’t like testing, and don’t respect it. That explains why a concept like this is treated as equivalent to a tested fact. There is a name for it: wishful thinking.

walterbell•1y ago
> wishful thinking

Given the economic component of LLM wishes, we can look at prior instances of wishing-at-scale, https://en.wikipedia.org/wiki/Tulip_mania

troupo•1y ago
There's a more recent one: https://blog.mollywhite.net/blockchain/
roenxi•1y ago
Blockchains are past the gauntlet where they can be described as a mania, it is clear they are a permanent addition to the world of finance; probably as a multi-billion or -trillion dollar market cap asset class. If crypto was going to fail the interest rate rises would have done it by now.
troupo•1y ago
Tulips. You're describing tulips.
MarcoDewey•1y ago
I believe that I have unintentionally misled you. When I say "unbiased oracle" I am talking specifically about the test oracle being unbiased by how the software was implemented. ie. Black Box testing.

I don't think I made the point very clear in the blog (I will rectify that), but I am saying that because LLMs are so easily biased by their prompting that they sometimes perform better when doing black box testing tasks than they do when performing white box testing.

satisfice•1y ago
I appreciate that you replied. It warms my heart, frankly. It gives me hope.

I don't want to have a big argument about this right at this moment. But-- truly-- thank you for replying!

TazeTSchnitzel•1y ago
Is this a blogpost that's incomplete or a barely disguised ad?
saagarjha•1y ago
You'd think AI would have told them not to post it
mock-possum•1y ago
It’s hard to convince LLMs to be anything but supportive - lately I’ve been finding joy in reading its tone as patronizing.

“Exactly — that’s a very clean way to lay it out. You nailed it.”

brahyam•1y ago
The amount of time it would take to write the formal spec for the code I need is more than it would take to generate the code so doesn't sound like something that will go mainstream. Except for those industries where formal code specs are already in place.
MarcoDewey•1y ago
Yes, this test-driven approach will likely increase generation time upfront. However, the payoff is more reliable code being generated. This will lead to less debugging and fewer reprompts overall, which saves time in the long run.

Also agree on the specification formality. Even a less formal spec provides a clearer boundary for the LLM during code generation, which should improve code generation results.

bluefirebrand•1y ago
LLMs are absolutely biased

They are biased by the training dataset, which probably also reflects the biases of the people who select the training dataset

They are biased by the system prompts that are embedded into every request to keep them on the rails

They are even biased by the prompt that you write into them, which can lead them to incorrect conclusions if you design the prompt to lead them to it

I think it is a very careless mistake to think of LLMs as unbiased or neutral in any way

MarcoDewey•1y ago
You are correct that the notion of LLMs being completely unbiased or neutral does not make sense due to how they are trained. Perhaps my title is even misleading if taken at face value.

When I talk about "unbiased oracles" I am speaking in the context of black box testing. I'm not suggesting they are free from all forms of bias. Instead, the key distinction I'm trying to draw is their lack of implementation-level bias towards the specific code they are testing.

gwern•1y ago
LLMs are also heavily biased after chatbot tuning leads to mode-collapse. That's why you see the same verbal tics coming out of them, like the em-dashes or the 'twist ending' in the more recent 4os. And if LLMs really were unbiased, you'd expect better scaling when you tried to bruteforce code correctness. Training a 'test LLM' will just wind up inheriting a lot of the shared blindspots. They aren't independent of the implementation at all (just like humans are not independent, even when they didn't write the original, and didn't see it either; and this is why you can't simply throw _n_ programmers at a piece of code and be certain you got all the bugs, and why fuzzers will continue to rampage through code).
stuaxo•1y ago
The code correctness part is very true.

I don't mind LLMs as part of a journey on code, but it shouldn't be the end product.

I see something submitted by a colleague that doesn't fit the problem we have + tech well, go and ask an LLM and it outputs very similar code.

It's clear at that point that they submitted heavily LLMs produced code without giving it the work it needed.

Muromec•1y ago
This and state actors target ai crawlers specifically ti pouson llms with propaganda
ninetyninenine•1y ago
No this is just a very overly pedantic and technical way of looking at it.

First of all you'll note that all people are also biased by the Exact same reasoning. You know this. Everyone knows that all people are biased. This isn't something you don't know.

So if every single intelligence, human or not is biased. What is this article truly talking about? The article is basically saying LLMs are LESS biased then humans. Why are LLMs less biased then humans? Well maybe because the training set in an LLM is less biased then the training set given to a human. This makes sense right? A human will be made more biased by his individual experience and his parents biases while an LLM is literally inundated with as many sources of textual information as possible with no attempt at bias due to the sheer volume of knowledge they are trying to shove in there.

The article is basically referring to this.

But you will note interestingly that LLMs bias towards textual data more. They understand the world as if they have no eyes and ears and only text. So the way they think reflects this bias. But in terms of textual knowledge I think we can all agree, they are Less biased then humans.

Evidence: an LLM is not an atheist or a theist or an agnostic. But you, reader, are at the very least one of those three things.

neuroelectron•1y ago
Yeah that would be cool
MarcoDewey•1y ago
improving code generation would be awesome :)
neuroelectron•1y ago
Unfortunately, Microsoft/Google needs those models for themselves.
fallinditch•1y ago
I think it makes a lot of sense to employ various specialized LLMs in the software development lifecycle: one that's good at ideation and product development, one that fronts the organizational knowledge base, one for testing code, one (or more) for coding, etc, maybe even one whose job it is to always question your assumptions.
Mbwagava•1y ago
Unbiased seems like a pipe-dream. Unbiased between which perspectives? Would the set of perspectives chosen not be de-facto bias?
sega_sai•1y ago
I think the unbiasedness is completely red herring here, but do I agree with the point on focusing on the tests separately and implementations separately. Ideally you'd want two completely different LLMs work on both. But I think the question is, how trustworthy are the LLM tests ? Will the human review of these take more time than writing of the how code ? I think for non-critical applications, it probably does not matter, but in the end I think people will be looking for some guarantees or confidence that the errors happen with frequency less than X%. And I don't think those exist now. And given the models change so frequently it's also hard to be sure if something was working fine yesterday whether it'll be today.
MarcoDewey•1y ago
I believe that the unprecedented scale of LLM-generated code will demand a novel approach to software review and testing. Human review may not be able to keep up (or will it become the bottleneck?)