frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Midjourney Medical

https://www.midjourney.com/medical/blogpost
484•ricochet11•4h ago•348 comments

I Hate Compilers

https://xeiaso.net/notes/2026/anubis-wasm-vendor-binary/
41•xena•1h ago•30 comments

Local Qwen isn't a worse Opus, it's a different tool

https://blog.alexellis.io/local-ai-is-not-opus/
110•alphabettsy•3h ago•36 comments

Lore – Open source version control system designed for scalability

https://lore.org/
1084•regnerba•16h ago•577 comments

Apple boss Tim Cook says prices to rise due to memory chip costs

https://www.bbc.com/news/articles/c3wyxvqdx1zo
35•ilreb•3h ago•19 comments

US holds off blacklisting DeepSeek, more than 100 firms deemed security risks

https://www.reuters.com/world/china/us-holds-off-blacklisting-chinas-deepseek-more-than-100-firms...
427•giuliomagnifico•1d ago•479 comments

Taxonomy of the Occlupanida (parasitoids on bread bag tags)

https://www.horg.com/horg/?page_id=921
125•beatthatflight•7h ago•24 comments

Nim Conf 2026 (Online, Sat June 20)

https://conf.nim-lang.org/
23•pietroppeter•2h ago•2 comments

Storied Colors – a catalogue of named colors

https://storiedcolors.com/
140•susiecambria•8h ago•29 comments

[x86] AI Compute Extensions (ACE) Specification

https://x86ecosystem.org/resource/ai-compute-extensions-ace-specification/
27•matt_d•4h ago•13 comments

Clojure Hosted on Go

https://github.com/glojurelang/glojure
98•dnlo•7h ago•13 comments

Loreline – Tools for writing interactive fiction

https://loreline.app/en/
121•smartmic•10h ago•16 comments

Launch HN: Adam (YC W25) – Open-Source AI CAD

https://github.com/Adam-CAD/CADAM
174•zachdive•14h ago•84 comments

How Madrid built its metro cheaply (2024)

https://worksinprogress.co/issue/how-madrid-built-its-metro-cheaply/
90•trymas•10h ago•44 comments

SteamOS Linux 3.8 released as stable

https://store.steampowered.com/news/app/1675200/view/697641379212298072
74•jrepinc•2h ago•3 comments

Show HN: We built an 8-bit CPU as 2nd year EE students

https://github.com/c0rRupT9/STEPLA-1
60•CorRupT9•2d ago•11 comments

How we run Firecracker VMs inside EC2 and start browsers in less than 1s

https://browser-use.com/posts/firecracker-browser-infra
253•gregpr07•1d ago•165 comments

RFC 10008: The new HTTP Query Method

https://www.rfc-editor.org/info/rfc10008/
353•schappim•19h ago•150 comments

Biological evolution and information acquisition

https://www.construction-physics.com/p/biological-evolution-and-information
35•chmaynard•6d ago•3 comments

Why thinking out loud with someone beats thinking alone

https://www.thesignalist.io/s/the-dialogue-dividend/
229•kodesko•17h ago•101 comments

Show HN: An 8-bit live gamecast for baseball

https://ribbie.tv/watch
224•brownrout•13h ago•121 comments

Show HN: Spin Lab

https://srijanshukla.com/artifacts/spin-lab/
29•srijanshukla18•1d ago•11 comments

Magic Buffers and io_uring Registered Buffers

https://www.mindfruit.co.uk/posts/2025/10/magic-buffers-and-io-uring-write-fixed/
14•tosh•2d ago•2 comments

Volkswagen started blocking GrapheneOS users

https://discuss.grapheneos.org/d/35949-volkswagen-app?page=3
607•microtonal•15h ago•370 comments

Tesco moving 40k server workloads off VMware amid Broadcom's abusive conduct

https://arstechnica.com/information-technology/2026/06/tesco-moving-40000-server-workloads-off-vm...
267•Bender•9h ago•149 comments

Sodium Bicarbonate for In-Hospital Cardiac Arrest – A Randomized Clinical Trial

https://jamanetwork.com/journals/jama/fullarticle/2850405
9•bookofjoe•5d ago•3 comments

Show HN: Inkwash, a watercolor sketching app and explanation

https://johnowhitaker.github.io/inkwash/about
208•Yenrabbit•4d ago•22 comments

MicroUI – A tiny, portable, immediate-mode UI library written in ANSI C

https://github.com/rxi/microui
224•peter_d_sherman•18h ago•74 comments

GLM-5.2 is the new leading open weights model on Artificial Analysis

https://artificialanalysis.ai/articles/glm-5-2-is-the-new-leading-open-weights-model-on-the-artif...
836•himata4113•21h ago•402 comments

U.S. science is in chaos

https://www.scientificamerican.com/article/americas-compact-between-science-and-politics-is-broken/
793•presspot•20h ago•942 comments
Open in hackernews

I Hate Compilers

https://xeiaso.net/notes/2026/anubis-wasm-vendor-binary/
39•xena•1h ago

Comments

charcircuit•59m ago
As long as the program is equivalent there isn't an actual problem here. Requiring the output to always be the same is an arbitrary restriction.

If you want to have users trust that someone else hasn't modified it, then sign it with your identity.

dyauspitr•55m ago
LLMs should be trained on and directly output binary.
klodolph•51m ago
On the off chance that you’re serious, that would result in disastrously bad output. The difference between “jmp $+15” and “jmp $+16” is inscrutable and the LLM would not be able to pick the right one without tooling.

That tooling is a compiler. The higher level, the better chance the LLM can be steered to good output. Machine code is hopeless, don’t bother.

faangguyindia•48m ago
What about AOT optimization? whuch brings aot closer to JITs performance? Isn't that something LLM + Harness can easily do?
klodolph•24m ago
I think the idea that AOT is inherently faster than JIT, or vice versa, is a thoroughly debunked idea.

You can have LLMs help you optimize code but I don’t think you can do this unattended for non-trivial code.

jenadine•42m ago
> The difference between “jmp $+15” and “jmp $+16” is inscrutable

I don't see why that's the case. LLM trained on binary would totally see it, not?

Also the tool can also be running the test and a debugger.

klodolph•28m ago
> I don't see why that's the case. LLM trained on binary would totally see it, not?

It would not. You find the correct version by counting the number of bytes to the destination. LLMs are famously bad at this kind of problem (counting).

> Also the tool can also be running the test and a debugger.

The test needs to provide a good amount of signal. That’s too hard if you are throwing machine code at the wall.

In order for debuggers to work, you need some kind of model that describes what the code should do and what state the computer should be in after each instruction. That model is high-level code.

I can understand the intuitive appeal of training LLMs with machine code, but all of my experience with LLMs suggest that they are incredibly ill-suited to the task, and we just don’t have the capacity to train them to make useful machine code.

zx8080•14m ago
Can "LLMs are bad at counting" be generalized to "LLM are better in complex stuff but make more mistakes in simple"?
klodolph•5m ago
No, I don’t think so. LLMs are good at a lot of simple tasks, but bad at certain simple tasks. Moravec’s paradox in a new iteration.

It applies to humans too. Calculus is “simple” but it takes something like sixteen years to train a human to do it, if all goes well. Meanwhile, most humans think that inverse kinematics is, like, the easiest thing in the world (it’s a super complicated task).

pjmlp•41m ago
That compiler does wonders with languages that have UB on their specs, especially when having optimizations passes with heuristics.

Also there are dynamic compilers were the shape of machine code changes as the code executes, and each single execution will certainly generate different sequences, depending on the program execution and where it is running.

Deterministic JIT compiler code generation, at least on optimising ones, is not a solved problem.

xiaoyu2006•45m ago
It should not. Abstraction in software engineering brings intelligence. (compression correlates to intelligence)
dyauspitr•39m ago
Why? I mean this is all emergent, right? And it’s not like humans ever work at this level. It would be very interesting to see what sort of outputs and abstractions an LLM comes up with.
shshshjaja•36m ago
runApp()

Done! Excellent abstraction. High intelligence.

frwrfwrfeefwf•30m ago
people don't get this
rvz•44m ago
I think you forgot the "/s"
wahnfrieden•35m ago
What other recent Musk talking points do you like to repeat?
bandrami•26m ago
Generative algorithms have been studied for decades now and while they have led to some interesting results they're a bad fit for LLMs because there's no such thing as a "plausible" binary: a small perturbation yields an unusable result.
fulafel•14m ago
Technically they are, just a subset. But still a practical one, they're frequently used to produce executable files.
mathisfun123•53m ago
> lol you'd think, but no, it's not. In theory it is (and for small scale compilers it definitely is), but in practice compilers are strange and complicated beasts containing multitudes that no mere mortal can fully comprehend on their own.

Umm there are like 60,000 lit tests checked into LLVM that verify that the output is absolutely a deterministic function of the input of at least that compiler.

> Even though the source code had the same bytes, the output of the compiler was wildly different.

This is the goofiest I've seen written unironically in quite a long - the C preprocessor is not part of the compiler. The pre in preprocessor should probably give it away.

Just a tip: you should probably actually understand something before you decide you hate it.

xena•49m ago
Tell me the flags I'm missing then: https://github.com/TecharoHQ/anubis/blob/Xe/wasm4/utils/wasm...
heavensteeth•33m ago
I'm surprised by the amount of heckling this post received almost immediately! And a lack of constructive input.

I for one enjoyed the article and understand what you're getting at.

LPisGood•27m ago
Re: source code producing different binaries: things like ASLR, stack canaries, optimization levels, linking, etc all lead to different binaries.
ComputerGuru•38m ago
These seem very reasonable, the workarounds used are natural, and overall the article is not at all congruous with the conclusion in the (clickbait?) title?

Compilers literally made your project possible!

jdw64•33m ago
Reading this, I think low level engineering is actually more dependent on specific environments. Hardware also has its own points of change. Usually, when you think at a high level, environmental changes are less significant than you might expect. But low level thinking tends to be tied to specific environments, which is what makes it difficult. The reason low level is hard is that even if the code itself is short, the hidden assumptions inside it are difficult and place a heavy cognitive load on the programmer. For example, even a short snippet in C like `int value = (int)buffer` requires a lot of implicit knowledge about the 4 byte alignment of the buffer, or whether int is exactly 32 bits. LLMs do not seem to be very good at knowing these things. Rather, they are strong at high level wrapping, but at the low level, they seem surprisingly difficult and somewhat useless. Hardware has CPU generation changes, and in the case of PLCs, where I mainly work, the protocol differences between vendors are far too severe. There does not seem to be any technology with a very long lifecycle.
pertymcpert•33m ago
If Clang generated non-deterministic output due to pointer addresses then that's a bug (happens regularly) that should be fixed. The most common way this happens if it some code path is iterating over a DenseMap which is non-deterministic. Sometimes that's fine and sometimes that's not depending on how that map is used. The common way to fix that is to switch to a MapVector which pays some additional runtime/memory cost to guarantee deterministic iteration order.
xena•26m ago
I'll try and make a minimal reproduction case and file a bug. Do you know if any tooling that can take a binary and fuzz it down to a minimal reproduction set?
biglost•24m ago
Time date env variables and random address... Is also input data, maybe not as a flag but still
RyanSquared•9m ago
Time and date are... tolerable. There's SOURCE_DATE_EPOCH which should always be set to whack it into submission when used. ASLR of the _compiler being invoked_ resulting in a difference in the _program being compiled_ is nuts and would break any self-hosting compiler with consistency checks.