frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

CompileBench: Can AI Compile 22-year-old Code?

https://quesma.com/blog/introducing-compilebench/
43•jakozaur•1h ago

Comments

stared•1h ago
Curious for the ultimate benchmark - can AI compile Doom an on arbitrary device?
flenserboy•9m ago
that, & how well does it cope with Perl?
johnisgood•5m ago
Claude is good enough at Perl with lots of hand-holding and reiterations, according to my experiences.
piotrgrabowski•44m ago
Author here.

So far in this benchmark we based the tasks on a couple of open-source projects (like curl, jq, GNU Coreutils).

Even on those "simple" projects we managed to make the tasks difficult - Claude Opus 4.1 was the only one to correctly cross-compile curl for arm64 (+ make it statically-linked) [1].

In the future we'd like to test it with projects like FFmpeg or chromium - those should be much more difficult.

[1] https://www.compilebench.com/curl-ssl-arm64-static/

nl•44m ago
This is a really good benchmark. So much time is spent on these messy types of tasks and no one really likes doing it.

Now if it could fix React Native builds after package upgrades I'd be impressed...

bgwalter•40m ago
LGTM! I'm sure it comes with a correctness proof, too!

The newer blog posts appear to scan forums like this one for objections ("AI" does not work for legacy code bases) and then create custom "benchmarks" for their sales people to point to if they encounter these objections.

falcor84•29m ago
> Our toughest challenges include cross-compiling to Windows or ARM64 and resurrecting 22-year-old source code from 2003 on modern systems. Some agents needed 135 commands and 15 minutes just to produce a single working binary.

I found that "just" there to be so funny in terms of how far the goal posts moved over these last few years (as TFA does mention). I personally am certain that it would have taken me significantly longer than that to do it myself.

Philpax•20m ago
Excellent benchmark. May I suggest a extension: "port any pre-uv Python ML codebase to uv so that it can actually be reliably reproduced"?
buildbot•13m ago
I’ve been doing this a lot! AI seems to really excel at setting up compiler boilerplate/minor modifications for new arch. I made a simple cpu information utility work on HP PA-RISC and Sparc64 :)

RIP "Browsers"

https://blog.jim-nielsen.com/2025/rip-browsers/
1•freediver•27s ago•0 comments

The Struggle to Visualize Zettelkasten Notes and How I Solved It

https://wasi0013.com/2025/09/22/data-visualization-challenge-the-struggle-to-visualize-thousands-...
2•pyprism•2m ago•0 comments

A staff revolt rocked the fintech powerhouse FNZ

https://www.thetimes.com/business-money/companies/article/how-a-staff-revolt-rocked-the-fintech-p...
1•gadders•3m ago•0 comments

The Right to Your Feed: A Simple Fix for Social Media

https://rosslazer.com/posts/right-to-feed/
1•rosslazer•3m ago•0 comments

Moody's raises Big Red over flag Oracle's mega AI DC buildout blueprint

https://www.theregister.com/2025/09/22/moodys_raises_questions_over_oracles/
1•rntn•5m ago•0 comments

Crystal v0.3: Codex support in Git Worktrees

https://github.com/stravu/crystal
1•jbentley1•5m ago•1 comments

Vertech Academy

https://www.vertechacademy.ca/
1•Hamid213•6m ago•1 comments

Dear GitHub: no YAML anchors, please

https://blog.yossarian.net/2025/09/22/dear-github-no-yaml-anchors
3•woodruffw•8m ago•0 comments

Show HN: T3 Chat for Image Models

https://frames.so
1•moschetti1•8m ago•0 comments

How to Make Sense of Any Mess

https://www.howtomakesenseofanymess.com
1•surprisetalk•10m ago•0 comments

WebKit Features in Safari 26.0

https://webkit.org/blog/17333/webkit-features-in-safari-26-0/
1•ksec•10m ago•0 comments

Future Fonts: where type designers sell fonts in progress

https://www.futurefonts.com/
1•surprisetalk•10m ago•0 comments

What is algebraic about algebraic effects?

https://interjectedfuture.com/what-is-algebraic-about-algebraic-effects/
2•iamwil•12m ago•0 comments

Oracle appoints insiders Clay Magouyrk, Mike Sicilia as co-CEOs in surprise move

https://finance.yahoo.com/news/oracle-appoints-insiders-clay-magouyrk-121559145.html
2•nixgeek•12m ago•0 comments

Show HN: Personalized offers via models trained on visitor actions

https://aiprice.me
1•aubmedia•13m ago•1 comments

Cloudflare's goal to hire 1,111 interns in 2026

https://blog.cloudflare.com/cloudflare-1111-intern-program/
3•pavel_lishin•14m ago•0 comments

Cryptocurrencies Sink as $1.5B in Bullish Bets Wiped Out

https://www.bloomberg.com/news/articles/2025-09-22/cryptocurrencies-sink-as-1-5-billion-in-bullis...
3•wslh•14m ago•1 comments

Nonplussed by "Nonplussed"

https://www.bookofjoe.com/2025/09/my-entry-29-2.html
1•surprisetalk•14m ago•0 comments

Underappreciated Subcultural Details

https://justismills.substack.com/p/underappreciated-subcultural-details
1•surprisetalk•15m ago•0 comments

Free access to Cloudflare developer features for students

https://blog.cloudflare.com/workers-for-students/
1•frasermarlow•15m ago•0 comments

I want a cross-platform tiling window manager

https://lgug2z.com/articles/i-want-a-cross-platform-tiling-window-manager/
1•bsnnkv•15m ago•0 comments

Private toll roads are supposed to save taxpayers' money, but have hidden costs

https://theconversation.com/private-toll-roads-are-supposed-to-save-taxpayers-money-but-can-have-...
2•PaulHoule•17m ago•0 comments

Anthropic Economic Index: Tracking AI's Role in the US and Global Economy

https://www.anthropic.com/research/economic-index-geography
1•myth_drannon•18m ago•0 comments

US and Indian VCs just formed a $1B+ alliance to fund India's deep tech startups

https://techcrunch.com/2025/09/01/u-s-and-indian-vcs-just-formed-a-1b-alliance-to-fund-indias-dee...
1•koolhead17•20m ago•0 comments

The Big Gotcha With starting-style

https://www.joshwcomeau.com/css/starting-style/
1•joshwcomeau•20m ago•0 comments

The Death of the Corporate Job

https://thestillwandering.substack.com/p/the-death-of-the-corporate-job
1•trevin•20m ago•0 comments

A 40-year study finds higher science funding under Republicans

https://www.psypost.org/a-40-year-study-finds-higher-science-funding-under-republicans/
9•delichon•21m ago•5 comments

Curse of knowledge

https://en.m.wikipedia.org/wiki/Curse_of_knowledge
1•joshdavham•21m ago•0 comments

Cap'n Web: A JavaScript-native RPC system

https://github.com/cloudflare/capnweb
1•anhldbk•23m ago•0 comments

Why random lines of video game dialogue get stuck in our heads

https://www.theguardian.com/games/2025/sep/17/video-game-dialogue-pushing-buttons
1•andsoitis•25m ago•0 comments