
Flux 2 Klein pure C inference

https://github.com/antirez/flux2.c
59•antirez•1h ago

Comments

reactordev•58m ago
This is both awesome and scary. Yes, now we can embed image gen in things like game engines and photoshop or build our own apps. On the other hand, we can include image gen in anything…
nusl•53m ago
This was possible before, though
rvz•5m ago
Yes, it was always possible.

It's almost as if this is the first time many have seen something built in C with zero dependencies which makes this easily possible.

They are used to languages whose package managers add 30 packages and pull in 50-100+ other dependencies before the project is even able to build.

d_watt•52m ago
Regarding the meta experiment of using LLMs to transpile to a different language, how did you feel about the outcome / process, and would you do the same process again in the future?

I've had some moments recently in my own projects, as I worked through some bottlenecks, where I took a whole section of a project, said "rewrite in Rust" to Claude, and got massive speedups from a zero-shot rewrite (most recently for some video recovery programs). But I then had an output product I wouldn't feel comfortable vouching for outside of my homelab setup.

antirez•43m ago
It depends on the situation. In this case the agent worked only from the reference code provided by Black Forest Labs for Flux, which is basically just the pipeline implemented as a showcase. The fundamental requirement for this process to work is that the agent gets feedback to understand whether it is really making progress, and can debug failures against a reference implementation. But then all the code was written with many implementation hints about what I wanted to obtain, and without reference to other minimal inference libraries or kernels. So I believe this is just the effect of putting together known facts about how Transformer inference works plus a higher-level idea of how software should appear to the final user. Btw, today somebody took my HNSW implementation for vector sets and translated it to Swift (https://github.com/jkrukowski/swift-hnsw). I'm OK with that, nor do I care whether this result was obtained with AI or not. However, it is nice that the target license is the same, given the implementation is so similar to the C one.
rcarmo•34m ago
This is pretty great. I’ve gone and hacked your GTE C inference project to Go purely for kicks, but this one I will look at for possible compiler optimizations and building a Mac CLI for scripting…
rcarmo•26m ago
I have a set of prompts that are essentially “audit the current code changes for logic errors” (plus linting and testing, including double checking test conditions) and I run them using GPT-5.x-Codex on Claude generated code.

It’s surprising how much even Opus 4.5 still trips itself up with things like off-by-one or logic boundaries, so another model (preferably with a fresh session) can be a very effective peer reviewer.

So my checks are typically lint->test->other model->me, and relatively few things get to me in simple code. Contrived logic or maths, though, it needs to be all me.

antirez•24m ago
Something that may be interesting for readers of this thread: this project became possible only once I started telling Opus to keep a file with all the implementation notes, and to accumulate in it everything we discovered during the development process. The file also had clear instructions that it be kept updated and be processed ASAP after context compaction. This kinda enabled Opus to do such a big coding task in a reasonable amount of time without losing track. Check the file IMPLEMENTATION_NOTES.md in the GitHub repo for more info.
lukebechtel•19m ago
Very cool!

Yep, a constantly updated spec is the key. Wrote about this here:

https://lukebechtel.com/blog/vibe-speccing

I've also found it's helpful to have it keep an "experiment log" at the bottom of the original spec, or in another document, which it must update whenever things take "a surprising turn"
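As a rough sketch of the "experiment log at the bottom of the spec" idea, the helper below appends dated entries under a log heading at the end of a notes file. Everything here is an assumption made for illustration: the `## Experiment log` heading, the `append_log_entry` name, and the entry format are hypothetical, and the code assumes the log is the last section of the file.

```python
from datetime import date
from pathlib import Path
from typing import Optional

# Hypothetical heading; assumed to be the last section of the spec file.
LOG_HEADING = "## Experiment log"


def append_log_entry(notes_path: Path, entry: str, today: Optional[date] = None) -> str:
    """Append one dated bullet to the log section, creating the section if missing."""
    today = today or date.today()
    text = notes_path.read_text(encoding="utf-8") if notes_path.exists() else ""

    if LOG_HEADING not in text:
        # Separate the new section from existing content with a blank line.
        separator = "\n\n" if text.strip() else ""
        text = text.rstrip("\n") + separator + LOG_HEADING + "\n"

    line = f"- {today.isoformat()}: {entry.strip()}"
    text = text.rstrip("\n") + "\n" + line + "\n"
    notes_path.write_text(text, encoding="utf-8")
    return line
```

An agent (or a wrapper script) could call `append_log_entry(Path("SPEC.md"), "quantized path was slower than expected")` whenever something takes a surprising turn, so the log survives context compaction.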

vessenes•2m ago
Salvatore - this is cool. I am a fan of using Steve Yegge's beads for this - it generally cuts the markdown file cruft significantly.

Did you run any benchmarks? I'm curious whether Python's stack is faster or slower than a pure C vibe-coded inference tool.

throwaway2027•22m ago
If I asked Claude to do the same, could I also just put an MIT license on it with my name? https://github.com/black-forest-labs/flux2 uses the Apache License, apparently. I know it doesn't matter that much, and as long as it's permissive and openly available people don't care; it's just pedantry, but still.
antirez•10m ago
The reference code shows how to set up the inference pipeline. It does not implement 99% of what the C code does. That is, the inference kernels, the transformer and so forth.
neomantra•20m ago
Thanks for sharing this — I appreciate your motivation in the README.

One suggestion, which I have been trying to do myself, is to include a PROMPTS.md file. Since your purpose is sharing and educating, it helps others see what approaches an experienced developer is using, even if you are just figuring it out.

One can use a Claude hook to maintain this deterministically. I instruct in AGENTS.md that they can read but not write it. It’s also been helpful for jumping between LLMs, to give them some background on what you’ve been doing.

csto12•15m ago
As someone who doesn’t code in C and does more analytics work (SQL), is the code generated here “production grade”? One of the major criticisms I hear about LLMs is that they tend to generate code you wouldn’t want to maintain. Is that the case here?
adefa•6m ago
I ran a similar experiment last month and ported Qwen 3 Omni to llama.cpp. I was able to get GGUF conversion, quantization, and all input and output modalities working in less than a week. I submitted the work as a PR to the codebase and, understandably, it was rejected.

https://github.com/ggml-org/llama.cpp/pull/18404

https://huggingface.co/TrevorJS/Qwen3-Omni-30B-A3B-GGUF
