It takes a Python function and generates a self-contained, header-only C++ library and a corresponding CMakeLists.txt file. It pulls in the required dependencies automatically (e.g. llama.cpp, onnxruntime, mlx). You can then use it to build and ship an application or library.
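To give a sense of the input side, here's the kind of plain Python function you could hand to the transpiler. This is just an illustrative sketch: the function name, signature, and use of numpy are hypothetical, not a required muna convention (the actual Kokoro TTS sample is in the repo linked below).

```
import numpy as np

def classify(logits: np.ndarray) -> int:
    """Return the index of the highest-probability class (hypothetical example)."""
    # Numerically stable softmax over the last axis.
    shifted = logits - logits.max(axis=-1, keepdims=True)
    probs = np.exp(shifted) / np.exp(shifted).sum(axis=-1, keepdims=True)
    return int(probs.argmax(axis=-1))

if __name__ == "__main__":
    print(classify(np.array([0.1, 2.3, -1.0])))  # -> 1
```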
Try it yourself on Kokoro TTS:

```
$ pip install muna
$ muna transpile https://github.com/muna-ai/muna-predictors/blob/main/text-to... --trust-remote-code --install-deps
$ cd cpp && cmake -B build && cmake --build build
$ ./kokoro_tts --text "Hello Hacker News!" --voice af
```
The commands above will transpile our Kokoro sample from Python to C++, compile it, and run the example (which uses the generated header-only library).
Taking a step back: on-device AI is becoming a hot topic, but we think the dominant framing of cloud vs. on-device is misguided. If you look deeper, you'll realize that the key is portability. ggerganov proved this by building llama.cpp, which developers have deployed on everything from Blackwell GPUs to a Raspberry Pi. The first step is always creating a bare-metal, hardware-optimized C/C++ implementation. `muna transpile` automates this for anything you can fit into a Python function.
Note that this is free and freely usable: your Python source code goes in, and it's still your source code when it comes out (just converted to C++). We're building more on top of this (e.g. choosing where inference runs in one line of code, and agent skills that run AI models locally), so we're using this as an opportunity to expand support for more kinds of AI models and Python functions.
Try it out and lmk what you think.