Why?
If you know beforehand that you'll execute some piece of code many times, the most efficient approach is to JIT-compile it right away, and not only after a lot of time has passed.
The JVM ecosystem has given up on JIT warm-up time for HotSpot. There are other solutions, like Graal.
Whenever I have to deal with Java projects I'm always astonished at how completely "normal" a 4-minute startup time is for a REST endpoint project on a single-core machine.
That's right, Java doesn't do what the NativeJIT library does: Hotspot starts in an interpreted mode and only later JIT-compiles frequently executed code; NativeJIT, in contrast, immediately compiles the generated code, at least according to this description:
Bing formulates a custom expression for each query and then uses NativeJIT to compile the expression into x64 code that will be run on a large set of candidate documents spread across a cluster of machines.
I don't know the specifics, but my guess is that by the time Hotspot would even start compiling, the user has already received the results for their query. Ergo, Java – at least with the Hotspot VM – wouldn't be suitable for this task.
> I cannot be convinced otherwise.
made me think "why bother?". If you insist on being wrong, go on being wrong.
> The scoring process attempts to gauge how well each document matches the user's intent, and as such, depends on the specifics of how each query was phrased. Bing formulates a custom expression for each query and then uses NativeJIT to compile the expression into x64 code that will be run on a large set of candidate documents spread across a cluster of machines.
If this is "wrong" to you, then I would like to remain wrong. In fact I would like to have my mental faculties as acutely orthogonally aligned as possible compared to yours in every possible dimension.
As I said multiple times, we disagree on this point. There is a scoring process at runtime which mirrors what HotSpot does, in a non-literal sense. And again, I'm OK with being "wrong", so off you go.
Both have always supported building libraries/assemblies and loading them (the ASM library+custom classloaders for Java and AssemblyBuilder in .NET are about equally capable).
However .NET also has DynamicMethod that is specifically built to quickly build just small functions that aren't tied to larger contexts (similar API to asm/assemblybuilder).
But an even easier option for stuff exactly like in the article, which people don't widely seem to know about, is that LINQ (yes, that "SQL-like" stuff) actually contains parts for deferred method building that can easily be leveraged to quickly produce customized native-code methods.
The neat part about the LINQ code generator is that you can just drop in plain C# code blocks to be translated by the C# compiler into LINQ snippets, and then with some helpers transform everything into LINQ trees that can then be compiled.
The benefit over ASM/AssemblyBuilder/DynamicMethod is that LINQ nodes are basically a built-in AST that can be directly leveraged, whereas the other APIs require some mapping of your own ASTs onto the respective stack machines.
https://learn.microsoft.com/en-us/dotnet/api/system.reflecti...
https://learn.microsoft.com/en-us/dotnet/api/system.reflecti...
https://learn.microsoft.com/en-us/dotnet/api/system.linq.exp...
This is C++, no? Why not use operator overloading for the project?
I love it when libraries like this do that. z3 in python is similar, you just build your constraints using normal syntax and it all just works. Great use of operator overloading.
AST<float> p1 = exp.GetP1();
AST<float> rsqr = p1 * p1; // AST<float> implements an operator* overload that produces an AST<float> object
Even if many frown upon operator overloading due to the annoying magic of the standard library's appropriation of the shift operators (<< and >>) for "piping", it's still what makes many people prefer C++ for vector math and similar tasks.
So while the result isn't a direct multiplication, it should still be an acceptable use, since the resulting code will actually be doing multiplications.
Considering what goes on under the hood, however, I guess there might be compiler-optimization reasons to keep everything in the expression object (as in the example) as the holder of data, instead of spread out in an allocation tree with lots of pointer chasing.
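To make the mechanics concrete, here is a toy sketch of the same operator-overloading trick in Python (none of these class names come from NativeJIT): the overloaded `*` doesn't multiply anything, it returns a `Mul` node, so an ordinary-looking expression quietly builds a tree that can be evaluated, or in a real JIT compiled, later.

```python
class Node:
    # "p1 * p1" calls this and builds a tree node instead of multiplying.
    def __mul__(self, other):
        return Mul(self, other)

class Const(Node):
    def __init__(self, value):
        self.value = value
    def eval(self, env):
        return self.value

class Param(Node):
    def __init__(self, name):
        self.name = name
    def eval(self, env):
        return env[self.name]

class Mul(Node):
    def __init__(self, lhs, rhs):
        self.lhs, self.rhs = lhs, rhs
    def eval(self, env):
        # The actual multiplication only happens here, at evaluation time.
        return self.lhs.eval(env) * self.rhs.eval(env)

p1 = Param("p1")
rsqr = p1 * p1                     # builds Mul(Param, Param); nothing computed yet
print(rsqr.eval({"p1": 3.0}))      # 9.0
```

This is exactly the structure the `AST<float>` example above relies on, just without the code-generation step at the end.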
First, nope, if it's not multiplication it should not be using the * operator, period. Operator overloading is already overused and it leads to so much problematic code that looks fine to the untrained eye (string concat using operator+ being a famous example).
That said, you may also want to pass more options to the Mul node in the future and operator*() can only accept two arguments.
As another example, run the following Python code to see how python represents its own AST:
import ast; print(ast.dump(ast.parse("2*PI*r*r"), indent=2))

As I said, I fully agree that operator overloading is horribly overused, but if the purpose of this JIT is to quickly create JIT code with customized expressions, then those expressions should be possible to write fairly verbatim, instead of following something with religious zeal (is even matrix multiplication allowed to overload?).
And yes, ASTs usually contain a lot more information, such as source location etc. for debugging (here it'd be interesting whether an operator overload is able to take C++20 source_location as a default parameter), but again, this is for a specialized JIT.
As for passing more options to Mul nodes: a Mul node is just that and nothing more (the only extra interesting data in a restricted scenario like this would possibly be source_location, as noted above).
They built this to translate a search query that is only known at runtime. Presumably they already have an AST or similar, so calling methods as it is being walked isn't any harder than operators.
Compared to even an optimized interpreter this will be somewhere between 4x and 20x faster (mainly due to far, far smaller branch-predictor costs), so even if it doesn't generate optimal code it will still be within an order of magnitude of optimal native code, whereas an interpreter will be much further behind.
dlopen/LoadLibrary etc. come with far more memory pressure and OS bookkeeping.
Bing internally uses a better version, but improvements are not merged back to GitHub. See https://github.com/BitFunnel/NativeJIT/issues/84#issuecommen...
I did this for ruby-libjit, and it made the JIT compiler code much easier to read. Here's an example: https://github.com/cout/ruby-libjit/blob/master/sample/fib.r...
And a real-world example (one of the very earliest JIT compilers for ruby, written in ruby, dates back to the ruby 1.8.x days): https://github.com/cout/ludicrous/blob/master/lib/ludicrous/...
anon-3988•7mo ago
def sum(N):
    x = 0
    for i in range(N):
        x += i
    return x
There's absolutely zero reason why this code has to involve pushing and popping stuff on the Python virtual stack. This should be compiled into assembly with a small conversion between C/PyObject.
The goal is to get to a point where we can even do non-trivial things inside this optimized context.
Python will never be able to go down to assembly, because Python supports doing "weird shit" like dynamically creating modules — hell, even creating a Python file, running eval on it, and loading it as a new module. How are you even going to transpile that to assembly?
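For the record, that kind of runtime module fabrication is entirely legal Python and takes only a few lines; everything below (the module name "generated", the function `triple`) is made up for illustration:

```python
import sys
import types

# Source code that doesn't exist until runtime.
source = "def triple(x):\n    return 3 * x\n"

# Fabricate a module object, execute the source inside it, and register
# it so normal import machinery can find it.
mod = types.ModuleType("generated")
exec(compile(source, "<generated>", "exec"), mod.__dict__)
sys.modules["generated"] = mod

# A perfectly ordinary import of a module that never existed on disk.
from generated import triple
print(triple(14))  # 42
```

An ahead-of-time compiler has no way to see `triple` before the program runs, which is why approaches like numba JIT-compile individual functions instead of trying to translate whole programs.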
So I'm approaching the problem the same way numba does, but hopefully more modern and simpler (implementation-wise). I'm planning on doing it in Rust, and the backend should be agnostic (GCC, Clang, whatever C compiler there is).
hayley-patton•7mo ago
Expect that you don't, and deoptimise when you do: https://bibliography.selflanguage.org/_static/dynamic-deopti...
It's really not that impossible.
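The speculate-and-deoptimise idea from the linked Self work can be sketched in a few lines of plain Python; the guard and the fallback here are invented for illustration, not how any real VM is written:

```python
def generic_add(a, b):
    # The slow, fully general path (stands in for the interpreter).
    return a + b

def make_specialized_add():
    # "Compile" speculatively, assuming both operands are ints.
    def specialized(a, b):
        if type(a) is int and type(b) is int:  # the guard
            return a + b                       # fast, specialized path
        return generic_add(a, b)               # guard failed: deoptimise
    return specialized

add = make_specialized_add()
print(add(2, 3))      # guard holds, fast path: 5
print(add("a", "b"))  # guard fails, generic path: "ab"
```

A real VM would invalidate and recompile rather than branch on every call, but the principle is the same: optimise for what you expect, and keep a correct escape hatch for when the expectation breaks.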
lhames•7mo ago
In particular, this talk might be interesting:
"Unlocking the Power of C++ as a Service: Uniting Python's Usability with C++'s Performance"
Video: https://www.youtube.com/watch?v=rdfBnGjyFrc
Slides: https://llvm.org/devmtg/2023-10/slides/techtalks/Vassilev-Un...
almostgotcaught•7mo ago
https://github.com/llvm/llvm-project/tree/main/clang/tools/c...