
Start all of your commands with a comma

https://rhodesmill.org/brandon/2009/commands-with-comma/
193•theblazehen•2d ago•56 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
678•klaussilveira•14h ago•203 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...
954•xnx•20h ago•552 comments

How we made geo joins 400× faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes
125•matheusalmeida•2d ago•33 comments

Jeffrey Snover: "Welcome to the Room"

https://www.jsnover.com/blog/2026/02/01/welcome-to-the-room/
25•kaonwarb•3d ago•21 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/
62•videotopia•4d ago•2 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
235•isitcontent•15h ago•25 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
227•dmpetrov•15h ago•121 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
38•jesperordrup•5h ago•17 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
332•vecti•17h ago•145 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/
499•todsacerdoti•22h ago•243 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
384•ostacke•21h ago•96 comments

Microsoft open-sources LiteBox, a security-focused library OS

https://github.com/microsoft/litebox
360•aktau•21h ago•183 comments

Where did all the starships go?

https://www.datawrapper.de/blog/science-fiction-decline
21•speckx•3d ago•10 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
291•eljojo•17h ago•182 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
413•lstoll•21h ago•279 comments

ga68, the GNU Algol 68 Compiler – FOSDEM 2026 [video]

https://fosdem.org/2026/schedule/event/PEXRTN-ga68-intro/
6•matt_d•3d ago•1 comments

Was Benoit Mandelbrot a hedgehog or a fox?

https://arxiv.org/abs/2602.01122
20•bikenaga•3d ago•10 comments

PC Floppy Copy Protection: Vault Prolok

https://martypc.blogspot.com/2024/09/pc-floppy-copy-protection-vault-prolok.html
66•kmm•5d ago•9 comments

Dark Alley Mathematics

https://blog.szczepan.org/blog/three-points/
93•quibono•4d ago•22 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
260•i5heu•17h ago•202 comments

Delimited Continuations vs. Lwt for Threads

https://mirageos.org/blog/delimcc-vs-lwt
33•romes•4d ago•3 comments

Female Asian Elephant Calf Born at the Smithsonian National Zoo

https://www.si.edu/newsdesk/releases/female-asian-elephant-calf-born-smithsonians-national-zoo-an...
38•gmays•10h ago•12 comments

I now assume that all ads on Apple news are scams

https://kirkville.com/i-now-assume-that-all-ads-on-apple-news-are-scams/
1073•cdrnsf•1d ago•458 comments

Introducing the Developer Knowledge API and MCP Server

https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/
60•gfortaine•12h ago•26 comments

Understanding Neural Network, Visually

https://visualrambling.space/neural-network/
291•surprisetalk•3d ago•43 comments

I spent 5 years in DevOps – Solutions engineering gave me what I was missing

https://infisical.com/blog/devops-to-solutions-engineering
150•vmatsiiako•19h ago•71 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
8•1vuio0pswjnm7•1h ago•0 comments

Why I Joined OpenAI

https://www.brendangregg.com/blog/2026-02-07/why-i-joined-openai.html
154•SerCe•10h ago•144 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
187•limoce•3d ago•102 comments

High-performance 2D graphics rendering on the CPU using sparse strips [pdf]

https://github.com/LaurenzV/master-thesis/blob/main/main.pdf
281•PaulHoule•2mo ago

Comments

miguel_martin•2mo ago
Also check out blaze: https://gasiulis.name/parallel-rasterization-on-cpu/
hollowturtle•2mo ago
The demo is astonishing.
raphlinus•2mo ago
Thanks for the pointer, we were not actually aware of this, and the claimed benchmark numbers look really impressive.
convolvatron•2mo ago
There were at least two renderers written for the CM2 that used strips. At least one of them used scans and general communication; most likely both did.

1) For the given processor set, where each processor holds an object, "spawn" a processor in a new set, one processor for each span.

(a) The spawn operation consists of the source processor setting the number of nodes in the new domain, then performing an add-scan, then sending the total allocation back to the front end. The front end then allocates a new power-of-2 shape that can hold those. The object-set then uses general communication to send scan information to the first of these in the strip-set (the address is left over from the scan).
(b) In the strip-set, use a mask-copy-scan to get all the parameters to all the elements of the scan set.
(c) Each of these elements of the strip set determines the pixel location of the leftmost element.
(d) Use a general send to seed the strip with the parameters of the strip.
(e) Scan those using a mask-copy-scan in the pixel-set.
(f) Apply the shader or the interpolation in the pixel-set.

Note that steps (d) and (e) also depend on encoding the depth information in the high bits and using a max combiner to perform z-buffering.

Edit: there must have been an additional span/scan in a pixel space that is then sent to image space with z-buffering, otherwise strip seeds could collide and be sorted by z, which may miss pixels from the losing strip.
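For illustration only (this is not the CM2 code, and the names and SpanParams fields are made up), a serial Rust sketch of the add-scan allocation and copy-scan seeding described in steps (a)-(e):

    // Hypothetical serial sketch: an exclusive add-scan over span lengths gives
    // each span its offset in the newly allocated strip/pixel set; parameters are
    // seeded at that offset and a copy-scan (here a plain fill) propagates them.
    #[derive(Clone, Copy, Default)]
    struct SpanParams {
        y: u16,
        z: u16, // depth, for the max-combiner z-buffering mentioned above
        color: u32,
    }

    fn allocate_and_seed(spans: &[(u32, SpanParams)]) -> Vec<SpanParams> {
        // exclusive add-scan over span lengths
        let mut offsets = Vec::with_capacity(spans.len());
        let mut total = 0u32;
        for &(len, _) in spans {
            offsets.push(total);
            total += len;
        }
        // the "front end" allocates the new set sized by the scan total
        let mut strip_set = vec![SpanParams::default(); total as usize];
        // seed each strip at its scan offset, then propagate the parameters across it
        for (&(len, params), &off) in spans.iter().zip(&offsets) {
            for elem in &mut strip_set[off as usize..(off + len) as usize] {
                *elem = params;
            }
        }
        strip_set
    }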

actionfromafar•2mo ago
What's a CM2? I tried searching combined with some graphics-related keywords but I just get weird stuff.
Lerc•2mo ago
Given the focus on parallelism and communication, maybe the Connection Machine 2?
pixelpoet•2mo ago
This looks interesting; recently I wrote some code for rendering high-precision N-body paths with millions of vertices[0]. I wonder if a GPU implementation of this RLE representation would work well and maintain simplicity.

[0] https://www.youtube.com/watch?v=rmyA9AE3hzM

amelius•2mo ago
Side question. Is there some kind of benchmark to test the correctness of renderers?
embedding-shape•2mo ago
Correctness of what exactly? It's a "render" of a reality-like environment, so all of them make some tradeoff somewhere and won't be 100% "correct", at least compared to reality :)
jmpeax•2mo ago
Correctness with respect to the benchmark. A slow reference renderer could produce the target image, and renderers would need to achieve exact or close reproduction of the reference. Otherwise, you could just make substantial approximations and claim a performance victory.
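For illustration, a minimal Rust sketch (hypothetical names) of that kind of check, comparing the fast renderer's output to the slow reference with a per-pixel tolerance:

    // Hypothetical sketch: require the fast renderer's output to stay within a
    // per-channel tolerance of the slow reference renderer's image.
    fn matches_reference(output: &[u8], reference: &[u8], tolerance: u8) -> bool {
        output.len() == reference.len()
            && output.iter().zip(reference).all(|(a, b)| a.abs_diff(*b) <= tolerance)
    }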
user____name•2mo ago
Bezier curves can generate degenerate geometry when flattened and stroke geometry has to handle edge cases. See for instance the illustration on the last page of the Polar Stroking paper: https://arxiv.org/pdf/2007.00308

There are also things like interpreting (conflating) coverage as alpha in analytical antialiasing methods, which can lead to visible hairline cracks.

qingcharles•2mo ago
I assume the parent commenter means avoiding things like rendering the same pixel twice for adjacent paths, and avoiding gaps between paths that share identical edges. These are common problems for fast renderers that take liberties with accuracy in favor of speed (e.g. greater numerical errors caused by using fixed point instead of floating point).
percentcer•2mo ago
This was the original goal of the Cornell box (https://en.wikipedia.org/wiki/Cornell_box, i.e. carefully measure the radiosity of a simple, real-world scene and then see how closely you can come to simulating it).

For realtime rendering, a common thing to do is to benchmark against a known-good offline renderer (e.g. Arnold, Octane).

mkl•2mo ago
That's for realistic 3D rendering, a totally different problem from 2D vector graphics.
fngjdflmdflg•2mo ago
Fascinating project. Based on section 3.9, it seems the output is in the form of a bitmap, so I assume you have to do a full memory copy to the GPU to display the image in the end. With Skia moving to WebGPU[0] and with WebGPU supporting compute shaders, I feel that 2D graphics is slowly becoming a solved problem in terms of portability and performance. Of course there are cases where you would want a CPU renderer. Interestingly, the web is sort of one of them, because you have to compile shaders at runtime on page load. I wonder if it could make sense in theory to have multiple stages to this, sort of like how JS JITs work, where you would start with a CPU renderer while the GPU compiles its shaders. Another benefit, as the author mentions, is binary size. WebGPU (via Dawn at least) is rather large.

[0] https://blog.chromium.org/2025/07/introducing-skia-graphite-...
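A minimal sketch of that staging idea (hypothetical types, not Skia's or Vello's API): fall back to the CPU path until the GPU pipeline reports its shaders as compiled.

    // Hypothetical sketch of a staged renderer: use the CPU path until the GPU
    // pipeline has finished compiling its shaders, then switch over.
    struct CpuRenderer;
    struct GpuRenderer;

    enum ActiveRenderer {
        Cpu(CpuRenderer),
        Gpu(GpuRenderer),
    }

    fn pick_renderer(cpu: CpuRenderer, gpu: Option<GpuRenderer>) -> ActiveRenderer {
        match gpu {
            Some(g) => ActiveRenderer::Gpu(g), // shaders ready: GPU path
            None => ActiveRenderer::Cpu(cpu),  // still compiling: CPU fallback
        }
    }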

raphlinus•2mo ago
The output of this renderer is a bitmap, so you have to do an upload to GPU if that's what your environment is. As part of the larger work, we also have Vello Hybrid which does the geometry on CPU but the pixel painting on GPU.

We have definitely thought about having the CPU renderer while the shaders are being compiled (shader compilation is a problem) but haven't implemented it.

fngjdflmdflg•2mo ago
In any interactive environment you have to upload to the GPU on each frame to output to a display, right? Or maybe integrated SoCs can skip that? Of course you only need to upload the dirty rects, but in the worst case the full image.

>geometry on CPU but the pixel painting on GPU

Wow. Is this akin to running just the vertex shader on the CPU?

raphlinus•2mo ago
It's analogous, but vertex shaders are just triangles, and in 2D graphics you have a lot of other stuff going on.

The actual process of fine rasterization happens in quads, so there's a simple vertex shader that runs on GPU, sampling from the geometry buffers that are produced on CPU and uploaded.

jcelerier•2mo ago
I regularly do remote VNC and X11 access on stuff like the Raspberry Pi Zero, and in these cases the GPU does not work; you won't be able to open a GL context at all. Also, whenever I update my kernel on Arch Linux I'm not able to open a GL context until I reboot, so I really need apps that don't need a GPU context just to show stuff.
zamadatix•2mo ago
For the Pi Zero you can force a headless HDMI output in the config and then use that instead of a virtual display to get working GPU with VNC.
actionfromafar•2mo ago
You can also trick any HDMI output into believing it's connected to a monitor.

One commercial product is:

https://eshop.macsales.com/item/NewerTech/ADP4KHEAD/

But I seem to recall there are dirt cheap hacks to do the same. I may be conflating it with the "resistor jammed into the DVI port" trick, which worked back in the VGA and DVI days. Memory unlocked: I did this to an old Mac Mini in a closet for some reason.

qingcharles•2mo ago
Surely not if the CPU and video output device share common RAM?

Or with old VGA, the display RAM was mapped to known system RAM addresses and the CPU would write directly to it (you could write to an off-screen buffer and flip for double/triple buffering).
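A minimal Rust sketch of that off-screen-buffer-and-flip scheme (hypothetical names, not tied to any real display API):

    // Hypothetical sketch: draw into a back buffer, then swap it with the front
    // buffer that the display (or a final blit) reads from.
    struct FrameBuffers {
        front: Vec<u32>, // currently displayed pixels
        back: Vec<u32>,  // pixels being drawn for the next frame
    }

    impl FrameBuffers {
        fn new(width: usize, height: usize) -> Self {
            FrameBuffers {
                front: vec![0; width * height],
                back: vec![0; width * height],
            }
        }

        fn flip(&mut self) {
            std::mem::swap(&mut self.front, &mut self.back);
        }
    }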

ChrisGreenHeur•2mo ago
It just depends on what architecture your computer has.

On a PC, the CPU typically has exclusive access to system RAM, while the GPU has its own dedicated VRAM. The graphics driver runs code on both the CPU and the GPU (the GPU has its own embedded processor), so data is constantly being copied back and forth between the two memory pools.

Mobile platforms like the iPhone or macOS laptops are different: they use unified memory, meaning the CPU and GPU share the same physical RAM. That makes it possible to allocate a Metal surface that both can access, so the CPU can modify it and the GPU can display it directly.

However, you won’t get good frame rates on a MacBook if you try to draw a full-screen, pixel-perfect surface entirely on the CPU; it just can’t push pixels that fast. But you can write a software renderer where the CPU updates pixels and the GPU displays them, without copying the surface around.

nicoburns•2mo ago
One place where a CPU renderer is particularly useful is in test runners (where the output of the test is an image/screenshot), or any other use case where the output is an image. In that case, the output never needs to get to the GPU, and indeed if you render on the GPU then you have to copy the image back!
Reason077•2mo ago
> "I assume you have to do a full memory copy to the GPU to display the image in the end."

On a unified memory architecture (e.g. Apple Silicon), that's not an expensive operation. No copy required.

raphlinus•2mo ago
Unfortunately graphics APIs suck pretty hard when it comes to actually sharing memory between CPU and GPU. A copy is definitely required when using WebGPU, and also on discrete cards (which is what these APIs were originally designed for). It's possible that using native APIs directly would let us avoid copies, but we haven't done that.
voidmain•2mo ago
The paper defines this structure

    struct Strip {
        x: u16,
        y: u16,
        alpha_idx_fill_gap: u32,
    }
which looks like it is 64 bits (8 bytes) in size,

and then says

> Since a single strip has a memory footprint of 64 bytes and a single alpha value is stored as u8, the necessary storage amounts to around 259 ∗ 64 + 7296 ≈ 24KB

am I missing something, or is it actually 259*8 + 7296 ≈ 9KB?
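For reference, a quick check of that arithmetic with the struct as defined in the paper (259 strips plus 7296 alpha bytes):

    // size_of::<Strip>() is 8 bytes (two u16 plus one u32), not 64
    struct Strip {
        x: u16,
        y: u16,
        alpha_idx_fill_gap: u32,
    }

    fn main() {
        assert_eq!(std::mem::size_of::<Strip>(), 8);
        println!("{} bytes", 259 * std::mem::size_of::<Strip>() + 7296); // 9368, i.e. ~9 KB
    }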

shoo•2mo ago
I think you are correct: the memory use of the implementation is overestimated in that paragraph, and as you suggest it is lower. From a quick skim, the benchmarks section focuses on comparing running time against other libraries; there isn't a comparison of storage.
Benjamin_Dobell•2mo ago
Admittedly I won't have time to go through the code. However, from a quick look at the thesis, there's a section on multi-threading.

Whilst it's still very possible this was a simple mistake, an alternate explanation could be that each strip is allocated to a unique cache line. On modern x86_64 systems, a cache line is 64 bytes. If the renderer is attempting to mitigate false sharing, then it may be allocating each strip in its own cache line, instead of contiguously in memory.
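To illustrate that hypothesis (the thesis itself may not do this), forcing 64-byte alignment in Rust would indeed round the struct up to a full cache line:

    // Speculative sketch: 64-byte alignment pads the 8-byte strip to a whole
    // cache line, avoiding false sharing between threads at an 8x memory cost.
    #[repr(align(64))]
    struct PaddedStrip {
        x: u16,
        y: u16,
        alpha_idx_fill_gap: u32,
    }

    fn main() {
        assert_eq!(std::mem::size_of::<PaddedStrip>(), 64);
    }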

Vallaaaris•2mo ago
Hi, author here. You are right, it seems like I mixed up bytes and bits here. Embarrassing mistake, thanks for catching this!
swiftcoder•2mo ago
Off-topic, but when did GitHub's PDF preview start to only load a few pages at a time? I'd much rather they delivered the whole PDF and let my browser handle the PDF rendering...
thisOtterBeGood•2mo ago
Interesting. What I would like to see is a single-core comparison of the compared renderers, since that would indicate the efficiency of the code. I would assume the popular renderers are not as fast, but also need less CPU time overall?
Vallaaaris•2mo ago
There is a section on single-core performance comparison in the thesis!

Alternatively, you can also check the results from the official Blend2D benchmarks: https://blend2d.com/performance.html

Or my version where I added some more renderers to the existing ones: https://laurenzv.github.io/vello_chart/

dxroshan•2mo ago
Is one of the advisors, Raph Levien, the author of the old Libart library?
CrimsonCape•2mo ago
Yes.