An interesting and somewhat inspiring bit of trivia from the video: by their own admission, the creator barely understands modern image compression techniques, but that hasn't stopped them from producing this impressive result.
What I want to see is the total IO + CPU time across libraries for my specific IO and CPU constraints.
Sure, it makes benchmarks more involved to display, as scalars are not enough anymore and you need multiple curves, but at least that's meaningful.
To illustrate: if I have very fast IO, then I probably don't care about the compression ratio; it will be faster to download the raw payload and pay zero decompression cost.
On the other end of the spectrum, if I have very slow IO, I would gladly accept a much slower decompression algorithm with a higher compression ratio for a faster overall time.
This is especially important because, for instance, cloud storage is rather cheap but slow, while caches/CDNs are very fast but their storage is expensive. Etc.
This knob is typically called a (compression) level, even though it isn't advertised as such because it is hard to translate a level into your metric. Some libraries like zstd support IO-adaptive operation as a simpler alternative too.
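To make the level knob concrete, here is a minimal sketch (my own illustration, not from the thread) using the python-zstandard binding; "payload.bin" is a stand-in for whatever you are shipping:

    # Sketch of the "level" knob via python-zstandard (pip install zstandard).
    # Higher levels burn more CPU for smaller output; where the sweet spot
    # sits depends entirely on your IO speed.
    import time
    import zstandard

    data = open("payload.bin", "rb").read()  # any sample payload
    for level in (1, 3, 9, 19):
        start = time.perf_counter()
        compressed = zstandard.ZstdCompressor(level=level).compress(data)
        elapsed = time.perf_counter() - start
        print(f"level {level:2}: {len(compressed)} bytes in {elapsed:.3f}s")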
The benchmark doesn't know what your CPU performance and bandwidth are (or will be on some future device). You should be able to multiply these costs out yourself.
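A back-of-the-envelope model for that multiplication might look like this; every codec figure below is invented for illustration, not a measurement:

    # "Multiply the costs out yourself": total time = transfer + decompression.
    BANDWIDTH_MB_S = 12.5  # ~100 Mbit/s link; plug in your own
    RAW_MB = 100.0         # uncompressed payload size

    codecs = {
        # name: (compressed size in MB, decompression seconds per MB of output)
        "raw":       (100.0, 0.0),
        "fast-lz":   ( 55.0, 0.004),
        "strong-cm": ( 38.0, 0.060),
    }

    for name, (size_mb, sec_per_mb) in codecs.items():
        total = size_mb / BANDWIDTH_MB_S + sec_per_mb * RAW_MB
        print(f"{name:>9}: {total:5.2f}s total")

    # At very high bandwidth "raw" wins outright; drop to ~1 MB/s and
    # "strong-cm" comes out ahead. The winner is a function of your link.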
CPU speed is improving at a much higher rate than IO. It makes sense: one is infrastructure, the other is the silicon in your pocket.
That's why you're getting downvoted, and why algorithms keep pushing more and more work onto the CPU. Statistically, the correct posture is to expect the CPU to consistently get faster. I mean, if you make an image format now, you want it to work with CPUs in 30 years. Will IO speed gains overtake CPU improvements in 30 years? Most certainly not.
The original impetus that led to MNG was animated GIF. While GIF was also designed to be extensible, in practice GIF only had a handful of extensions and wasn't that complex. MNG in comparison was essentially a binary version of SVG as I have described, except that it had no vector graphics and is more accurately described as a binary slideshow format [1]. No one asked for this! The PNG development group forgot its goal and produced a badly bloated specification. As a result libmng, while functional, was quite a big library to incorporate into Gecko and was ultimately rejected in favor of APNG, which only required a small patch to libpng. It is amusing that the original PNG specification had chunks corresponding to GIF extensions, including the animation one; those chunks might have been widespread in an alternate universe!
If the group's goal was indeed a slideshow format, it should have been two separate formats: one for multi-part images and one for animations and object transformations. The former is necessary because one aspect of GIF was that a difference from the previous frame could be encoded, so this "Delta-PNG" format should have been made an option in the mainline PNG specification. (These additional images could have been used as previews, for example.) Anything else should have been a simple textual format that refers to external files. MNG instead had chunks for JPEG images, which came with a standalone sister format called JNG; that is absurd when you think about it: why should JNG exist when JFIF and Exif already do? The modern SVG happens to be perfectly usable as this second format, and it is fine being textual.
[1] If you can't believe what I've said, see the table of contents of the actual specification: http://www.libpng.org/pub/mng/spec/
See this example[1] for illustration.
Of course it turned out that ID had decided they wanted a longer bleep-boop every time you spun the wheel, and the real-time screen position update was dependent on the sound completing. Click-ticks were completely unacceptable, but adding a scroll buffer and detecting its state while filling an audio buffer with tiny fractional bleeps allowed reasonable-latency scrolling. Of course buffer cleanup and beep-slice management were the most annoying parts. Sadly, the phone shipped.
Over the years I saw tons of software on Windows that made mistakes like this: stupid animations or sounds synced with the actual logic of the application, holding everything up.
Years ago there was an app our team installed that took forever, around 30 minutes, to install about 500 MB of data. When we monitored it, there was nothing going on most of the time. We ended up just batching out all the files, registry keys, and DLL registrations ourselves, which installed in less than 5 minutes.
> How one guy in his bedroom (kind of) beat all of PNG's combined multi-decade effort in one year, and why that's strange.
PNG is a 30-year-old format and hasn't changed much over the years (as far as I am aware). PNG at its core is still just some basic filters with DEFLATE on top. The decades of work were mostly spent on optimizing encoding speed and finding better heuristics for selecting filters. There are far more impressive codecs nowadays.
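For anyone curious what those "basic filters" look like, the fanciest one is the Paeth predictor. A sketch of filter type 4 over a single 8-bit channel (simplified from the PNG spec; bytes-per-pixel handling elided):

    # Paeth: predict each byte from its left, above, and upper-left
    # neighbours, store only the difference, and let DEFLATE eat the rest.
    def paeth_predict(a, b, c):
        # a = left, b = above, c = upper-left
        p = a + b - c                      # initial estimate
        pa, pb, pc = abs(p - a), abs(p - b), abs(p - c)
        if pa <= pb and pa <= pc:
            return a
        if pb <= pc:
            return b
        return c

    def filter_row(row, prev_row):
        # prev_row is all zeros for the first scanline, per the spec
        out = []
        for x, byte in enumerate(row):
            a = row[x - 1] if x > 0 else 0
            b = prev_row[x]
            c = prev_row[x - 1] if x > 0 else 0
            out.append((byte - paeth_predict(a, b, c)) % 256)
        return out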
Apart from that I am still a big fan of QOI. It has amazing results for how simple the format is.
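To show just how simple: the core trick is a 64-entry running index keyed by a fixed hash (the hash below is the one from the QOI spec; the rest of the encoder is elided in this sketch):

    # Every pixel seen goes into a 64-slot table; a repeat of a recently
    # seen colour then encodes as a single QOI_OP_INDEX byte.
    def qoi_hash(r, g, b, a):
        return (r * 3 + g * 5 + b * 7 + a * 11) % 64

    index = [(0, 0, 0, 0)] * 64

    def encode_pixel(px):
        h = qoi_hash(*px)
        if index[h] == px:
            return ("QOI_OP_INDEX", h)     # one byte in the real format
        index[h] = px
        return ("other-op", px)            # run/diff/luma/literal ops elided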
After watching this video, my first thought was whether recent techniques from columnar compression (e.g. https://docs.vortex.dev/references#id1) applied "naively" like QOI would do well.
I started with a 1.79 MiB sprite file for a 2D game I've been hacking on, and here are the results:
PNG: 1.79 MiB
QOI: 2.18 MiB
BtrBlocks: 3.69 MiB
(Source: https://gist.github.com/sujayakar/aab7b4e9df01f365868ec7ca60...)
So, there's magic to being Quite OK that goes beyond just applying compression techniques from elsewhere :)
lifthrasiir•7h ago
As that data compression nerd, I find QOI both a refresher and an annoyance. It serves as a good example that input modelling is very important, so much so that QOI almost beats PNG in spite of very suboptimal coding. But QOI also represents a missed opportunity because, well, we can do much better if we are willing to use something like QOI instead of the standard formats! Weirdly enough, people seem to value only the simplicity when it comes to QOI, which is not the main takeaway, I believe. Maybe QOIR [1] will result in a better-balanced format in the future, though...
[1] https://github.com/nigeltao/qoir/
staunton•7h ago
I can't imagine people will start storing their family pictures in a new format they've never heard of, which is not supported by any software they use, for "just" 20% better compression. Do they even want lossless compression in the first place (assuming you don't ask them directly using that term)?
lifthrasiir•6h ago
[1] https://github.com/dropbox/lepton
staunton•6h ago
For example, was it worth it in the end? Did they announce anything? Did they switch to another method or give up on the idea, or do we not know?
lifthrasiir•2h ago
> While we did ensure that the reported vulnerabilities don’t affect our internal use of Lepton, we unfortunately don’t have the capacity to properly fix these and future issues in this public repo.
As far as I know this is indeed the last public mention of its use, but given that Lepton was already deployed and dropping it would substantially increase their traffic, it is reasonable to assume that its use continues in some form to this day.