AV2 video codec delivers 30% lower bitrate than AV1, final spec due in late 2025

https://videocardz.com/newz/av2-video-codec-delivers-30-lower-bitrate-than-av1-final-spec-due-in-late-2025

84•ksec•3h ago

Comments

jonplackett•2h ago

It’s pretty amazing people are still finding ways to make video smaller.

Is this just people being clever or is it also more processing power being thrown at the problem when decoding / encoding?

dajonker•1h ago

I believe patents play a big role here as well. Anything new must be careful to not (accidentally) violate any active patent, so there might be some tricks that can't currently be used for AV1/AV2

olowe•1h ago

It’s more money and more user’s compute being thrown at the problem to get the streaming service’s CDN bill down.

occz•1h ago

While funny, that's not really what I would call accurate. Users get reduced data consumption, potentially higher quality selection if the bandwidth now allows for a higher resolution to be streamed, and possibly lower disk usage should they decide to offline the videos.

Better codecs are an overall win for everyone involved.

calcifer•33m ago

> Users get reduced data consumption, potentially higher quality selection if the bandwidth now allows for a higher resolution to be streamed

They also get increased power usage, lesser battery life, higher energy bills, and potentially earlier device failures.

> Better codecs are an overall win for everyone involved.

Right.

zweifuss•1h ago

A bit of both. Also, the modern Codecs have slightly different tradeoffs (image quality (PSNR, SSIM), computational complexity (CPU vs DSP vs Memory), storage requirements, bit rate) and therefore there isn't one that is best for every use case.

amiga386•1h ago

Yes, and it's allowing the format to change to allow more cleverness or apply more processing power.

For example, changes from one frame to the next are encoded in rectangular areas called "superblocks" (similar to a https://en.wikipedia.org/wiki/Macroblock). You can "move" the blocks (warp them), define their change in terms of other parts of the same frame (intra-frame prediction) or by referencing previous frames (inter-frame prediction), and so on... but you have to do it within a block, as that's the basic element of the encoding.

The more tightly you can define blocks around the areas that are actually changing from frame to frame, the better. Also, it takes data to describe where these blocks are, so there are special limitations on how blocks are defined, to minimise how many bits are needed to describe them.

AV2 now lets you define blocks differently, which makes it easier to fit them around the areas of the frame that are changing. It has also doubled the size of the largest block, so if you have some really big movement on screen, it takes fewer blocks to encode that.

That's just one change, the headline improvement comes from all the different changes, but this is an important one.

There is new cleverness in the encoders, but they need to be given the tools to express that cleverness -- new agreement about what types of transforms, predictions, etc. are allowed and can be encoded in the bitstream.

https://youtu.be/Se8E_SUlU3w?t=242

seydor•1h ago

All this high speed fiber for nothing...

ttoinou•1h ago

well we might start streaming 8K content ! Or maybe this could be used for 16K VR videos

stop50•1h ago

Or the next <insert shooter with ridiculous size here>

throwaway48476•40m ago

It's a shame European energy efficiency rules made 8K non viable. It's a great resolution.

lxgr•25m ago

Meh, I'll take 1080p at a higher bitrate and in a wider color gamut over the pixel soup many VOD services serve and have the audacity to still call "UHD" any day.

SG-•1h ago

it's almost like a majority of the world is still consuming data and video on mobile networks.

brnt•1h ago

Saturation is only a goal if it's a bottleneck. It's good to remove bottlenecks.

uyzstvqs•53m ago

Concurrent maximum efficiency + maximum availability is the way to go. This principle also applies to compute power and energy markets.

AlienRobot•41m ago

More media -> More speed demand -> More speed -> More media -> More speed demand...

lxgr•29m ago

Exactly. I'm glad I'm still in the return period for my 1TB microSD card!

IshKebab•1h ago

We must be reaching the limit at which video codecs can only achieve better quality by synthesizing details. That's already pretty prevalent in still images - phone cameras do it, and there are lots of AI resizing algorithms that do it.

It doesn't look like AV2 does any of that yet though fortunately (except film grain synthesis but I think that's fine).

meindnoch•1h ago

I think Fabrice Bellard have been experimenting with neural codecs like that.

magicalhippo•48m ago

The one I recall done by him was NNCP[1], a lossless compressor.

[1]: https://bellard.org/nncp/

throwaway48476•39m ago

Computings own Da Vinci.

ksec•48m ago

>We must be reaching the limit at which video codecs can only achieve better quality by synthesizing details.

Not quite yet as shown in H.267. But at some point the computational requirement vs bandwidth saving benefits would no longer make sense.

rarisma•1h ago

30% over AV1 is crazy, it doesn't feel too long since AV1 released but that was in 2019.

lxgr•33m ago

Yet I still only got hardware support for it on my first devices last year. The downside of "rapid" iteration on video codecs is that content needs to always be stored in multiple formats (or alternatively battery life on the client suffers from software playback, which is the route e.g. Youtube seems to be preferring).

amiga386•1m ago

Hopefully that improves. The guy giving the presentation on AV2 made clear there was "rigorous scrutiny for hardware decoding complexity", and they were advised by Realtek and AMD on this.

So it seems like they checked that all their ideas could be implemented efficiently in hardware as they went along, with advice from real hardware producers.

Hopefully AV2-capable hardware will appear much quicker than AV1-capable hardware did.

latexr•1h ago

I always thought the name AV1 was partly a play on/homage to AVI (Audio Video Interlace), but AV2 breaks that. Even if it’s meant to be embedded into other container formats such as MP4, there are files with the .av1 extension and there is a video/AV1 MIME type (and possibly a UTI?). Does this mean we now need to duplicate all that to .av2 and video/AV2? What about the AVIF file format?

ttoinou•1h ago

Do you mean the file extension should only reflect the file format and not the codecs it has inside ?

Maybe that’s what we did in the past and it was a bad idea. It’d be useful to know if you can read the file by looking only at its extension

galad87•1h ago

That wouldn't scale well, something like .av1opusflacwebvtt?

lxgr•31m ago

> Do you mean the file extension should only reflect the file format and not the codecs it has inside ?

That's pretty much always been the case. File extensions are just not expressive enough to capture all the nuances of audio and video codecs. MIME types are a bit better.

Audio is a bit of an exception with the popularity of MP3 (which is both a codec and a relatively minimal container format for it).

amiga386•5m ago

File extension shouldn't matter at all, because data should have associated metadata (e.g. HTTP content-type, CSS image-set, HTML <video><source type=""/></video>)

> It’d be useful to know if you can read the file by looking only at its extension

That would be madness, and there's already a workaround - the filename itself.

For most people, all that matters is an MKV file is a video file, and your configured player for this format is VLC. Only in a small number of cases does it matter about an "inner" format, or choice of parameter - e.g. for videos, what video codec or audio codec is in use, what the bitrate is, what the frame dimensions are.

For where it _matters_, people write "inner" file formats in the filename, e.g. "Gone With The Wind (1939) 1080p BluRay x265 HEVC FLAC GOONiES.mkv", to let prospective downloaders choose what to download from many competing encodings of exactly the same media.

Most people don't care, for example, that their Word document is A4 landscape, so much that they need to know _in the filename_.

uyzstvqs•1h ago

Files with the .av1 extension are for raw AV1 data. For AV2 this should become .av2, yes. That's by design, as they're two different incompatible formats. Typically you use a container like Matroska (.mkv, video/x-matroska), WebM or MP4 which contains your video stream with a type code specifying the codec (av01, av02).

AVIF is also a container format, and I believe should be adaptable to AV2, even if the name stands for "AV1 image format". It could simply just be renamed to AOMedia Video Image Format for correctness.

jeden•1h ago

I wait on new codec invented #AI

occz•59m ago

You'll be waiting for a long time then, probably. Making codecs is actually a hard problem, the type of thing that AI completely falls over when tasked with.

circuit10•46m ago

Considering AI is good at predicting things and that’s largely what compression does, I could see machine learning techniques being useful as a part of a codec though (which is a completely different thing from asking ChatGPT to write you a codec)

lyu07282•21m ago

Yeah in the future we might use some sort of learned spatial+temporal representation to compress video, same for audio. Its easier to imagine for audio: Instead of storing the audio samples, we store text + some feature vectors that uses some model to "render" the audio samples.

lxgr•27m ago

Compression is actually a very good use case for neural networks (i.e. don't have an LLM develop a codec, but rather train a neural network to do the compression itself).

It works amazingly well with text compression, for example: https://bellard.org/nncp/

Gigachad•1h ago

Is anyone else getting a cloudflare blocked on this page?

ksec•30m ago

Let's hope they get more things right 2nd time around. AOM will do Live Session on 20th of October: The Future of Innovation is Open [1].

May be more data and numbers. Including Encoding Complexity increase, decoding complexity. Hardware Decoder roadmap. Compliance and Test kits. Future Profile. Involvement and improvement to both AVIF the format and the AV2 image codec. Better than JPEG-XL? Are the ~30% BDRATE compared to current best AV1 encoder or AV1 1.0 as anchor point? Live Encoding improvements?

[1] https://aomedia.org/events/live-session-the-future-of-innova...

HTML's Best Kept Secret: The Output Tag

Daniel Kahneman opted for assisted suicide in Switzerland

AMD and Sony's PS6 chipset aims to rethink the current graphics pipeline

Vietnam Airlines Data Breach

I built physical album cards with NFC tags to teach my son music discovery

(Re)Introducing the Pebble Appstore

AV2 video codec delivers 30% lower bitrate than AV1, final spec due in late 2025

How hard do you have to hit a chicken to cook it? (2020)

Tangled, a Git collaboration platform built on atproto

Synthetic aperture radar autofocus and calibration

Intelligent Search in Rails with Typesense

Does our “need for speed” make our wi-fi suck?

Programming in the Sun: A Year with the Daylight Computer

Show HN: I invented a new generative model and got accepted to ICLR

Show HN: Semantic search over the National Gallery of Art

Show HN: A Digital Twin of my coffee roaster that runs in the browser

Lánczos Interpolation Explained (2022)

A Molecular Motor Minimizes Energy Waste

Automated code reviews via mutation testing

OpenGL: Mesh shaders in the current year

HATEOAS for Haunted Houses

Ryanair flight landed at Manchester airport with six minutes of fuel left

ThalamusDB: Query text, tables, images, and audio

After nine years of grinding, Replit found its market. Can it keep it?

Hardware Stockholm Syndrome

Show HN: Lights Out: my 2D Rubik's Cube-like Game

In a post-truth world truth-seeking is more important

Love C, hate C: Web framework memory problems

Let's Take Esoteric Programming Languages Seriously

Verge Genomics (YC S15) Is Hiring for Multiple Engineering and Product Roles

HTML's Best Kept Secret: The Output Tag

Daniel Kahneman opted for assisted suicide in Switzerland

AMD and Sony's PS6 chipset aims to rethink the current graphics pipeline

Vietnam Airlines Data Breach

I built physical album cards with NFC tags to teach my son music discovery

(Re)Introducing the Pebble Appstore

AV2 video codec delivers 30% lower bitrate than AV1, final spec due in late 2025

How hard do you have to hit a chicken to cook it? (2020)

Tangled, a Git collaboration platform built on atproto

Synthetic aperture radar autofocus and calibration

Intelligent Search in Rails with Typesense

Does our “need for speed” make our wi-fi suck?

Programming in the Sun: A Year with the Daylight Computer

Show HN: I invented a new generative model and got accepted to ICLR

Show HN: Semantic search over the National Gallery of Art

Show HN: A Digital Twin of my coffee roaster that runs in the browser

Lánczos Interpolation Explained (2022)

A Molecular Motor Minimizes Energy Waste

Automated code reviews via mutation testing

OpenGL: Mesh shaders in the current year

HATEOAS for Haunted Houses

Ryanair flight landed at Manchester airport with six minutes of fuel left

ThalamusDB: Query text, tables, images, and audio

After nine years of grinding, Replit found its market. Can it keep it?

Hardware Stockholm Syndrome

Show HN: Lights Out: my 2D Rubik's Cube-like Game

In a post-truth world truth-seeking is more important

Love C, hate C: Web framework memory problems

Let's Take Esoteric Programming Languages Seriously

Verge Genomics (YC S15) Is Hiring for Multiple Engineering and Product Roles

AV2 video codec delivers 30% lower bitrate than AV1, final spec due in late 2025

Comments