FFmpeg Assembly Language Lessons

127•flykespice•2h ago

Comments

cr125rider•2h ago

I can’t imagine the scale that FFMPEG operates at. A small improvement has to be thousands and thousands of hours of compute saved. Insanely useful project.

prisenco•1h ago

Their commitment to performance is a beautiful thing.

Imagine all projects were similarly committed.

byteknight•1h ago

Seems so easy! You only need the entire world even tangentially related to video to rely solely on your project for a task and you too can have all the developers you need to work on performance!

ackfoobar•45m ago

I seem to recall that they lamented on twitter the low amount of (monetary or code) contribution they got, despite how heavily they are used.

hluska•15m ago

You know friend, if open source actually worked like that I wouldn’t be so allergic to releasing projects. But it doesn’t - a large swath of the economy depends on unpaid labour being treated poorly by people who won’t or can’t contribute.

Almondsetat•11m ago

Yeah no, I'd like non-performance critical programs to focus on other things than performance thank you

therealmarv•7m ago

like Slack or Jira... lol.

zahlman•1h ago

It'd be nice, though, to have a proper API (in the traditional sense, not SaaS) instead of having to figure out these command lines in what's practically its own programming language....

codys•1h ago

FFMpeg does have an API. It ships a few libraries (libavcodec, libavformat, and others) which expose a C api that is used in the ffmpeg command line tool.

They publish doxygen generated documentation for the APIs, available here: https://ffmpeg.org/doxygen/trunk/

zahlman•1h ago

Don't know how I overlooked that, thanks. Maybe because the one Python wrapper I know about is generating command lines and making subprocess calls.

javier2•37m ago

If you are processing user data, the subprocess approach makes it easier to handle bogus or corrupt data. If something is off, you can just kill the subprocess. If something is wrong with the linked C api, it can be harder to handle predictably.

Wowfunhappy•36m ago

They're relatively low level APIs. Great if you're a C developer, but for python just calling the command line probably does make more sense.

ansk•2m ago

For future reference, if you want proper python bindings for ffmpeg* you should use pyav.

* To be more precise, these are bindings for the libav* libraries that underlie ffmpeg

xxpor•23m ago

I get why the CLI is so complicated, but I will say AI has been great at figuring out what I need to run given an English language input. It's been one of the highest value uses of AI for me.

KwanEsq•1h ago

Prior discussion 2025-02-22, 222 comments: https://news.ycombinator.com/item?id=43140614

sylware•1h ago

There is serious abuse of nasm macro-preprocessor. Going to be tough to move away to another assembler.

oguz-ismail•1h ago

Where? There's very little code in those lessons

pveierland•1h ago

The lessons reference `cglobal` in `x86inc.asm`:

https://github.com/FFmpeg/FFmpeg/blob/master/libavutil/x86/x...

loeg•1h ago

Why move away?

ngcc_hk•1h ago

More interesting than I thought it could be. A domain specific tutorial is so much better.

Alifatisk•1h ago

How do they make these assembly instructions portable across different cpus?

KeplerBoy•1h ago

They don't. It's just x86-64.

ahartmetz•1h ago

The lessons yes, but the repo contains assembly for the 5-6 architectures in wide use in consumer hardware today. Separate files of course. https://github.com/FFmpeg/FFmpeg/tree/master/libavcodec

KeplerBoy•14m ago

Yeah, sure. I was specifically referring to the tutorials. Ffmpeg needs to run everywhere, although I believe they are more concerned about data center hardware than consumer hardware. So probably also stuff like power pc.

CannotCarrot•1h ago

I think there's a generic C fallback, which can also serve as a baseline. But for the big (targeted) architectures, there one handwritten assembly version per arch.

faluzure•47m ago

Yup.

On startup, it runs cpuid and assigns each operation the most optimal function pointer for that architecture.

In addition to things like ‘supports avx’ or ‘supports sse4’ some operations even have more explicit checks like ‘is a fifth generation celeron’. The level of optimization in that case was optimizing around the cache architecture on the cpu iirc.

Source: I did some dirty things with chromes native client and ffmpeg 10 years ago.

NullCascade•1h ago

What is the actual process of identifying hotspots caused suboptimal compiler generated assembly?

Would it ever make sense to write handwritten compiler intermediate representation like LLVM IR instead of architecture-specific assembly?

molticrystal•56m ago

It would be interesting to look into this to see if anybody has every hand tuned LLVM IR.

My best guess is you were doing codegen for several different instruction sets and the optimization or side channel prevention is something that would be too difficult or specialized to automate so you have to do it by hand.

nisten•38m ago

I feel like I just got a 3 page intro to autism.

It's glorious.

abhisek•31m ago

Love it. Thanks for taking the time to write this. Hope it will encourage more folks to contribute.

FFmpeg Assembly Language Lessons

Show HN: I built an app to block Shorts and Reels

Who Invented Backpropagation?

The Weight of a Cell

Launch HN: Reality Defender (YC W22) – API for Deepfake and GenAI Detection

Web apps in a single, portable, self-updating, vanilla HTML file

Show HN: A Minimal Hacker News Reader for Apple Watch Built with SwiftUI

Typechecker Zoo

The Road That Killed Legend Jenkins Was Working as Designed

Walkie-Textie Wireless Communicator

A gigantic jet caught on camera: A spritacular moment for NASA astronaut

Electromechanical reshaping, an alternative to laser eye surgery

The Coming Robot Home Invasion

Image Fulgurator (2011)

Win10 users looking for a new OS? Apple $599 MacBook can't come at a better time

Sky Calendar

MCP doesn't need tools, it needs code

SystemD Service Hardening

Class-action suit claims Otter AI records private work conversations

Vibe coding tips and tricks

MCP tools with dependent types

8x19 Text Mode Font Origins

Texas law gives grid operator power to disconnect data centers during crisis

The Lives and Loves of James Baldwin

When you're asking AI chatbots for answers, they're data-mining you

Weather Radar APIs in 2025: A Founder's Complete Market Overview

Scientists discover surprising language 'shortcuts' in birdsong – like humans

LLMs and coding agents are a security nightmare

Llama-Scan: Convert PDFs to Text W Local LLMs

AI is predominantly replacing outsourced, offshore workers

FFmpeg Assembly Language Lessons

Comments

FFmpeg Assembly Language Lessons

Show HN: I built an app to block Shorts and Reels

Who Invented Backpropagation?

The Weight of a Cell

Launch HN: Reality Defender (YC W22) – API for Deepfake and GenAI Detection

Web apps in a single, portable, self-updating, vanilla HTML file

Show HN: A Minimal Hacker News Reader for Apple Watch Built with SwiftUI

Typechecker Zoo

The Road That Killed Legend Jenkins Was Working as Designed

Walkie-Textie Wireless Communicator

A gigantic jet caught on camera: A spritacular moment for NASA astronaut

Electromechanical reshaping, an alternative to laser eye surgery

The Coming Robot Home Invasion

Image Fulgurator (2011)

Win10 users looking for a new OS? Apple $599 MacBook can't come at a better time

Sky Calendar

MCP doesn't need tools, it needs code

SystemD Service Hardening

Class-action suit claims Otter AI records private work conversations

Vibe coding tips and tricks

MCP tools with dependent types

8x19 Text Mode Font Origins

Texas law gives grid operator power to disconnect data centers during crisis

The Lives and Loves of James Baldwin

When you're asking AI chatbots for answers, they're data-mining you

Weather Radar APIs in 2025: A Founder's Complete Market Overview

Scientists discover surprising language 'shortcuts' in birdsong – like humans

LLMs and coding agents are a security nightmare

Llama-Scan: Convert PDFs to Text W Local LLMs

AI is predominantly replacing outsourced, offshore workers