frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I fixed FFmpeg's subtitle conversion (the bug from 2014)

https://connollydavid.github.io/pgs-release/
27•slartibardfast0•3d ago
FFmpeg converts everything except subtitles across format boundaries. SRT to Blu-ray PGS? "Subtitle encoding currently only possible from text to text or bitmap to bitmap." Ticket #3819, filed 2014.

I built this with Claude Code over a few weeks. Claude wrote most of the encoder, found an integer overflow in the decoder buffer tracking, and ran review from five angles. I read the Panasonic and Sony patents, made the architectural calls, and told it when it was wrong about the spec. We argued about whether DTS computation belongs in the muxer. (It does, and also in fftools. We did both.)

Animation is an interesting problem. Advanced SubStation Alpha fades have to survive conversion to Blu-ray's PGS format. The encoder watches pixel changes between frames and classifies them: palette shift or full redraw. Fades become palette-only updates, no bitmap retransmission. Overlapping subtitles with different end times took four rewrites and an event lookahead window.

I'd like to maintain this properly and get the patches upstream eventually. If you hit a bug or have a subtitle workflow that doesn't work, open an issue. I'm dead curious what people do with this, but i have some plans for translation related plugins building on the OCR work.

Six iterations. 23 patches. libass and Tesseract were already in FFmpeg's filter library. I wired them into the main pipeline the same way sub2video works. Text to bitmap, bitmap to text, 114 OCR languages, RGBA-to-GIF. The development page has the history.

Pre-built for 6 platforms, no dependencies: https://connollydavid.github.io/pgs-release/

Comments

gus_massa•1d ago
Where is the source code? https://github.com/connollydavid/FFmpeg/tree/pgs3-8.0.1 ?
slartibardfast0•17m ago
https://github.com/connollydavid/FFmpeg/tree/pgs7-8.1

is the most recent.

I'm taking a phased approach with an agentic host repo that drives the source changes in FFmpeg. generally, i do a "sprint" for adding features, then a sprint reorganizing for coherence with an eye on possible upstreaming.

gyan•39m ago
If you do open PRs at code.ffmpeg.org, remember to disclose and describe the AI contribution. You will also need to add FATE tests.
slartibardfast0•17m ago
oh yes, every commit is fully open about Claude Code + the model.

honestly, it would've taken me about 2 years to do this otherwise.

there are many FATE tests added and a lot more to add when the hardware verification bits i'm planning are done.

edit: do you know if upstream have ever taken an AI assisted commit? i've not seen anything yet and would hate to be controversial or a burden on hardworking reviewers!

KomoD•6m ago
Open since 2014? No? It was opened in 2014 and closed as fixed in 2014.

OpenCode – Open source AI coding agent

https://opencode.ai/
1005•rbanffy•17h ago•476 comments

Show HN: I fixed FFmpeg's subtitle conversion (the bug from 2014)

https://connollydavid.github.io/pgs-release/
27•slartibardfast0•3d ago•6 comments

Why western carmakers' retreat from electric risks dooming them to irrelevance

https://www.theguardian.com/business/2026/mar/21/west-carmakers-retreat-electric-vehicle-risks-ir...
66•n1b0m•1h ago•67 comments

Mamba-3

https://www.together.ai/blog/mamba-3
204•matt_d•3d ago•41 comments

Mayor of Paris removed parking spaces, "drastically" reduced the number of cars

https://www.cnn.com/2026/03/21/travel/paris-transformation-anne-hidalgo-mayor
108•heresie-dabord•1h ago•86 comments

Meta's Omnilingual MT for 1,600 Languages

https://ai.meta.com/research/publications/omnilingual-mt-machine-translation-for-1600-languages/?...
41•j0e1•3d ago•7 comments

FFmpeg 101 (2024)

https://blogs.igalia.com/llepage/ffmpeg-101/
143•vinhnx•12h ago•5 comments

A Japanese glossary of chopsticks faux pas (2022)

https://www.nippon.com/en/japan-data/h01362/
355•cainxinth•18h ago•275 comments

Blocking Internet Archive Won't Stop AI, but Will Erase Web's Historical Record

https://www.eff.org/deeplinks/2026/03/blocking-internet-archive-wont-stop-ai-it-will-erase-webs-h...
235•pabs3•7h ago•67 comments

Fujifilm X RAW STUDIO webapp clone

https://github.com/eggricesoy/filmkit
106•notcodingtoday•2d ago•39 comments

Molly Guard

https://bookofjoe2.blogspot.com/2026/02/molly-guard.html
152•surprisetalk•1d ago•66 comments

How HN: Ironkernel – Python expressions, Rust parallel

https://github.com/YuminosukeSato/ironkernel
9•acc_10000•2d ago•3 comments

The Story of Marina Abramovic and Ulay (2020)

https://www.sydney-yaeko.com/artsandculture/marina-and-ulay
23•NaOH•2d ago•15 comments

Ghostling

https://github.com/ghostty-org/ghostling
259•bjornroberg•16h ago•48 comments

Linux Applications Programming by Example: The Fundamental APIs (2nd Edition)

https://github.com/arnoldrobbins/LinuxByExample-2e
130•teleforce•14h ago•16 comments

Just make it hard to fail

https://nekolucifer.substack.com/p/just-make-it-really-hard-to-fail
13•andai•1h ago•7 comments

The Los Angeles Aqueduct Is Wild

https://practical.engineering/blog/2026/3/17/the-los-angeles-aqueduct-is-wild
388•michaefe•3d ago•192 comments

We rewrote our Rust WASM parser in TypeScript and it got faster

https://www.openui.com/blog/rust-wasm-parser
248•zahlekhan•17h ago•157 comments

Attention Residuals

https://github.com/MoonshotAI/Attention-Residuals
202•GaggiX•20h ago•28 comments

The worst volume control UI in the world (2017)

https://uxdesign.cc/the-worst-volume-control-ui-in-the-world-60713dc86950
179•andsoitis•3d ago•87 comments

Padel Chess – tactical simulator for padel

https://www.padelchess.me/
57•AlexGerasim•3d ago•29 comments

Cryptography in Home Entertainment (2004)

https://mathweb.ucsd.edu/~crypto/Projects/MarkBarry/
64•rvnx•2d ago•35 comments

Show HN: We built a terminal-only Bluesky / AT Proto client written in Fortran

https://github.com/FormerLab/fortransky
110•FormerLabFred•16h ago•55 comments

Turing Award Honors Bennett and Brassard for Quantum Information Science

https://amturing.acm.org
53•throw0101d•3d ago•0 comments

Atuin v18.13 – better search, a PTY proxy, and AI for your shell

https://blog.atuin.sh/atuin-v18-13/
66•cenanozen•4h ago•51 comments

France's aircraft carrier located in real time by Le Monde through fitness app

https://www.lemonde.fr/en/international/article/2026/03/20/stravaleaks-france-s-aircraft-carrier-...
595•MrDresden•1d ago•475 comments

The Ugliest Airplane: An Appreciation

https://www.smithsonianmag.com/air-space-magazine/ugliest-airplane-appreciation-180978708/
97•randycupertino•2d ago•48 comments

VisiCalc Reconstructed

https://zserge.com/posts/visicalc/
216•ingve•4d ago•79 comments

Lent and Lisp

https://leancrew.com/all-this/2026/02/lent-and-lisp/
69•surprisetalk•3d ago•3 comments

ArXiv declares independence from Cornell

https://www.science.org/content/article/arxiv-pioneering-preprint-server-declares-independence-co...
774•bookstore-romeo•1d ago•268 comments