Show HN: FFmpeg in plain English – LLM-assisted FFmpeg in the browser

https://vidmix.app/ffmpeg-in-plain-english/
95•bjano•3d ago
I found that I am using ChatGPT more and more to get the FFmpeg command I need, but the process can be a bit tedious: copy-pasting commands, dealing with input file names and locations, making sure the prompt contains enough info about the input files.

This site attempts to solve that. You describe what you want to do and pick the input files, and an LLM (currently DeepSeek) generates the FFmpeg command. You can then run it directly in your browser or use the command elsewhere.

Comments

orderone_ai•8h ago
This is such a convenient tool for a casual user, and a great application of an LLM to a narrow task that probably couldn't be handled quite so easily everywhere. Also a great example of the emerging 'chat driven' UX trend, which I'm really liking.
yakattak•7h ago
Honestly, what a great application of LLMs. FFmpeg is a very powerful tool and, as with most powerful tools, it is very complicated to run correctly. Do the files get uploaded, though? Or does it just grab the location on disk?
26d0•6h ago
This appears to use ffmpeg.wasm
danielvaughn•7h ago
I tried to implement something very similar recently, and had the hardest time getting the LLM to produce anything remotely resembling actual ffmpeg commands.
cchance•7h ago
load up the man pages into the context maybe?
ilaksh•6h ago
You were using a weak LLM then. The difference between one of the leading edge LLMs like Gemini 2.5 Pro, o3, or Claude 4 and an average LLM or one you can run on your typical PC/laptop is night and day.
jeanlucas•6h ago
For real? To me it works fine with just ChatGPT (free).
toddmorey•7h ago
The Warp terminal[1] is excellent for this type of thing. In agent mode, you just describe what you want to have happen and it generates the proper command(s) (that you can approve before running).

I use it a lot to convert videos, turn a folder of TIFF files into PNGs at 1/2 size, etc. It's great at generating FFmpeg commands and chaining the right tools together.

[1] https://www.warp.dev
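
For reference, the TIFF-to-PNG task mentioned above can be sketched as one FFmpeg invocation per file. This is a minimal illustration, not what Warp generates: the helper name `half_size_png_commands` is hypothetical, and it only builds argument lists rather than running anything. The `scale=iw/2:ih/2` filter halves the input width and height.

```python
from __future__ import annotations

from pathlib import Path
from typing import Iterator


def half_size_png_commands(folder: str) -> Iterator[list[str]]:
    """Yield one ffmpeg argv list per .tif/.tiff file found in `folder`."""
    for tiff in sorted(Path(folder).glob("*.tif*")):
        yield [
            "ffmpeg",
            "-i", str(tiff),
            "-vf", "scale=iw/2:ih/2",        # half the input width and height
            str(tiff.with_suffix(".png")),   # same name, .png extension
        ]
```

Each list can be passed to `subprocess.run(cmd, check=True)` once you have reviewed it.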

jjcm•6h ago
Great work on this - I made a terminal command similar to this (llmpeg), and was actively trying to get exactly this working - a webasm compiled version of ffmpeg that could encode in the browser. I for the life of me couldn't get the provided examples on https://github.com/ffmpegwasm/ffmpeg.wasm to run.

Just for my own development curiosity, was there anything specific you had to do to get ffmpegwasm to work?

nzoschke•6h ago
I don’t know about the browser but I’ve been playing with a WASM build through this Go wrapper. Nice to not need CGO.

https://codeberg.org/gruf/go-ffmpreg

zahlman•4h ago
I'm a little confused as to how someone makes an "mpreg" joke in the project name, but censors "lame" in the README.
aspenmayer•2h ago
Lame could be considered an ableist slur, I guess?
iamflimflam1•5h ago
I had the same problem - would love to see a working example.
dheera•5h ago
I made a script of this sort too originally called "ffmpeg-english" last year. Interesting how people reacted to it:

https://news.ycombinator.com/item?id=40410637

I then made a more general version of it for all commands:

https://github.com/dheera/scripts/blob/master/helpme

Example usage:

    $ helpme ffmpeg assemble all the .jpg files into an .mp4 timelapse video at 8fps
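
A request like the one above typically maps to a command along the following lines. This is a sketch, not the output of `helpme`: the function name is hypothetical, `-pattern_type glob` is not available in every FFmpeg build (notably on Windows), and `libx264` assumes a build with that encoder.

```python
from __future__ import annotations

import shlex


def timelapse_command(pattern: str, output: str, fps: int = 8) -> list[str]:
    """Build an ffmpeg argv list that assembles matching images into a video."""
    return [
        "ffmpeg",
        "-framerate", str(fps),     # input rate: each image lasts 1/fps seconds
        "-pattern_type", "glob",    # treat the -i argument as a glob pattern
        "-i", pattern,
        "-c:v", "libx264",
        "-pix_fmt", "yuv420p",      # pixel format most players can decode
        output,
    ]


cmd = timelapse_command("*.jpg", "timelapse.mp4")
print(shlex.join(cmd))  # shell-quoted form, safe to paste into a terminal
```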
bsenftner•6h ago
I think you need to add some liberal filename handling. I have directories of videos generated by various AI video models, and they have spaces in the filenames, not just one, but the prompt to generate the video plus the major parameters are all the filename. They are long, pains in the ass to work with, but they are there. Would be nice if your tool could work with them.
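
One common way to make long filenames with spaces safe, sketched here under the assumption that the command is run from a script rather than pasted into a shell: keep the command as a list of arguments so no shell ever splits the name, and quote it only when displaying it. The helper names and the example filename are hypothetical.

```python
from __future__ import annotations

import shlex


def safe_path(name: str) -> str:
    """Prefix names starting with '-' so ffmpeg does not read them as options."""
    return "./" + name if name.startswith("-") else name


def convert_command(src: str, dst: str) -> list[str]:
    """Build an argv list; each filename stays one argument, spaces included."""
    return ["ffmpeg", "-i", safe_path(src), safe_path(dst)]


name = "a cat riding a bike, seed 42, steps 30.mp4"
cmd = convert_command(name, "out.mp4")

# Running it needs no quoting at all: subprocess.run(cmd, check=True)
# For display or copy-paste, shlex.join adds the quoting a POSIX shell needs.
printable = shlex.join(cmd)
```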
pdyc•5h ago
Interesting, I created a video editor to generate ffmpeg commands because it was difficult to adjust them on the command line. It lets you tweak ffmpeg commands visually: https://newbeelearn.com/tools/videoeditor/
somesun•3h ago
A similar website: https://ffmpeg.app/
p0w3n3d•2h ago

  I'm sorry Dave, I'm afraid I cannot do that
runeks•29m ago
How do you know my name?!
jjbinx007•1h ago
I recently used Gemini to help with some dashcam videos that weren't being saved properly. I was sure most of the data were there but the files wouldn't play in VLC or MPC, so I asked Gemini.

It suggested various things to try, and after pasting in the error messages each time it suggested more and more radical things. Eventually it suggested a program called Untrunc, where you give it a working video file as a reference and then the file that's faulty and as if by magic it worked!

Just wanted to mention this in case anyone else is struggling to get FFmpeg to repair a file.

pjc50•1h ago
Interesting - I took a look at how this works and why it needs a reference, and the answer is the usual one of the 'moov atom': a critical piece of metadata. Lots of programs output it at the end of the file, but that makes it vulnerable to truncation, and for streaming it's useful to move it to the beginning of the file. ffmpeg refers to this as "fast start".
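
To make the 'moov atom' point concrete, here is a minimal sketch that walks the top-level boxes of an MP4 file and reports whether `moov` comes before `mdat`. It assumes the standard ISO base media box header (a 4-byte big-endian size and a 4-byte type, with size 1 meaning a 64-bit size follows and size 0 meaning the box runs to end of file). The function names are illustrative, and this is not how Untrunc works internally.

```python
from __future__ import annotations

import struct
from typing import BinaryIO, Iterator, Optional


def top_level_boxes(f: BinaryIO) -> Iterator[tuple[str, int]]:
    """Yield (type, declared size) for each top-level box in the stream."""
    while True:
        header = f.read(8)
        if len(header) < 8:
            return                      # end of file (or truncated header)
        size, box_type = struct.unpack(">I4s", header)
        header_len = 8
        if size == 1:                   # 64-bit size follows the type field
            ext = f.read(8)
            if len(ext) < 8:
                return
            size = struct.unpack(">Q", ext)[0]
            header_len = 16
        name = box_type.decode("latin-1")
        if size == 0:                   # box extends to the end of the file
            yield name, 0
            return
        if size < header_len:
            return                      # corrupt header, stop scanning
        yield name, size
        f.seek(size - header_len, 1)    # skip the payload


def moov_before_mdat(f: BinaryIO) -> Optional[bool]:
    """True if moov precedes mdat, False if it follows, None if moov is missing."""
    seen_mdat = False
    for name, _ in top_level_boxes(f):
        if name == "moov":
            return not seen_mdat
        if name == "mdat":
            seen_mdat = True
    return None


def faststart_command(src: str, dst: str) -> list[str]:
    """Argv for remuxing without re-encoding, moving moov to the front."""
    return ["ffmpeg", "-i", src, "-c", "copy", "-movflags", "+faststart", dst]
```

A result of `None` on a file that should contain video is the truncation case described above: the media data is present but the index that describes it was never written.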
