frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

SUSE Enterprise Linux 16 is here, and its killer feature is digital sovereignty

https://www.zdnet.com/article/suse-enterprise-linux-16-is-here-and-its-killer-feature-is-digital-...
1•CrankyBear•34s ago•0 comments

Show HN: I built a CLI tool to automatically break up large PRs

https://github.com/DiffEnder/pr-splitter
1•smith-kyle•1m ago•0 comments

The Man Who Invented AGI

https://www.wired.com/story/the-man-who-invented-agi/
1•CharlesW•2m ago•0 comments

Concrete's Greatest Weakness

https://practical.engineering/blog/2025/11/4/concretes-greatest-weakness
1•crescit_eundo•2m ago•0 comments

Make 2025 the Year of Maximum Enthusiasm

https://semi-rad.com/2025/01/make-2025-the-year-of-maximum-enthusiasm/
1•mooreds•3m ago•0 comments

Does momentum exist in prediction markets? A short analysis

https://nodumbideas.com/p/does-momentum-exist-in-prediction
1•nodumbideas•3m ago•0 comments

Ask HN: Recommend a Source Code Pro -like font for reading (ebooks)

1•nile-crocodile•4m ago•0 comments

I Taught an AI to Dream

https://blog.minibase.ai/p/i-taught-an-ai-to-dream
1•mkmccarty3•4m ago•1 comments

Voxel Grid Visibility

https://cod.ifies.com/voxel-visibility/
3•Scaevolus•5m ago•0 comments

We need to give LLMs continuous-time inputs and outputs

https://matml.bearblog.dev/we-need-to-give-llms-human-like-vision/
1•tokenomics•5m ago•0 comments

Humongous Numbers in Python

https://www.tomechangosubanana.com/2025/humongous-numbers-in-python/
1•speckx•6m ago•0 comments

Fun with macOS dtruss (2022)

https://www.deepanseeralan.com/tech/fun-with-dtruss-macOS/
1•indigodaddy•7m ago•0 comments

Requiem for the Rangefinder

https://www.lux.camera/requiem-iphone-air/
1•tosh•8m ago•0 comments

Micron memory fab project poised to transform Central New York economy

https://www.fingerlakes1.com/2025/10/19/micron-project-poised-to-transform-central-new-york-economy/
1•walterbell•9m ago•0 comments

What Happened When Small-Town America Became Data Center, U.S.A.

https://www.wsj.com/us-news/what-happened-when-small-town-america-became-data-center-u-s-a-410f25e9
2•ChrisArchitect•11m ago•1 comments

After 35 Years, a Solution to the CIA's Kryptos Puzzle Has Been Found

https://www.scientificamerican.com/article/a-solution-to-the-cias-kryptos-code-is-found-after-35-...
1•freeqaz•11m ago•1 comments

Show HN: macOS menu bar app that monitors your Ethernet connection status

https://github.com/montanaflynn/EthernetConnectionStatus
1•anonfunction•12m ago•1 comments

Think Weirder TOC

https://compellingsciencefiction.com/posts/think-weirder-is-released.html
1•mojoe•12m ago•0 comments

Git's First Major Release in 11 Years: What's in Git 3.0

https://www.deployhq.com/blog/git-3-0-on-the-horizon-what-git-users-need-to-know-about-the-next-m...
4•deployhq•13m ago•1 comments

YouTube AI error costs creator his channel over alleged link to Japanese account

https://piunikaweb.com/2025/11/04/youtube-ai-error-terminates-enderman-channel/
2•rabinovich•14m ago•0 comments

Ask HN: What is your doomsday plan for your passwords

1•ivape•16m ago•4 comments

Learn GPU Programming with Mojo GPU Puzzles Tutorial – Introduction

https://www.youtube.com/watch?v=-VsP4kT6DjA
1•mdunnoconnor•16m ago•0 comments

DevTrends MCP – Real-Time Developer Intelligence for AI Coding Assistants

https://apify.com/peghin/devtrends-mcp
1•PEGHIN•17m ago•1 comments

Ask HN: Ultrawide monitors, yay or nay? If yay, what brand?

1•embedding-shape•18m ago•1 comments

When a "feature" is worse than a bug (Hello,Sketchfab)

https://medium.com/@egorich42/when-a-feature-is-worse-than-a-bug-hello-sketchfab-b88a10d52511
1•Egorich42•18m ago•0 comments

U.S. Military Task Force Provides Aid, Disaster Relief to Jamaica

https://news.usni.org/2025/10/31/u-s-military-task-force-provides-aid-disaster-relief-to-jamaica
1•Bender•20m ago•0 comments

Show HN: First Principles articles teaching a variety of languages

https://github.com/InfiniteConsult/FromFirstPrinciples
1•warren_jitsing•20m ago•0 comments

USDA Threatens Stores Giving Discounts to People on Food Stamps

https://newrepublic.com/post/202604/usda-threatens-grocery-stores-discounts-people-food-stamps
5•mrtesthah•20m ago•2 comments

Show HN: Self-hosted community platform with live chat and SEO friendly

https://baklab.app
1•Kholin•20m ago•0 comments

Using LibreOffice and other Free software for documents as a lawyer

https://neilzone.co.uk/2025/11/using-libreoffice-and-other-free-software-for-documents-as-a-lawyer/
1•speckx•21m ago•0 comments
Open in hackernews

Chaining FFmpeg with a Browser Agent

https://100x.bot/a/chaining-ffmpeg-with-browser-agent
48•shardullavekar•3h ago

Comments

sylware•3h ago
HTML <video> or <audio> element with "Streaming" URLs passed to the media player (or internally in the web browser for the big ones).
utopiah•3h ago
Have to admit, ffmpeg syntax is not trivial... but also the project is 24 years old and is basically the defacto industry standard. If you believe you will still be editing videos in 20 years with the CLI (or any other tool or any programming language) wrapping it then it's probably worth few hours learning how it actually works.
shardullavekar•3h ago
true, companies like Descript, Veed, or Kapwing exist because no coders find this syntax intimidating. Plus, a CLI tool stands out of a workflow. We wanted to change that.
petetnt•2h ago
Don't "no coders" find the concepts described in this article imdimitating?

The article states that whatever the article is trying to describe "Takes about ~20-30 mins. The cognitive load is high....". while their literal actual step of "Googling "ffmpeg combine static image and audio."" gives you the literal command you need to run from a known source (superuser.com sourced from ffmpeg wiki).

Anyone even slightly familiar with ffmpeg should be able to produce the same result in minutes. For someone who doesn't understand what ffmpeg is the article means absolutely nothing. How does a "no coder" understand what a "agent in a sandboxed container" is?

shardullavekar•2h ago
we took a basic example and described it. (will try adding a complex one)

we have our designer/intern in our minds who creates shorts, adds subtiles, crops them,and merges the audio generated. He is aware of ffmpeg and prefers using a SaaS UI on top of it.

However, we see him hanging out on chatgpt, or gemini all the time. He is literally the no coder we have in mind.

We just combined his type what you want + ffmpeg workflows.

EraYaN•1h ago
Wouldn't that intern just use an NLE (be it Premiere, Davinci Resole etc) anyway? If you need to style subtitles and edit shorts and video content, you'll need a proper editor anyway.
shardullavekar•1h ago
1. download a larger video from s3. 2. Use NLE and cut it into shorts. (crop, resize, subtitles etc.) 3. Upload shorts on YouTube, Instagram, Tiktok.

He does use davinci resolve but only for 2.

NLEs make ffmpeg a standalone yet easy to use tool.

Not denying that major heavy lifting is done by the NLE. We go a step ahead and make it embeddable in a larger workflow.

artpar•3h ago
I think that goes with almost every tool you want to use with llm. User should already know the tool ideally so mistakes by llm can be prevented before they happen.

Here making ffmpeg as "just another capability" allows it to be stitched together in workflows

jack_pp•3h ago
I agree, I suggest using this instead : https://github.com/kkroening/ffmpeg-python . While not perfect once you figure it out it is far easier to use and you can wrap more complicated workflows and reuse them later.
poly2it•2h ago
Kkroening's wrapper has been inactive for some time. I suggest using https://github.com/jonghwanhyeon/python-ffmpeg instead. It has proper async support and a better API.
jack_pp•2h ago
Thing is, if you want to use LLMs for mockups you got to use the old one.
esperent•2h ago
The syntax isn't too bad. The problem is that I have to use it a couple of times a year, on average. So every time I've forgotten and have to relearn. This doesn't happen with GUIs nearly as much, and it's why I prefer them over CLI tools for anything that I don't do at least once every week or two.
skydhash•1h ago
That’s why you write scripts, or put a couple snippets in your notes.
Sean-Der•2h ago
My question/curiosity is why do so many people use ffmpeg (frustrated by the syntax) when GStreamer is available?

`gst-launch-1.0 filesrc ! qt4demux ! matroskamux ! filesink...` people would be less frustrated maybe?

People would also learn a little more and be less frustrated when conversation about container/codec/colorspace etc... come up. Each have a dedicated element and you can better understand its I/O

artpar•2h ago
I did not know gstreamer wasm also exists, I'll check it out
goeiedaggoeie•2h ago
Still has a way to go, but very exciting.
jack_pp•3h ago
As someone who has used ffmpeg for 10+ years maintaining a relatively complex backend service that's basically a JSON to ffmpeg translator I did not fully understand this article.

Like the Before vs after section doesn't even seem to create the same thing, the before has no speedup, the after does.

In the end it seems they basically created a few services ("recipes") that they can reuse to do simple stuff like speed-up 2x or combine audio / video or whatever

shardullavekar•3h ago
thanks for calling it out, I will correct the before vs after section. But you can describe any ffmpeg capability in plain English and the underlying ffmpeg tool call takes care of it.
jack_pp•3h ago
I have written a lot of ffmpeg-python and plain ffmpeg commands using LLMs and while I am amazed at how good Gemini or chatGPT can handle ffmpeg prompts it is still not 100% so this seems to me like a big gamble on your part. However it might work for most users that only ask for simple things.
shardullavekar•2h ago
so creators on 100x will create well defined workflows that others can reuse. If a workflow is not found, llm creates one on the go and saves it.
jack_pp•2h ago
That sounds good, save the LLM generated workflows and have them edited by more seasoned users.

Or you could go one step further and create a special workflow which would allow you to define some inputs and iterate with an LLM until the user gets what he wants but for this you would need to generate outputs and have the user validate what the LLM has created before finally saving the recipe.

shardullavekar•1h ago
That's exactly how it is implemented!
IsTom•3h ago
> Half of scripting FFmpeg is just fighting with shell quote escaping for filter_complex.

-filter_complex_script is a thing

4gotunameagain•3h ago
This is yc propping up a startup they have backed, there isn't much substance here.
coachgodzup•2h ago
I considered FFmpeg a great project, but I usually avoid to use it directly because of his quite complex syntax. I'm reconsidering it because coupled with an llm is very straightforward and more immediate than an usual graphical editor
orbital-decay•1h ago
At some point command line becomes unwieldy. FFmpeg would definitely benefit from a non-arcane DSL like AviSynth or a node-based UI.
skeeter2020•1h ago
This doesn't make any sense; the Before and After examples accomplish different things. I also don't get who the target audience is; people intimidated by a CLI tool but at home with technical agents?
shardullavekar•1h ago
people intimidated by a CLI tool but find tools like chatgpt easy to use and those who have video editing as a part of larger workflow.
sanjit•1h ago
An aside but related?

FFmpeg has complex syntax because it’s dealing with the _complexity of video_. I agree with everyone about knowing (and helping create or contribute to) our tools.

Today I largely forget about the _legacy_ of video, the technical challenges, and how critical it was to get it right.

There are an incredible number of output formats and considerations for _current_ screens (desktop, tablet, mobile, tv, etc…). Then we have a whole other world on the creation side for capture, edit, live broadcast…

On legacy formats it used to be so complex with standards, requirements, and evolving formats. Today, we don’t even think about why we have 29.97fps around? Interlacing?

We have a mix of so many incredible (and sometimes frustrating) codecs, needs and final outputs, so it’s really amazing the power we have with a tool like FFmpeg… It’s daunting but really well thought out.

So just a big thanks to the FFmpeg team for all their incredible work over the years…

shardullavekar•57m ago
no 2nd thoughts about it, we are only making ffmpeg more accessible and embeddable.
echelon•5m ago
> FFmpeg has complex syntax because it’s dealing with the _complexity of video_.

It's dealing with 3D data (more if you count audio, other tracks) and multi-dimensional transforms from a command line.

kwanbix•45m ago
I use ChatGPT for this kind of complexity.

It works 99% of the time for my use case.

shardullavekar•36m ago
jack_pp made a point in the comments, worth noting.
Dachande663•26m ago
ffmpeg is the only community where I've asked for help and been told "if you have to ask, you're too stupid to use this project". Needless to say, it was a welcoming community I continued engaging with.