
Show HN: A different kind of AI Video generation

44•fcpguru•5d ago
Hello!

I'm Andrew Arrow, a developer and Final Cut Pro user. My history with FCP goes back years and years; I tried to be a video editor as a career but ended up a software developer. I purchased the full version of the software a long, long time ago. And to Apple's credit, every single time I go to the App Store on any Mac I still have access to download it. I must have been through 20, 30? I don't know, but a lot of Macs over the years. Just recently I downloaded it again and noticed the XML Export feature.

Wow. There's a lot of stuff there. I mean a lot. I started playing around with generating the XML and that has led to this project.

https://github.com/andrewarrow/cutlass/blob/main/README.md

Read through those Go structs and XML tags. Keep going. And going. It just never stops! It's a very sophisticated XML format that can describe any timeline, effect, or animation you can imagine. Apple's documentation is scattered, the XML is notoriously finicky, and one wrong attribute crashes your import.

And of course with Claude Code (or others, pick your favorite AI) you can have your AI write some amazing things in Go using cutlass.

```go
// Generate a timeline with precise keyframe animations
video := fcp.Video{
    Ref:      assetID,
    Offset:   "0s",
    Duration: fcp.ConvertSecondsToFCPDuration(10.0),
    AdjustTransform: &fcp.AdjustTransform{
        Params: []fcp.Param{{
            Name: "position",
            KeyframeAnimation: &fcp.KeyframeAnimation{
                Keyframes: []fcp.Keyframe{
                    {Time: "0s", Value: "0 0"},
                    {Time: "240240/24000s", Value: "100 50"},
                },
            },
        }},
    },
}
```
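
For anyone wondering where a value like `240240/24000s` comes from: FCPXML expresses times as rational numbers of seconds, and clips are expected to land on frame boundaries. Here is a minimal sketch of that arithmetic, assuming a 23.976 fps timeline where one frame lasts 1001/24000 s. `secondsToFCPDuration` is a hypothetical stand-in written for illustration; it is not the library's `fcp.ConvertSecondsToFCPDuration`, whose actual implementation may differ.

```go
package main

import (
	"fmt"
	"math"
)

// Assumption: a 23.976 fps timeline, where each frame lasts 1001/24000 s.
const (
	timebase      = 24000 // denominator of every time value
	frameDuration = 1001  // numerator contributed by one frame
)

// secondsToFCPDuration is a hypothetical helper (not the cutlass API).
// It rounds to the nearest whole frame, then expresses that frame count
// as a rational number of seconds with an "s" suffix.
func secondsToFCPDuration(seconds float64) string {
	frames := int64(math.Round(seconds * timebase / frameDuration))
	if frames == 0 {
		return "0s"
	}
	return fmt.Sprintf("%d/%ds", frames*frameDuration, timebase)
}

func main() {
	fmt.Println(secondsToFCPDuration(10.0)) // 240240/24000s
	fmt.Println(secondsToFCPDuration(0))    // 0s
}
```

Ten seconds rounds to 240 frames, and 240 × 1001 = 240240, which is the keyframe time used in the snippet above.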

So I haven't been writing code like that with my human brain. I spend most of my time just telling Claude the video I want to create, and he uses the cutlass library to achieve it. I'm hooking up FCP directly to Claude. A different kind of AI Video generation.

Would love feedback, thanks for reading this far! -aa

Comments

iLoveOncall•1d ago
It's video editing, not video generation at all, not even a "different kind". Really cool though.
santana16•1d ago
Yeah, but further developed it could pass for editing
iinnPP•1d ago
Creating a video through edits is generation of a video.

This is a great project that demonstrates a unique way to tackle a problem from someone with a passion for those problems.

echelon•1d ago
Editing will remain of utmost strategic importance to automate until generation dominates the field. At that point, we'll probably generate from storyboards.

A lot of small startups are trying to automate timeline construction with VLMs. I've counted about a dozen, some with seed stage funding. If you can crack this, there's absolutely a path to a unicorn. But in the long term, generation will disrupt nonlinear video editors, because whatever high level software we build will be able to dispatch to both generation and editing tasks.

Fwiw, I also work in this space and spend a lot of time thinking about it. Prior to AI, I also spent a significant amount of time making films the old fashioned way.

ozgung•1d ago
Could you name a few of these startups please?
absolute_unit22•1d ago
I’m curious as well.

I’ve been exploring this space for a potential project - curious to see what these startups are doing

jasonjmcghee•1d ago
I feel like they've really changed how they present themselves, but Runway was the big one that came to mind for me: it focuses on more than simple video generation and is actually meant for filmmakers.

This is the best page I found- maybe there's a better one...

https://runwayml.com/product

But its interface looks like a video editor- timeline, etc

fcpguru•1d ago
Yeah I look at the amount of code I have now and it's just crazy. It would have taken me 10 years to get it this far before AI. So much of the code was generated. But I do wonder if all of FCP will be legacy soon...
Y_Y•1d ago
It is a law of "AI" that every project must have a name collision with Nvidia:

https://github.com/NVIDIA/cutlass

barefootford•1d ago
Hey, I’ve been trying to get Claude Code to generate Final Cut XML myself. Mostly I just have a Claude.md with the FCP XML reference and some guidelines. What does this do differently?
parentheses•1d ago
The idea here (I think) is that it (1) can be a program with code used to generate the XML and (2) is typed.

Just by virtue of being a Go program, it enables even more sophisticated validation and automation if you want to implement it.
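
A rough sketch of what "typed" buys you, using only Go's standard `encoding/xml`. The struct names mirror the fields shown in the post, but these definitions and their XML tag names are assumptions made for illustration; they are not copied from cutlass and should be checked against the FCPXML DTD.

```go
package main

import (
	"encoding/xml"
	"fmt"
)

// Hypothetical types: the tag names below are illustrative assumptions.
type Keyframe struct {
	Time  string `xml:"time,attr"`
	Value string `xml:"value,attr"`
}

type KeyframeAnimation struct {
	Keyframes []Keyframe `xml:"keyframe"`
}

type Param struct {
	Name              string             `xml:"name,attr"`
	KeyframeAnimation *KeyframeAnimation `xml:"keyframeAnimation,omitempty"`
}

type AdjustTransform struct {
	Params []Param `xml:"param"`
}

type Video struct {
	XMLName         xml.Name         `xml:"video"`
	Ref             string           `xml:"ref,attr"`
	Offset          string           `xml:"offset,attr"`
	Duration        string           `xml:"duration,attr"`
	AdjustTransform *AdjustTransform `xml:"adjust-transform,omitempty"`
}

// renderVideo marshals a Video into indented XML. The compiler rejects
// misspelled field names, which a hand-written XML string would not catch.
func renderVideo(v Video) (string, error) {
	out, err := xml.MarshalIndent(v, "", "  ")
	if err != nil {
		return "", err
	}
	return string(out), nil
}

func main() {
	v := Video{
		Ref:      "r2",
		Offset:   "0s",
		Duration: "240240/24000s",
		AdjustTransform: &AdjustTransform{
			Params: []Param{{
				Name: "position",
				KeyframeAnimation: &KeyframeAnimation{
					Keyframes: []Keyframe{
						{Time: "0s", Value: "0 0"},
						{Time: "240240/24000s", Value: "100 50"},
					},
				},
			}},
		},
	}
	out, err := renderVideo(v)
	if err != nil {
		fmt.Println("marshal failed:", err)
		return
	}
	fmt.Println(out)
}
```

Types only guarantee the shape of the document. Whether the values inside it are acceptable to FCP is a separate question that still needs its own checks.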

fcpguru•1d ago
hehe now I feel funny working on the Python version too. It depends on my mood; yeah, sometimes I want Go. But sometimes I want a little Python.
fcpguru•1d ago
Well, the format is incredibly complex. You have to generate the XML correctly in every last detail or FCP will crash when you import it. I learned the hard way that you need a robust validator system and not just tests alone. Before XML goes out the door you need a last line of defense to catch problems. The Go version of cutlass is pretty good at this now and Python is catching up.
EGreg•1d ago
Wow, this is fantastic. I have been searching for a tool that would let me output videos like that.

I was going to create a JavaScript-based version of FCP to essentially implement the keyframes and transitions with arbitrary videos from YouTube/Vimeo and try to queue them up before they play. Then people would come to our site or use our widget to play all the videos.

I even considered doing a Ken Burns effect and zooming/panning/cropping the videos. I wanted to have the AI do some takes, finding highlights in the video by transcribing it.

And adding the stupid text overlays etc. would be done in JavaScript so we don’t need to generate so many versions of the video for different languages or styles.

What do you think? Should we use cutlass XML as the format? Probably better to just make our own JSON, no?

Does anything like this already exist? There are a bunch of sites to edit video timelines. Maybe there is a js lib, like impress.js has for presentations etc?

fcpguru•1d ago
Oh I have just the wiki for you!

https://github.com/andrewarrow/cutlass/wiki

The format is soooo complex. But if you want every feature of FCP it's all there.

tardismechanic•1d ago
This is amazing!

I wonder if OpenAI Structured Output [1] could be leveraged to constrain generation to required JSON Schema (and then convert to XML later)...

[1] https://platform.openai.com/docs/guides/structured-outputs

fcpguru•1d ago
omg I would never want to see the JSON file! lol the format is sooo complex. I think it needs to stay XML. But I love the idea of OpenAI just speaking this.
DoctorOW•1d ago
I'm not sure what value AI is providing here? Generating key frames seems not just possible, but easier to do within the GUI? Does the AI understand the relationship between script and video?
fcpguru•1d ago
you can say things like "add a new cli menu option to cutlass that takes as input a list of words and make a 1080x1920 9 min video with random png from assets folder and place each word on the screen in font 400 with face color pink and outline color yellow width 15"

And Claude will write the code in a way that generates an actually valid fcpxml file.

A fun one is: "review all the tests in ./tests to get deep knowledge of what you can do with fcpxml in this repo. Add a new cli menu option called stress-test that tries to generate a 9 min video with lots and lots of stuff. Just throw every single complex thing you can think of with many many lanes, transforms, use files from ../assets and create something wild in 1080x1920"
