frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Launch HN: Mosaic (YC W25) – Agentic Video Editing

https://mosaic.so
21•adishj•1h ago
Hey HN! We’re Adish & Kyle from Mosaic (https://mosaic.so). Mosaic lets you create and run your own multimodal video editing agents in a node-based canvas. It’s different from traditional video editing tools in two ways: (1) the user interface and (2) the visual intelligence built into our agent.

We were engineers at Tesla and one day had a fun idea to make a YouTube video of Cybertrucks in Palo Alto. We recorded hours of cars driving by, but got stuck on how to scrub through all this raw footage to edit it down to just the Cybertrucks.

We got frustrated trying to accomplish simple tasks in video editors like DaVinci Resolve and Adobe Premiere Pro. Features are hidden behind menus, buttons, and icons, and we often found ourselves Googling or asking ChatGPT how to do certain edits.

We thought that surely now, with multimodal AI, we could accelerate this process. Better yet, an AI video editor could automatically apply edits based off what it sees and hears in your video. The idea quickly snowballed and we began our side quest to build “Cursor for Video Editing”.

We put together a prototype and to our amazement, it was able to analyze and add text overlays based on what it saw or heard in the video. We could now automate our Cybertruck counting with a single chat prompt. That prototype is shown here: https://www.youtube.com/watch?v=GXr7q7Dl9X0.

After that, we spent a chunk of time building our own timeline-based video editor and making our multimodal copilot powerful and stateful. In natural language, we could now ask chat to help with AI asset generation, enhancements, searching through assets, and automatically applying edits like dynamic text overlays. That version is shown here: https://youtu.be/X4ki-QEwN40.

After talking to users though, we realized that the chat UX has limitations for video: (1) the longer the video, the more time it takes to process. Users have to wait too long between chat responses. (2) Users have set workflows that they use across video projects. Especially for people who have to produce a lot of content, the chat interface is a bottleneck rather than an accelerant.

That took us back to first principles to rethink what a “non-linear editor” really means. The result: a node-based canvas which enables you to create and run your own multimodal video editing agents. https://screen.studio/share/SP7DItVD.

Each tile in the canvas represents a video editing operation and is configurable, so you still have creative control. You can also branch and run edits in parallel, creating multiple variants from the same raw footage to A/B test different prompts, models, and workflows. In the canvas, you can see inline how your content evolves as the agent goes through each step.

The idea is that canvas will run your video editing on autopilot, and get you 80-90% of the way there. Then you can adjust and modify it in an inline timeline editor. We support exporting your timeline state out to traditional editing tools like DaVinci Resolve, Adobe Premiere Pro, and Final Cut Pro.

We’ve also used multimodal AI to build in visual understanding and intelligence. This gives our system a deep understanding of video concepts, emotions, actions, spoken word, light levels, shot types.

We’re doing a ton of additional processing in our pipeline, such as saliency analysis, audio analysis, and determining objects of significance—all to help guide the best edit. These are things that we as human editors internalize so deeply we may not think twice about it, but reverse-engineering the process to build it into the AI agent has been an interesting challenge.

Some of our analysis findings: Optimal Safe Rectangles: https://assets.frameapp.ai/mosaicresearchimage1.png Video Analysis: https://assets.frameapp.ai/mosaicresearchimage2.png Saliency Analysis: https://assets.frameapp.ai/mosaicresearchimage3.png Mean Movement Analysis: https://assets.frameapp.ai/mosaicresearchimage4.png

Use cases for editing include: - Removing bad takes or creating script-based cuts from videos / talking-heads - Repurposing longer-form videos into clips, shorts, and reels (e.g. podcasts, webinars, interviews) - Creating sizzle reels or montages from one or many input videos - Creating assembly edits and rough cuts from one or many input videos - Optimizing content for various social media platforms (reframing, captions, etc.) - Dubbing content with voice cloning and lip syncing.

We also support use cases for generating content such as motion graphic animations, cinematic captions, AI UGC content, adding contextual AI-generated B-Rolls to existing content, or modifying existing video footage (changing lighting, applying VFX).

Currently, our canvas can be used to build repeatable agentic workflows, but we’re working on a fully autonomous agent which will be able to do things like: style transfer using existing video content, define its own editing sequence / workflow without needing a canvas, do research and pull assets from web references, and so on.

You can try it today at https://edit.mosaic.so. You can sign up for free and get started playing with the interface by uploading videos, making workflows on the canvas, and editing them in the timeline editor. We do paywall node runs to help cover model costs. Our API docs are at https://docs.mosaic.so. We’d love to hear your feedback!

Comments

tonyoconnell•24m ago
This is so cool. Good luck with your venture.
adishj•22m ago
Thank you :)
callamdelaney•18m ago
Hey, good luck with Mosaic.

Some feedback initially on the landing page, looks great but I thought that there is, for me, too much motion going on on the homepage and the use cases page. May be an unpopular opinion!

cjbarber•17m ago
Agreed, homepage was confusing for me also. I tried to scroll around and see a demo. For a product like this that is so visual, I expected to be able to find a 30s demo clip somewhere but couldn't see one on the homepage or product page (and the scrolling on the product page was annoying for me).
adishj•13m ago
the sad part is spent so long on the product page scrolling animation haha

very valid point though — I think a demo clip of a BEFORE vs AFTER immediately somewhere in the hero even or right below it would be helpful

thanks for the feedback

adishj•14m ago
valid points, thanks for the feedback. i had gone for a certain aesthetic but you're right in that it may be a bit too overwhelming.
cjbarber•18m ago
I think this is a great endeavor. I was thinking about a channel that I like watching on YouTube. They travel to exotic places by boat and film themselves, nature documentary style. To make good videos requires going to these places, a ton of filming, AND a ton of editing. They put out a video every 2 weeks or so on their trips. I imagine the editing is the hard part.

This is a long winded way of saying that I think creators need what you're making! People who have hours of awesome footage but have to spend dozens of hours cutting it down need this. Then also people who have awesome footage but aren't good at editing or hiring an editor, same thing. I'd love to see someone solve this so that 90th percentile editing is available to all, and then it can be more about who has the interesting content, rather than who has the interesting content and editing skills.

adishj•2m ago
thanks! Mosaic can already do the rough cuts for you — so you can upload all your footage from your travel, and prompt it to "make a 2 minute highlight reel of your trip to Japan", for instance.

soon, we also plan to incorporate style transfer, so you could even give it a video from the channel you enjoy watching + your raw footage, and have the agent edit your footage in the same style of the reference video.

penne_pastaa•18m ago
this is so cool, can we see some demos of edits you'd make with it?
adishj•5m ago
thanks! check out the demo video here of the latest version of the interface: https://screen.studio/share/SP7DItVD

i playback parts of the cinematic edit I made to the conversation between Dwarkesh Patel and Satya Nadella (e.g. added cinematic captions, motion graphics)

i can post the full edit as well if you're interested

jaccola•14m ago
Very cool. It definitely feels to me that the power of pro tools should be available to more people with AI.

Would have been nice if there was a killer demo on your landing page of a video made with Mosaic.

adishj•8m ago
that's our perspective as well.

a lot of tooling is being built around generative AI in particular, but there's still a big gap for people that want to share their own stories / experiences / footage but aren't well-versed with pro tools.

valid feedback on the landing page — something we'll add in.

Treating AI's amnesia with context portability, knowledge graphs and ontologies

https://mmc.vc/research/agentic-enablers-treating-ais-amnesia-and-other-disorders/
1•advikipedia•49s ago•1 comments

‘Extremely Rare’ Pink Grasshopper Spotted in New Zealand

https://scienceclock.com/extremely-rare-pink-grasshopper-spotted-in-new-zealand/
1•ashishgupta2209•2m ago•0 comments

Build vs. Buy: What This Week's Outages Should Teach You

https://www.toddhgardner.com/blog/build-vs-buy-outages
1•toddgardner•3m ago•0 comments

Radical CS (2023) [pdf]

https://csrc.nist.gov/csrc/media/Presentations/2023/radical-cs/images-media/sess-1-rogaway-bcm-wo...
1•zdw•3m ago•0 comments

Why do women feel the cold more than men?

https://www.rte.ie/brainstorm/2025/1118/1111894-why-do-women-feel-the-cold-more-than-men/
1•austinallegro•3m ago•0 comments

Why I Chose Astro for My Portfolio (2024)

https://www.nickstambaugh.dev/posts/why-astro
1•sieep•4m ago•0 comments

Outdated Samsung handset linked to fatal emergency call failure in Australia

https://www.theregister.com/2025/11/18/samsung_emergency_call_failure/
1•doener•4m ago•0 comments

In the A.I. Race, Chinese Talent Still Drives American Research

https://www.nytimes.com/2025/11/19/technology/ai-research-chinese-talent.html
1•blondie9x•5m ago•0 comments

Random lasers from peanut kernel doped with birch leaf–derived carbon dots

https://www.degruyterbrill.com/document/doi/10.1515/nanoph-2025-0312/html
1•PaulHoule•8m ago•0 comments

Zero Allocation JSON Logger for Go

https://github.com/rs/zerolog
1•mooreds•8m ago•0 comments

GitHub Maybe Down Again

https://www.githubstatus.com/incidents/zzl9nl31lb35
2•erdaniels•8m ago•0 comments

DeepMind's latest: An AI for handling mathematical proofs

https://arstechnica.com/ai/2025/11/deepminds-latest-an-ai-for-handling-mathematical-proofs/
1•quapster•9m ago•0 comments

The Fate of Data Model Dependency

https://medium.com/@HobokenDays/the-fate-of-shared-data-model-cf8a3dc88ac9
1•NewarkDays•10m ago•0 comments

SFO is getting a new direct flight to Warsaw

https://www.sfchronicle.com/bayarea/article/sfo-warsaw-21172190.php
1•danielam•10m ago•0 comments

Monitor Plus is shutting down

https://support.mozilla.org/en-US/kb/monitor-plus-shutting-down
2•azinman2•11m ago•0 comments

Sam 3D: Powerful 3D Reconstruction for Physical World Images

https://ai.meta.com/blog/sam-3d/?_fb_noscript=1
2•meetpateltech•11m ago•0 comments

Why BSDs?

https://blog.thechases.com/posts/why-bsds/
1•jaypatelani•11m ago•0 comments

Saudi Arabia Backs Elon Musk's XAI with Data Center Deal

https://www.nytimes.com/2025/11/19/technology/saudi-arabia-elon-musk-xai.html
2•fleahunter•11m ago•0 comments

Meta Segment Anything Model 3

https://ai.meta.com/blog/segment-anything-model-3/?_fb_noscript=1
4•alcinos•12m ago•0 comments

Ask HN: Image Recognition

1•davidajackson•12m ago•0 comments

To Evade Sanctions, the Kremlin Turns to Convicted Money Launderer Ilan Shor

https://www.lawfaremedia.org/article/to-evade-sanctions--the-kremlin-turns-to-a-convicted-money-l...
2•sigwinch•14m ago•0 comments

Breaking the Drone Speed World Record in Dubai [video]

https://www.youtube.com/watch?v=KZqLMKzgaW0
1•marklit•16m ago•0 comments

Global Privacy Control

https://en.wikipedia.org/wiki/Global_Privacy_Control
1•amelius•17m ago•0 comments

Elaris UI

https://github.com/ambystechcom/Ambystech.Elaris.UI
1•tavobarrientos•17m ago•0 comments

Slicing Is All You Need: Towards a Universal One-Sided Distributed MatMul

https://arxiv.org/abs/2510.08874
1•matt_d•17m ago•0 comments

Server virtualization market heats up to win VMware refugees

https://www.theregister.com/2025/11/17/gartner_server_virtualization_guide/
1•speckx•18m ago•0 comments

Specialized CSV readers for Rust leveraging hybrid SIMD techniques

https://docs.rs/simd-csv/latest/simd_csv/
1•todsacerdoti•18m ago•0 comments

Show HN: Terminal roguelike using Entity-Component-System in pure Ruby

https://github.com/Davidslv/vanilla-roguelike
2•davidslv•19m ago•1 comments

Not redefining Chrome, but fixing the workflow

https://chromewebstore.google.com/detail/tapone-—-all-in-one-short/cgjeegkgkbnfldccmknbacfaaejd...
1•kii9999•20m ago•0 comments

Show HN: I built a 100ml shot with 25 g protein and 100 calories

https://protein.inc/#flavors
1•aakiverse•20m ago•0 comments