frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Chaining VEO 2's 8-second clips into 2-minute videos with sync audio

https://storage.googleapis.com/smarketly-d31fb.firebasestorage.app/videos/NJZFGTZo1wYaqkofZuNnF9yy9Lt2/6bcfa3e3-6b1f-4a33-835a-c6c3c35390c6.mp4?GoogleAccessId=firebase-adminsdk-fbsvc%40smarketly-d31fb.iam.gserviceaccount.com&Expires=16730323200&Signature=WJbuzWrNeOuZDdE8LraQChUEi2TfcloVx0%2F2RbHBPffPuayIdqTTXsi12jgeghtf5dPHlghULkship%2B3wlofZaawxQfi5XwZ9s2rnxHMcY5qImBHI4BcDafV2aZhC90DJdUHXk6M4%2F8cph7Q623d%2FkjmuEkYeNcuVHPOtERxpklgfcYS10xzp6b83EEkajY%2BEp1jyvU%2FyfwZolAIpVlo%2BJGbFBpGWbIleBNC3aFpPv8cSD4lPSYjvh5%2Fs9mQmFEgVgaJAiwLASUJO7W0bQ%2Bpn0CArW%2FMIyxSzfveclxAWm3yhsICGjf5HqkzzcuDg8O9NwxB2eSb3ptTrEDUAl4Kfw%3D%3D
2•abilafredkb•6h ago
The technical challenge: VEO 2 only generates 8-second clips with no audio. I needed to create longer marketing videos, so I built a system that chains multiple clips into cohesive 2-minute videos with synchronized narration. The core technical problems I solved: 1. Script Continuity Across Clips VEO 2 doesn't maintain context between generations. I built a script parser that breaks longer narratives into 8-second segments while maintaining visual and narrative continuity. Each segment gets contextual prompts that reference the previous clip's end state. 2. Seamless Visual Transitions To avoid jarring cuts between clips, I analyze the final frame of each segment and use it to inform the opening prompt of the next segment. This creates natural visual flow across the 8-second boundaries. 3. Audio Synchronization Pipeline Since VEO 2 is silent, I integrated Google's Gemini Flash audio model with a custom FFmpeg pipeline:

Parse the script timing Generate audio segments that match video pacing Use FFmpeg to sync audio with video transitions Handle audio crossfades between clip boundaries

4. Cost Optimization At current VEO 2 pricing, a 2-minute video costs ~$60 in API calls. I built request batching and caching to minimize redundant generations. The system can theoretically generate longer videos, but I capped it at 2 minutes due to cost considerations. This powers the video generation feature in my marketing tool Smarketly, but the core chaining technique could work for any VEO 2 application needing longer content. Technical questions I'm still working on:

Better frame-to-frame consistency algorithms More efficient prompt engineering for visual continuity

Happy to share code snippets or discuss the technical implementation!

But you can check what i'm building this on by visiting https://smarketly.lema-lema.com

Scots join class action suit against M&S after hackers stole personal data

https://www.dailyrecord.co.uk/news/hundreds-scots-join-class-action-35282317
1•chrisjj•1m ago•0 comments

Writing your own CUPS printer driver in 100 lines of Python

https://behind.pretix.eu/2018/01/20/cups-driver/
1•todsacerdoti•1m ago•0 comments

The Future of Software (AIOS)

https://www.harmonized-ai.com/
1•ZekeV•2m ago•1 comments

Why Top Posting Has Won

https://www.solipsys.co.uk/new/WhyTopPostingHasWon.html?ye25hn
1•ColinWright•7m ago•0 comments

Migrating Uber's Compute Platform to Kubernetes

https://www.uber.com/en-AU/blog/migrating-ubers-compute-platform-to-kubernetes-a-technical-journey/
2•ijidak•7m ago•0 comments

Show HN: SEO for ChatGPT, Perplexity and Gemini

https://ayzeo.fyrma.io/
2•johndoes•8m ago•1 comments

Alan Yentob Has Died

https://en.wikipedia.org/wiki/Alan_Yentob
2•edward•10m ago•0 comments

Ask HN: Why is all about AI?

2•Haeuserschlucht•11m ago•1 comments

Inhibiting 15-PGDH blocks BBB deterioration and protects mice from Alzheimer's

https://www.pnas.org/doi/10.1073/pnas.2417224122
2•pera•11m ago•1 comments

Do not encrypt comments in SOPS

https://github.com/getsops/sops/issues/921
1•mooreds•20m ago•0 comments

Phrenology's Bumpy Path

https://worldhistory.substack.com/p/phrenologys-bumpy-path
1•crescit_eundo•20m ago•0 comments

MinIO Guts Management Dashboard

https://github.com/minio/object-browser/pull/3509
1•sigmonsays•21m ago•1 comments

Welcome to Pocket Users

https://www.wallabag.it/en/news/welcome-to-pocket-users
1•gpi•21m ago•0 comments

Tonnetz

https://en.wikipedia.org/wiki/Tonnetz
1•IdealeZahlen•22m ago•0 comments

Testing phishing email detection Machine Learning

https://phishingdetect.onrender.com
1•theokere•23m ago•1 comments

Drawing power out of CCS port

https://openinverter.org/forum/viewtopic.php?t=3551
1•faebi•25m ago•0 comments

Show HN: ATFile – Store files on the ATmosphere/Bluesky

https://tangled.sh/@zio.sh/atfile
2•electricduck•25m ago•0 comments

The History of Radio Buttons

https://www.jitbit.com/alexblog/242-the-history-of-a-radio-button/
1•susam•27m ago•0 comments

Show HN: An mcp server for jmap email clients

https://github.com/willmeyers/jmap-mcp-server
1•willmeyers•28m ago•0 comments

Rich: Enrich your CSVs with new columns using an LLM

https://blog.marcua.net/2025/05/22/rich-enrich-your-csvs-with-new-columns.html
1•marcua•28m ago•0 comments

HD-64 – An HDMI RF-modulator replacement for the Commodore 64

https://github.com/sideprojectslab/HD-64
2•ethanpil•29m ago•0 comments

Immigration Is the Only Thing Propping Up California's Population

https://www.wsj.com/business/california-population-growth-immigration-h-1b-visa-4b526478
1•xqcgrek2•30m ago•1 comments

Mainstreaming the Values of the Natural Building Movement [video]

https://www.youtube.com/watch?v=DmImdlUsmxI
1•surprisetalk•34m ago•0 comments

Powhsm: Special purpose PowHSM firmware for the RSK PowPeg

https://github.com/rsksmart/rsk-powhsm
2•csomar•36m ago•0 comments

Show HN: I made a voice first calorie tracker thats easy to use

https://apps.apple.com/za/app/saylo-ai/id6745614063
1•Dclav•39m ago•2 comments

The Voices: Generating Face from Voice Using AI

https://substack.com/@oddmaido/p-164411816
1•breadislove•42m ago•0 comments

Farmers can help rescue water-loving birds

https://knowablemagazine.org/content/article/food-environment/2025/how-farmers-can-help-save-migrating-water-loving-birds
1•rntn•43m ago•0 comments

Implementing complex numbers and FFT with just datatypes (2023)

https://gist.github.com/VictorTaelin/5776ede998d0039ad1cc9b12fd96811c
1•surprisetalk•44m ago•0 comments

What Svelte Promises, Rich Harris – Svelte Summit Spring 2025 [video]

https://www.youtube.com/watch?v=1dATE70wlHc
1•keybits•44m ago•0 comments

UE1 homebrew 1-bit vacuum computer in action [video]

https://www.youtube.com/watch?v=kT3TMb9n26Q
2•todsacerdoti•49m ago•0 comments