I built this because I was tired of "AI tools" that were just wrappers around expensive APIs with high latency. As a developer who lives in the terminal (Arch/Nushell), I wanted something that felt like a CLI tool and respected my hardware.
The Tech:
GPU Heavy: It uses decord and PyTorch for scene analysis. I’m calculating action density and spectral flux locally to find hooks before hitting an LLM.
Local Audio: I’m using ChatterBox locally for TTS to avoid recurring costs and privacy leaks.
Rendering: Final assembly is offloaded to NVENC.
Looking for Collaborators: I’m currently looking for PRs specifically around:
Intelligent Auto-Zoom: Using YOLO/RT-DETR to follow the action in a 9:16 crop.
Voice Engine Upgrades: Moving toward ChatterBoxTurbo or NVIDIA's latest TTS.
It's fully dockerized, and also has a makefile. Would love some feedback on the pipeline architecture!
ramon156•38m ago
I don't get this reasoning. You were tired of LLM wrappers, but what is your tool? These two requirements (felt like a CLI and respects your hardware) do not line up.
Still a cool tool though! Although it seems partly AI generated.
rustyhancock•31m ago
I've started including a statement of AI usage in my docs.
HN is a niche audience but it seems like it's the first question everyone has when opening a repo.
Which is odd because the first question we should have is, does it work.
Personally I can't see myself ever writing the bulk of the README again, life's too short.
HeartofCPU•55m ago
It looks like it’s written by a LLM
myky22•45m ago
Wow, great job.
I did smth similar 4 years ago with YOLO ultralytics.
Back then I used chat messsges spike as one of several variables to detect highs and fails moments. It needed a lot a human validation but was so fun.
divyaprakash•3h ago
The Tech:
Looking for Collaborators: I’m currently looking for PRs specifically around: It's fully dockerized, and also has a makefile. Would love some feedback on the pipeline architecture!ramon156•38m ago
Still a cool tool though! Although it seems partly AI generated.
rustyhancock•31m ago
HN is a niche audience but it seems like it's the first question everyone has when opening a repo.
Which is odd because the first question we should have is, does it work.
Personally I can't see myself ever writing the bulk of the README again, life's too short.