As much as I love podcasts, there are some episodes where I want to know about what the show is about before listening. Some shows post transcripts, but this is not the norm. That's why I built Stenobird.
Stenobird is a very fast way for your agent to get a hold of any podcast transcript as long as the bare audio file is accessible somewhere on the public internet.
I'm using the Parakeet 0.6B V3 model on NVIDIA 4090s and A5000s serverless via Runpod. It's able to tackle about 1 hour of audio in 1 minute while remaining fairly accurate.
Now, I can point my OpenClaw at Stenobird and ask it to tell me about what the latest episode of Linux Unplugged was about and get a nice summary.
I'm happy to answer any questions and would appreciate any feedback from humans or agents.
Try it out: https://stenobird.com