1. There are plenty of good open TTS models for EU languages: Fish, CosyVoice, Voxtral.
2. KugelAudio claims to beat ElevenLabs, but so do Chatterbox-Turbo and Fish Audio S2 Pro.
3. Their 39ms latency is not a strong technical differentiator against Fish at 100ms.
=> It's a deployment business on top of a commodity model. In my opinion, there is no technical moat to defend them from competition.
Their angle is EU sovereignty, which I very much like, but I don't see how their API (where you need to trust them) could be better at sovereignty than an open model that can run air-gapped with no trust needed. But I mean, YC is in the VC business, so there must be some angle by which KugelAudio could 100x its current valuation. Otherwise, they wouldn't be attractive to VC money.
Does anyone know what KugelAudio's unique angle is?
victorrpham•37m ago