I am surprised you went with ElevenLabs for the transcription layer. In my experience the pricing there makes the unit economics pretty tough for high volume calls. We moved a similar stack to Deepgram recently and the latency is significantly better. If you have the ops capacity self-hosting Whisper is also a solid option to cut costs, though I guess it depends on your concurrency requirements.
storystarling•22m ago