with sip this logic changes. https://platform.openai.com/docs/guides/realtime-sip
sounds like we can listen to the webhook and start from there?
But for now yea waiting for Webhook is way to do it.
Outside of OpenAI lots of mechanisms exist stuff like STIR/SHAKEN[0]
More than I expected.
According to the OpenAI forums this is a common problem. I see they've addressed this in the post by prompting the model to stick to one language, but previously this didn’t work consistently, and in their Playground the newest `User transcript model` is still the same as before (`gpt-4o-transcribe`), so I don’t have high hopes. Must be hard to implement.
edit: Tried it again (with a prompt requesting English like always). By my 6th message it suddenly started transcribing to Finnish, and after that it became more common. Better than it used to be, but in many ways still useless. Though I'm sure it works better for people with lighter accents.
I have a TestFlight beta for those who want to try it out, hope to have the new model included in the next beta build:
zebomon•2h ago