I have tried a few text-to-speech systems (eSpeak, Piper, Qwen), and none of them has given satisfactory results. Hugging Face does not seem to host any text-to-speech models with particular acclaim, either. I have been using OpenAI's gpt-4o-mini model, but that seems to be approaching end-of-life.
Is there an LLM (or non-LLM) system that you would recommend?