State-of-the-art text-to-speech model for 600+ languages, supporting:
Built with OmniVoice by Xiaomi Next-gen Kaldi team.
Recommended: 3–10 seconds audio.