Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations
The article presents FreeStyleTTS, a model for expressive text-to-speech (TTS) synthesis that needs only minimal human annotation. The approach uses a large language model to transform expressive TTS into a style retrieval…