Expressive TTS Driven by Natural Language Prompts Using Few Human Annotations
The article presents FreeStyleTTS, a model for expressive text-to-speech (TTS) synthesis that requires minimal human annotation. The model uses a large language model to transform expressive TTS…