emotional text-to-speech Papers

Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition

root February 23, 2024 0

The study presents Daisy-TTS, a text-to-speech system that simulates a broad spectrum of emotions. It uses a prosody encoder to learn emotionally-separable prosody embedding, which acts as a proxy for…

Press ESC to close

emotional text-to-speech

Please allow ads on our site