E3 TTS: Easy End-to-End Diffusion-based Text to Speech
The article introduces E3 TTS, a novel end-to-end, diffusion-based text-to-speech model. Unlike previous models, E3 TTS does not rely on intermediate representations such as spectrogram…