high-fidelity synthesis Papers

DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

root October 4, 2023 0

This paper introduces a new diffusion autoregressive model (DIFFAR) for generating high-quality raw speech waveforms. The model generates overlapping frames sequentially, each conditioned on a portion of the previously generated…

Press ESC to close

high-fidelity synthesis

Please allow ads on our site