Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation
The article discusses Auffusion, a Text-to-Audio (TTA) system built on diffusion models and large language models. Auffusion adapts Text-to-Image (T2I) diffusion models to the TTA task, improving…