Text-Audio Alignment Papers

Artificial Intelligence, Computation and Language

Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation

root, January 4, 2024

The article discusses Auffusion, a Text-to-Audio (TTA) system that leverages the power of diffusion models and large language models. Auffusion adapts Text-to-Image (T2I) diffusion models to the TTA task, improving…
