Mix-up Papers - BytesArchive

Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation

root, October 4, 2023

This research focuses on Automated Audio Captioning (AAC), the task of generating text descriptions of sounds. Recent systems use seq2seq models such as Transformers. This study aims to improve these models…

Mix-up