multi-modal audiovisual learning

TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion

root, January 26, 2024

The paper introduces TDFNet, a model for audio-visual speech separation, a task that matters for applications such as speech recognition and assistive technologies. While existing methods demand more computational resources and…

Machine Learning Sound

TMac: Temporal Multi-Modal Graph Learning for Acoustic Event Classification

root, September 25, 2023

The study proposes TMac, a new method for acoustic event classification that uses temporal multi-modal graph learning to improve the processing of audiovisual data in deep learning models…
