Class-Incremental Learning for Multi-Label Audio Classification
The article presents a new method for class-incremental learning of potentially overlapping sounds for multi-label audio classification…
The article discusses the challenge of audio-to-audio (A2A) style transfer, especially in the context of transferring emotional…
The paper introduces a Cross-Speaker Encoding (CSE) network to improve multi-talker speech recognition. Current methods, single-input multiple-output…
The paper discusses RaD-Net, a repairing and denoising network for speech signal improvement. The authors have improved…
The paper proposes HyperGANStrument, a novel neural synthesizer that enhances the generation capability of GANStrument by introducing…
The paper introduces MAGNET, a masked generative sequence modeling method that operates directly over several streams of…
This article introduces a system for real-time and continuous turn-taking prediction in spoken dialogue systems (SDSs). The…
The article discusses the limitations of conventional audio classification methods and introduces a novel method that incorporates…
The research focuses on match-mismatch classification with EEG recordings, using self-supervised speech representations and contextual text…
The article discusses the development of a full-frequency dynamic convolution (FFDConv) for sound event detection. Traditional 2D…