Self-supervised Pretraining for Robust Personalized Voice Activity Detection in Adverse Conditions
The paper proposes a method to enhance the performance of a personalized voice activity detection (VAD) model in adverse conditions using self-supervised pretraining on a large unlabelled dataset. The model…
Continue reading