January 11, 2024

RaD-Net: A Repairing and Denoising Network for Speech Signal Improvement

This paper presents RaD-Net, a repairing and denoising network for speech signal improvement. The authors improve their previous two-stage neural network model by replacing its repairing network with COM-Net from TEA-PSE. They also introduce multi-resolution and multi-band discriminators during training, and optimize the model with a three-step training strategy. The proposed systems ranked 2nd in track 1 and 3rd in track 2 of the ICASSP 2024 Speech Signal Improvement Challenge.
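To give a rough sense of what "multi-resolution" means here, the sketch below computes magnitude spectrograms of one signal at several window and hop sizes. In adversarial training setups of this kind, each such view would typically be fed to its own discriminator. This is only an illustration under stated assumptions, not the paper's implementation: the function names, the toy signal, and the `RESOLUTIONS` values are hypothetical, and the discriminator networks themselves are not shown.

```python
import cmath
import math


def stft_magnitude(signal, win_len, hop):
    """Magnitude spectrogram from a naive DFT with a periodic Hann window.

    No padding is applied, so only full frames are analysed.
    """
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / win_len) for n in range(win_len)]
    n_bins = win_len // 2 + 1  # keep non-negative frequencies only
    frames = []
    for start in range(0, len(signal) - win_len + 1, hop):
        seg = [signal[start + n] * window[n] for n in range(win_len)]
        frames.append([
            abs(sum(seg[n] * cmath.exp(-2j * math.pi * k * n / win_len)
                    for n in range(win_len)))
            for k in range(n_bins)
        ])
    return frames


def multi_resolution_spectrograms(signal, resolutions):
    """Return one spectrogram per (win_len, hop) pair."""
    return {(w, h): stft_magnitude(signal, w, h) for w, h in resolutions}


# Hypothetical resolutions, chosen small so the naive DFT stays fast.
RESOLUTIONS = [(16, 4), (32, 8), (64, 16)]

# Toy input: a sine at 0.125 cycles per sample.
signal = [math.sin(2 * math.pi * 0.125 * n) for n in range(128)]
specs = multi_resolution_spectrograms(signal, RESOLUTIONS)

for (win_len, hop), spec in specs.items():
    print(f"win={win_len} hop={hop}: {len(spec)} frames x {len(spec[0])} bins")
```

Short windows give finer time resolution and coarser frequency resolution, and long windows give the reverse, which is why several views of the same signal can expose different kinds of artifacts.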

Publication date: 11 Jan 2024
Project Page: https://github.com/mishliu/RaD-Net
Paper: https://arxiv.org/pdf/2401.04389


COM-Net, multi-band discriminators, multi-resolution discriminators, RaD-Net, Speech Signal Improvement
