Deepfake audio as a data augmentation technique for training automatic speech to text transcription models
The paper examines deepfake audio as a data augmentation technique for training robust speech-to-text transcription models. The authors argue that finding a large, diverse labeled dataset…