AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
The article presents a semi-supervised method for audio-visual speech recognition (AV-CPL), employing both labeled and unlabeled videos with continuously regenerated pseudo-labels. This method enables the recognition model to be trained…
Continue reading