One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition
The paper presents a novel framework named SLIDAR that is capable of joint speaker diarization and automatic speech recognition. SLIDAR can process inputs of any length and can handle any…
Continue reading