Cross-Speaker Encoding Network for Multi-Talker Speech Recognition
The paper introduces a Cross-Speaker Encoding (CSE) network to improve multi-talker speech recognition. The two existing approaches, single-input multiple-output (SIMO) models and single-input single-output (SISO) models, each have limitations. The CSE network addresses these…