January 31, 2024

On Speaker Attribution with SURT

The article presents an improved version of SURT (Streaming Unmixing and Recognition Transducer) for speaker-attributed transcription in multi-talker speech recognition. The authors propose methods for both short mixtures and long recordings by adding an auxiliary speaker branch to SURT. The updated model ensures consistency in relative speaker labels across different utterance groups in a recording. The study was validated through experiments on synthetic LibriSpeech mixtures and demonstrated on the AMI corpus.

Publication date: 31 Jan 2024
Project Page: Unavailable
Paper: https://arxiv.org/pdf/2401.15676

Post Views: 261

root

Exit mobile version

Please allow ads on our site

Looks like you're using an ad blocker. Please support us by disabling these ad blocker.

Press ESC to close

Share Article:

root

Generalisations of Euler’s Tonnetz on triangulated surfaces

Evaluating Echo State Network for Parkinson’s Disease Prediction using Voice Features

Please allow ads on our site