NTT speaker diarization system for CHiME-7: multi-domain, multi-microphone End-to-end and vector clustering diarization
This article discusses NTT Corporation’s speaker diarization system, designed for multi-domain, multi-microphone casual conversations. The system uses weighted prediction error-based dereverberation, applies end-to-end neural diarization with vector clustering to each…
Continue reading