Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features
This article investigates the use of both verbal and non-verbal cues in device-directed speech detection (DDSD), the task of distinguishing queries directed at a voice assistant from background speech…