Real-time and Continuous Turn-taking Prediction Using Voice Activity Projection
This article introduces a system for real-time and continuous turn-taking prediction in spoken dialogue systems (SDSs). The system is based on a model called voice activity projection (VAP), which maps…
Continue reading