November 4, 2023

Low-latency Real-time Voice Conversion on CPU

This article discusses the development of a new voice conversion model called Low-latency Low-resource Voice Conversion (LLVC). This model is designed to convert speech in real-time, with a latency of under 20ms at a bitrate of 16kHz, and can run on a consumer CPU. The model uses a generative adversarial architecture and knowledge distillation to achieve this performance. The authors claim that LLVC achieves the lowest resource usage and latency of any open-source voice conversion model. The model is available on GitHub.

Publication date: 1 Nov 2023
Project Page: https://github.com/KoeAI/LLVC
Paper: https://arxiv.org/pdf/2311.00873

Post Views: 309

low-latency, model distillation, open-source toolkit, Singing Voice Conversion, Streaming CTR prediction

Low-latency Real-time Voice Conversion on CPU

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

In-Context Prompt Editing For Conditional Audio Generation

Investigating Self-Supervised Deep Representations for EEG-based Auditory Attention Decoding

Leave a Reply Cancel reply

Please allow ads on our site