September 25, 2023

CrossSinger: A Cross-Lingual Multi-Singer High-Fidelity Singing Voice Synthesizer Trained on Monolingual Singers

This paper presents CrossSinger, a cross-lingual singing voice synthesizer based on Xiaoicesing2. The system is notable for its ability to produce high-fidelity singing voices from monolingual singers in multiple languages. This is achieved by using the International Phonetic Alphabet to unify the representation for all languages, and incorporating language information into the model for better pronunciation. The system was tested on a combination of three singing voice datasets in Japanese, English, and Chinese, and was found to perform well, even in code-switch scenarios.

Publication date: 25 Sep 2023
Project Page: Not Provided
Paper: https://arxiv.org/pdf/2309.12672

Post Views: 319

root

Exit mobile version

Please allow ads on our site

Looks like you're using an ad blocker. Please support us by disabling these ad blocker.

Press ESC to close

Share Article:

root

Deepfake audio as a data augmentation technique for training automatic speech to text transcription models

Profile-Error-Tolerant Target-Speaker Voice Activity Detection

Please allow ads on our site