Jeffreys divergence-based regularization of neural network output distribution applied to speaker recognition

The article introduces a new loss function for speaker recognition with deep neural networks, based on Jeffreys Divergence. This divergence, added to the cross-entropy loss function, allows for maximizing the target value of the output distribution while smoothing the non-target values. It is shown that this loss function provides highly discriminative features and outperforms the state-of-the-art for speaker recognition, especially on out-of-domain data. Additionally, the article includes a theoretical justification of the effectiveness of this loss function and its impact on different dataset types.

Publication date: 29 Dec 2023
Project Page: Not provided
Paper: https://arxiv.org/pdf/2312.16885

Post Views: 317

Jeffreys divergence-based regularization of neural network output distribution applied to speaker recognition

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer

Accent-VITS:accent transfer for end-to-end TTS

Leave a Reply Cancel reply

Please allow ads on our site