Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

The article presents a new method, CLC: Contrastive Learning for Conversations, to improve the performance of Automatic Speech Recognition (ASR) models. The method leverages artifacts in unsuccessful conversations with assistant systems for self-supervised learning. The authors demonstrated that their method can improve ASR models’ performance on OD3, a new public large-scale semi-synthetic meta-dataset of audio task-oriented dialogues, by up to 19.2%. The gains also transferred to real-world systems, improving performance by up to 6.7% over baselines.

Publication date: 5 Jan 2024
Project Page: https://github.com/amazon-science/amazon-od3
Paper: https://arxiv.org/pdf/2401.02417

Post Views: 317

automatic speech recognition, Contrastive Learning, natural language understanding, Self-supervised Learning, Task Oriented Dialogue

Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

Generalist embedding models are better at short-context clinical semantic search than specialized embedding models

LLM Augmented LLMs: Expanding Capabilities through Composition

Leave a Reply Cancel reply

Please allow ads on our site