Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

The article presents a study on the use of Byte Pair Encoding (BPE) for automatic Bengali speech recognition. BPE emerges as an effective tokenization method for tackling the out-of-vocabulary (OOV) challenge in various natural language and speech processing tasks. The study identifies the optimal number of BPE tokens for Bengali, a language known for its morphological complexity. Experimental evaluation reveals that approximately 500-1000 tokens result in superior OOV performance. The introduction of BPE tokenization to Bengali ASR achieves a substantial reduction in the word error rate.

Publication date: 31 Jan 2024
Project Page: Not provided
Paper: https://arxiv.org/pdf/2401.15532

Post Views: 236

Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

MunTTS: A Text-to-Speech System for Mundari

Validation of artificial neural networks to model the acoustic behaviour of induction motors

Leave a Reply Cancel reply

Please allow ads on our site