DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages
The article introduces DISCO, a large-scale human annotated corpus for disfluency correction in four Indo-European languages: English, Hindi, German, and French. Disfluency correction is the process of removing disfluent elements…
Continue reading