SLM: Bridge the thin gap between speech and text foundation models
The authors introduce a multitask, multilingual, dual-modal Speech and Language Model (SLM). The SLM uses pretrained foundational speech and language models, preserving their capabilities while training a simple adapter with…