Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
The paper discusses the creation of the Aya Dataset, a multilingual instruction-following dataset spanning 65 languages. The researchers collaborated with fluent speakers worldwide to collect natural instances of instructions and…
Continue reading