The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models
The article investigates the basic language capabilities of pre-trained multimodal models, questioning their understanding of image-text interaction. It introduces the BLA Benchmark, a tool designed to evaluate these models based…
Continue reading