T-MARS: Improving Visual Representations by Circumventing Text Feature Learning
T-MARS is a new data-filtering approach designed to improve the visual representations learned from large-scale image-text datasets. The method is motivated by the observation that a significant portion of images in…