Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
The article discusses the use of pretrained Vision-Language Models (VLMs) as zero-shot reward models (RMs) for Reinforcement Learning (RL). The authors propose a method, called VLM-RMs, that uses these models…
Continue reading