Affordance Papers - BytesArchive

Computation and Language Computer Vision and Pattern Recognition

GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration

root November 21, 2023 0

The paper introduces a pipeline that enhances a Vision Language Model, GPT-4V(ision), by incorporating human action observations to facilitate robotic manipulation. This system analyzes videos of humans performing tasks and…

Press ESC to close

Affordance

Please allow ads on our site