EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
This article introduces EmoCLIP, a new vision-language model that enhances learning of rich latent representations for zero-shot classification. The model is tested using zero-shot classification on four popular dynamic FER…
Continue reading