Distilling Knowledge from CNN-Transformer Models for Enhanced Human Action Recognition
This research presents an approach to improve human action recognition using knowledge distillation, and the combination of Convolutional Neural Networks (CNN) and Vision Transformer (ViT) models. The aim is to…
Continue reading