Video-adverb retrieval with compositional adverb-action embeddings
The study by Hummel et al. presents a framework for video-to-adverb retrieval and vice versa. The method aligns video embeddings with their corresponding compositional adverb-action text embedding in a joint…
Continue reading