September 27, 2023

Video-adverb retrieval with compositional adverb-action embeddings

The study by Hummel et al. presents a framework for video-to-adverb retrieval and, in the other direction, adverb-to-video retrieval. The method aligns each video embedding with its corresponding compositional adverb-action text embedding in a joint embedding space. The adverb-action text embedding is learned using a residual gating mechanism. According to the authors, the framework outperforms previous work on retrieving adverbs from videos for unseen adverb-action compositions. The method is relevant to video search and retrieval, and to fine-grained understanding of how actions are performed in videos.
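To make the idea of a residual-gated adverb-action embedding more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the specific form (action embedding plus a sigmoid gate times a tanh residual), the function names `compose` and `rank_adverbs`, the random weights, and the toy 4-dimensional embeddings are all assumptions made for illustration. It composes an action embedding with each candidate adverb and ranks adverbs by cosine similarity to a video embedding in the shared space.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def linear(weights, vec):
    """Multiply a (rows x cols) weight matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def compose(action, adverb, w_gate, w_res):
    """Assumed residual gating: action + gate * residual.

    Gate and residual are both computed from the concatenated
    action and adverb embeddings (an illustrative choice).
    """
    joint = action + adverb  # list concatenation, length 2 * dim
    gate = [sigmoid(g) for g in linear(w_gate, joint)]
    residual = [math.tanh(r) for r in linear(w_res, joint)]
    return [a + g * r for a, g, r in zip(action, gate, residual)]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_adverbs(video_emb, action, adverbs, w_gate, w_res):
    """Return adverb names sorted by similarity to the video embedding."""
    scores = {
        name: cosine(video_emb, compose(action, emb, w_gate, w_res))
        for name, emb in adverbs.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy setup: random stand-ins for learned weights and embeddings.
rng = random.Random(0)
dim = 4


def rand_vec(n):
    return [rng.uniform(-1.0, 1.0) for _ in range(n)]


w_gate = [rand_vec(2 * dim) for _ in range(dim)]
w_res = [rand_vec(2 * dim) for _ in range(dim)]
action = rand_vec(dim)
adverbs = {name: rand_vec(dim) for name in ("slowly", "quickly", "carefully")}

# Pretend the video encoder maps the clip exactly onto the composed
# "slowly" + action text embedding; a trained model would only approximate this.
video_emb = compose(action, adverbs["slowly"], w_gate, w_res)
ranking = rank_adverbs(video_emb, action, adverbs, w_gate, w_res)
print(ranking[0])
```

The residual form keeps the composed embedding anchored to the action and lets the adverb contribute only a gated modification. In a trained system the weights and the video encoder would be learned jointly so that matching pairs end up close in the joint space.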

Publication date: 26 Sep 2023
Project Page: https://hummelth.github.io/ReGaDa/
Paper: https://arxiv.org/pdf/2309.15086

Tags: action recognition, adverb-action text embedding, residual gating mechanism, video embeddings, video-adverb retrieval
