Learning Temporal Sentence Grounding From Narrated EgoVideos
The paper introduces a method for Temporal Sentence Grounding (TSG) in long-form egocentric datasets such as Ego4D and EPIC-Kitchens. The method, known as Clip Merging (Cli Mer), learns to ground…
Continue reading