VGSG: Vision-Guided Semantic-Group Network for Text-based Person Search

The article presents a new method for Text-based Person Search (TBPS), which is a task of retrieving images of people based on textual descriptions. The proposed method, called Vision-Guided Semantic-Group Network (VGSG), aims to extract well-aligned fine-grained visual and textual features more efficiently and without the need for external tools. The VGSG includes a Semantic-Group Textual Learning (SGTL) module and a Vision-guided Knowledge Transfer (VGKT) module to extract textual local features guided by visual local clues. The authors claim that their method outperforms existing ones based on experimental results on two benchmarks.

Publication date: 14 Nov 2023
Project Page: Not provided
Paper: https://arxiv.org/pdf/2311.07514

Post Views: 274

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

GPT-4V(ision) as A Social Media Analysis Engine

Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks

Leave a Reply Cancel reply

Please allow ads on our site