A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting

This paper presents two lightweight models for crowd counting, which is the estimation of the number of people in a crowd from an image or a video. The models, ASFNet-S and ASFNet-B, use MobileNet and MobileViT backbones respectively and incorporate an adjacent feature fusion technique to extract diverse scale features from a pre-trained model. They offer improved performance while maintaining a compact and efficient design. The models are compared with state-of-the-art methods and found to give comparable results, while being more computationally efficient. The paper also includes a comparative and an extensive ablation study, as well as pruning to demonstrate the effectiveness of the models.

Publication date: 12 Jan 2024
Project Page: Not provided
Paper: https://arxiv.org/pdf/2401.05968

Post Views: 287

A Lightweight Feature Fusion Architecture For Resource-Constrained Crowd Counting

root

Leave a Reply Cancel reply

Press ESC to close

Share Article:

root

UAVD4L: A Large-Scale Dataset for UAV 6-DoF Localization

CoSSegGaussians: Compact and Swift Scene Segmenting 3D Gaussians

Leave a Reply Cancel reply

Please allow ads on our site