Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
DOI:
https://doi.org/10.1609/aaai.v38i7.28553Keywords:
CV: Object Detection & Categorization, CV: Motion & Tracking, CV: Scene Analysis & Understanding, CV: SegmentationAbstract
Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and camera calibration errors. This paper focuses on improving multi-view people detection by developing a supervised view-wise contribution weighting approach that better fuses multi-camera information under large scenes. Besides, a large synthetic dataset is adopted to enhance the model's generalization ability and enable more practical evaluation and comparison. The model's performance on new testing scenes is further improved with a simple domain adaptation technique. Experimental results demonstrate the effectiveness of our approach in achieving promising cross-scene multi-view people detection performance.Downloads
Published
2024-03-24
How to Cite
Zhang, Q., Gong, Y., Chen, D., Chan, A. B., & Huang, H. (2024). Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting. Proceedings of the AAAI Conference on Artificial Intelligence, 38(7), 7242–7250. https://doi.org/10.1609/aaai.v38i7.28553
Issue
Section
AAAI Technical Track on Computer Vision VI