Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting
Abstract
Recent deep learning-based multi-view people detection (MVD) methods have shown promising results on existing datasets. However, current methods are mainly trained and evaluated on small, single scenes with a limited number of multi-view frames and fixed camera views. As a result, these methods may not be practical for detecting people in larger, more complex scenes with severe occlusions and camera calibration errors. This paper focuses on improving multi-view people detection by developing a supervised view-wise contribution weighting approach that better fuses multi-camera information under large scenes. Besides, a large synthetic dataset is adopted to enhance the model's generalization ability and enable more practical evaluation and comparison. The model's performance on new testing scenes is further improved with a simple domain adaptation technique. Experimental results demonstrate the effectiveness of our approach in achieving promising cross-scene multi-view people detection performance.
Cite
Text
Zhang et al. "Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I7.28553Markdown
[Zhang et al. "Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/zhang2024aaai-multi-b/) doi:10.1609/AAAI.V38I7.28553BibTeX
@inproceedings{zhang2024aaai-multi-b,
title = {{Multi-View People Detection in Large Scenes via Supervised View-Wise Contribution Weighting}},
author = {Zhang, Qi and Gong, Yunfei and Chen, Daijie and Chan, Antoni B. and Huang, Hui},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {7242-7250},
doi = {10.1609/AAAI.V38I7.28553},
url = {https://mlanthology.org/aaai/2024/zhang2024aaai-multi-b/}
}