Exploring Visual Context for Weakly Supervised Person Search

Abstract

Person search has recently emerged as a challenging task that jointly addresses pedestrian detection and person re-identification. Existing approaches follow a fully supervised setting where both bounding box and identity annotations are available. However, annotating identities is labor-intensive, limiting the practicability and scalability of current frameworks. This paper considers weakly supervised person search with only bounding box annotations. We propose to address this novel task by investigating three levels of context clues (i.e., detection, memory and scene) in unconstrained natural images. The first two are employed to promote local and global discriminative capabilities, while the last enhances clustering accuracy. Despite its simple design, our CGPS boosts the baseline model by 8.8% in mAP on CUHK-SYSU. Surprisingly, it even achieves performance comparable to several supervised person search models. Our code is available at https://github.com/ljpadam/CGPS.

Cite

Text

Yan et al. "Exploring Visual Context for Weakly Supervised Person Search." AAAI Conference on Artificial Intelligence, 2022. doi:10.1609/AAAI.V36I3.20209

Markdown

[Yan et al. "Exploring Visual Context for Weakly Supervised Person Search." AAAI Conference on Artificial Intelligence, 2022.](https://mlanthology.org/aaai/2022/yan2022aaai-exploring/) doi:10.1609/AAAI.V36I3.20209

BibTeX

@inproceedings{yan2022aaai-exploring,
  title     = {{Exploring Visual Context for Weakly Supervised Person Search}},
  author    = {Yan, Yichao and Li, Jinpeng and Liao, Shengcai and Qin, Jie and Ni, Bingbing and Lu, Ke and Yang, Xiaokang},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {3027--3035},
  doi       = {10.1609/AAAI.V36I3.20209},
  url       = {https://mlanthology.org/aaai/2022/yan2022aaai-exploring/}
}