GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images

Liu, Chenglong; Wei, Haoran; Yang, Jinze; Liu, Jintao; Li, Wenxi; Guo, Yuchen; Fang, Lu

doi:10.1609/AAAI.V38I9.28873

AAAI 2024 pp. 10092-10100


Abstract

Person detection in super-high-resolution images remains a challenging task. Modern detectors, which usually encode a box by its center and width/height, struggle with accuracy here for two reasons: 1) Human characteristics: people appear in varied postures, so the center point has a high degree of freedom and it is difficult to capture a robust visual pattern for it; 2) Image characteristics: because of the vast scale diversity of gigapixel-level input, the distance regression for width and height is hard to pin down, especially for a large-scale person who is near the camera. To address these challenges, we propose GigaHumanDet, a detector aimed at improving detection accuracy on gigapixel-level images. GigaHumanDet employs corner modeling to avoid the issues caused by the high degree of freedom in center pinpointing. To better distinguish similar-looking persons and enforce instance consistency of corner pairs, an instance-guided learning approach is designed to capture discriminative individual semantics. Further, we devise a reliable shape-aware bodyness, equipped with a multi-precision strategy, as guidance for human corner matching that is adapted to single-view large scenes. Experimental results on the PANDA and STCrowd datasets show the superiority and strong applicability of our design. Notably, our model achieves 82.4% AP, outperforming the current state of the art by more than 10%.
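
To make the two box encodings mentioned in the abstract concrete, here is a minimal Python sketch. It is not the authors' implementation: the function names, the `Box` alias, and the example numbers are all assumptions chosen for illustration. It only shows how a center/size box relates to a corner-pair box, and how a fixed relative error in a regressed width moves the box edges by more pixels when the person is large.

```python
from typing import Tuple

# Hypothetical alias: a box stored as (x1, y1, x2, y2) corner coordinates.
Box = Tuple[float, float, float, float]


def center_to_corners(cx: float, cy: float, w: float, h: float) -> Box:
    """Convert a center/size box to top-left and bottom-right corners."""
    return (cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2)


def corners_to_center(x1: float, y1: float, x2: float, y2: float) -> Box:
    """Convert a corner-pair box back to (cx, cy, w, h)."""
    return ((x1 + x2) / 2, (y1 + y2) / 2, x2 - x1, y2 - y1)


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two corner-format boxes."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def edge_shift_from_width_error(w: float, rel_err: float) -> float:
    """Pixels each vertical edge moves if the width is off by rel_err."""
    return w * rel_err / 2


if __name__ == "__main__":
    # Illustrative sizes only: a distant person and a near-camera person.
    for width in (20.0, 2000.0):
        shift = edge_shift_from_width_error(width, rel_err=0.05)
        print(f"width={width:.0f}px, 5% width error -> edges move {shift:.1f}px")
```

In this sketch the same 5% width error moves each edge by 0.5 px for the 20 px box and by 50 px for the 2000 px box, which is one way to read the abstract's remark that distance regression is hard to pin down for people near the camera.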


Cite

Text

Liu et al. "GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I9.28873

Markdown

[Liu et al. "GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liu2024aaai-gigahumandet/) doi:10.1609/AAAI.V38I9.28873

BibTeX

@inproceedings{liu2024aaai-gigahumandet,
  title     = {{GigaHumanDet: Exploring Full-Body Detection on Gigapixel-Level Images}},
  author    = {Liu, Chenglong and Wei, Haoran and Yang, Jinze and Liu, Jintao and Li, Wenxi and Guo, Yuchen and Fang, Lu},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {10092-10100},
  doi       = {10.1609/AAAI.V38I9.28873},
  url       = {https://mlanthology.org/aaai/2024/liu2024aaai-gigahumandet/}
}