Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering

Abstract

Multi-camera Multi-object tracking (MTMC) surpasses conventional single-camera tracking by enabling seamless object tracking across multiple camera views. This capability is critical for security systems and improving situational awareness in various environments. This paper proposes a novel MTMC framework designed for online operation. The framework employs a three-stage pipeline: Multiobject Tracking (MOT), Multi-target Multi-camera Tracking (MTMC), and Cross Interval Synchronization (CIS). In the MOT stage, ReID features are extracted and localized tracklets are created. MTMC links these tracklets across cameras using spatial-temporal constraints and constraint hierarchical clustering with anchor features for improved inter-camera association. Finally, CIS ensures the temporal coherence of tracklets across time intervals. The proposed framework achieves robust tracking performance, validated on the challenging 2024 AI City Challenge with a HOTA score of 51.0556%, ranking sixth. The code is available at: https://github.com/ARV-MLCORE/AIC2024Track1ARV

Cite

Text

Cherdchusakulchai et al. "Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00715

Markdown

[Cherdchusakulchai et al. "Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/cherdchusakulchai2024cvprw-online/) doi:10.1109/CVPRW63382.2024.00715

BibTeX

@inproceedings{cherdchusakulchai2024cvprw-online,
  title     = {{Online Multi-Camera People Tracking with Spatial-Temporal Mechanism and Anchor-Feature Hierarchical Clustering}},
  author    = {Cherdchusakulchai, Riu and Phimsiri, Sasin and Trairattanapa, Visarut and Tungjitnob, Suchat and Kudisthalert, Wasu and Kiawjak, Pornprom and Thamwiwatthana, Ek and Borisuitsawat, Phawat and Tosawadi, Teepakorn and Choppradit, Pakcheera and Mahakijdechachai, Kasisdis and Vatathanavaro, Supawit and Saetan, Worawit and Suttichaya, Vasin},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2024},
  pages     = {7198-7207},
  doi       = {10.1109/CVPRW63382.2024.00715},
  url       = {https://mlanthology.org/cvprw/2024/cherdchusakulchai2024cvprw-online/}
}