UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement

Abstract

Recently, Multiple Object Tracking has achieved great success, which consists of object detection, feature embedding, and identity association. Existing methods apply the three-step or two-step paradigm to generate robust trajectories, where identity association is independent of other components. However, the independent identity association results in the identity-aware knowledge contained in the tracklet not be used to boost the detection and embedding modules. To overcome the limitations of existing methods, we introduce a novel Unified Tracking Model (UTM) to bridge those three components for generating a positive feedback loop with mutual benefits. The key insight of UTM is the Identity-Aware Feature Enhancement (IAFE), which is applied to bridge and benefit these three components by utilizing the identity-aware knowledge to boost detection and embedding. Formally, IAFE contains the Identity-Aware Boosting Attention (IABA) and the Identity-Aware Erasing Attention (IAEA), where IABA enhances the consistent regions between the current frame feature and identity-aware knowledge, and IAEA suppresses the distracted regions in the current frame feature. With better detections and embeddings, higher-quality tracklets can also be generated. Extensive experiments of public and private detections on three benchmarks demonstrate the robustness of UTM.

Cite

Text

You et al. "UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02095

Markdown

[You et al. "UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/you2023cvpr-utm/) doi:10.1109/CVPR52729.2023.02095

BibTeX

@inproceedings{you2023cvpr-utm,
  title     = {{UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement}},
  author    = {You, Sisi and Yao, Hantao and Bao, Bing-Kun and Xu, Changsheng},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2023},
  pages     = {21876-21886},
  doi       = {10.1109/CVPR52729.2023.02095},
  url       = {https://mlanthology.org/cvpr/2023/you2023cvpr-utm/}
}