UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement
Abstract
Recently, Multiple Object Tracking has achieved great success, which consists of object detection, feature embedding, and identity association. Existing methods apply the three-step or two-step paradigm to generate robust trajectories, where identity association is independent of other components. However, the independent identity association results in the identity-aware knowledge contained in the tracklet not be used to boost the detection and embedding modules. To overcome the limitations of existing methods, we introduce a novel Unified Tracking Model (UTM) to bridge those three components for generating a positive feedback loop with mutual benefits. The key insight of UTM is the Identity-Aware Feature Enhancement (IAFE), which is applied to bridge and benefit these three components by utilizing the identity-aware knowledge to boost detection and embedding. Formally, IAFE contains the Identity-Aware Boosting Attention (IABA) and the Identity-Aware Erasing Attention (IAEA), where IABA enhances the consistent regions between the current frame feature and identity-aware knowledge, and IAEA suppresses the distracted regions in the current frame feature. With better detections and embeddings, higher-quality tracklets can also be generated. Extensive experiments of public and private detections on three benchmarks demonstrate the robustness of UTM.
Cite
Text
You et al. "UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement." Conference on Computer Vision and Pattern Recognition, 2023. doi:10.1109/CVPR52729.2023.02095Markdown
[You et al. "UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement." Conference on Computer Vision and Pattern Recognition, 2023.](https://mlanthology.org/cvpr/2023/you2023cvpr-utm/) doi:10.1109/CVPR52729.2023.02095BibTeX
@inproceedings{you2023cvpr-utm,
title = {{UTM: A Unified Multiple Object Tracking Model with Identity-Aware Feature Enhancement}},
author = {You, Sisi and Yao, Hantao and Bao, Bing-Kun and Xu, Changsheng},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2023},
pages = {21876-21886},
doi = {10.1109/CVPR52729.2023.02095},
url = {https://mlanthology.org/cvpr/2023/you2023cvpr-utm/}
}