DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion

Abstract

A typical pipeline for multi-object tracking (MOT) is to use a detector for object localization, and following re-identification (re-ID) for object association. This pipeline is partially motivated by recent progress in both object detection and re-ID, and partially motivated by biases in existing tracking datasets, where most objects tend to have distinguishing appearance and re-ID models are sufficient for establishing associations. In response to such bias, we would like to re-emphasize that methods for multi-object tracking should also work when object appearance is not sufficiently discriminative. To this end, we propose a large-scale dataset for multi-human tracking, where humans have similar appearance, diverse motion and extreme articulation. As the dataset contains mostly group dancing videos, we name it "DanceTrack". We expect DanceTrack to provide a better platform to develop more MOT algorithms that rely less on visual discrimination and depend more on motion analysis. We benchmark several state-of-the-art trackers on our dataset and observe a significant performance drop on DanceTrack when compared against existing benchmarks. The dataset, project code and competition is released at: https://github.com/DanceTrack.

Cite

Text

Sun et al. "DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.02032

Markdown

[Sun et al. "DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/sun2022cvpr-dancetrack/) doi:10.1109/CVPR52688.2022.02032

BibTeX

@inproceedings{sun2022cvpr-dancetrack,
  title     = {{DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion}},
  author    = {Sun, Peize and Cao, Jinkun and Jiang, Yi and Yuan, Zehuan and Bai, Song and Kitani, Kris and Luo, Ping},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {20993-21002},
  doi       = {10.1109/CVPR52688.2022.02032},
  url       = {https://mlanthology.org/cvpr/2022/sun2022cvpr-dancetrack/}
}