DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking

Abstract

Existing Siamese or transformer trackers commonly pose visual object tracking as a one-shot detection problem i.e. locating the target object in a single forward evaluation scheme. Despite the demonstrated success these trackers may easily drift towards distractors with similar appearance due to the single forward evaluation scheme lacking self-correction. To address this issue we cast visual tracking as a point set based denoising diffusion process and propose a novel generative learning based tracker dubbed DiffusionTrack. Our DiffusionTrack possesses two appealing properties: 1) It follows a novel noise-to-target tracking paradigm that leverages multiple denoising diffusion steps to localize the target in a dynamic searching manner per frame. 2) It models the diffusion process using a point set representation which can better handle appearance variations for more precise localization. One side benefit is that DiffusionTrack greatly simplifies the post-processing e.g. removing window penalty scheme. Without bells and whistles our DiffusionTrack achieves leading performance over the state-of-the-art trackers and runs in real-time. The code is in https://github.com/VISION-SJTU/DiffusionTrack.

Cite

Text

Xie et al. "DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01808

Markdown

[Xie et al. "DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/xie2024cvpr-diffusiontrack/) doi:10.1109/CVPR52733.2024.01808

BibTeX

@inproceedings{xie2024cvpr-diffusiontrack,
  title     = {{DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking}},
  author    = {Xie, Fei and Wang, Zhongdao and Ma, Chao},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2024},
  pages     = {19113-19124},
  doi       = {10.1109/CVPR52733.2024.01808},
  url       = {https://mlanthology.org/cvpr/2024/xie2024cvpr-diffusiontrack/}
}