DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking
Abstract
Existing Siamese or transformer trackers commonly pose visual object tracking as a one-shot detection problem i.e. locating the target object in a single forward evaluation scheme. Despite the demonstrated success these trackers may easily drift towards distractors with similar appearance due to the single forward evaluation scheme lacking self-correction. To address this issue we cast visual tracking as a point set based denoising diffusion process and propose a novel generative learning based tracker dubbed DiffusionTrack. Our DiffusionTrack possesses two appealing properties: 1) It follows a novel noise-to-target tracking paradigm that leverages multiple denoising diffusion steps to localize the target in a dynamic searching manner per frame. 2) It models the diffusion process using a point set representation which can better handle appearance variations for more precise localization. One side benefit is that DiffusionTrack greatly simplifies the post-processing e.g. removing window penalty scheme. Without bells and whistles our DiffusionTrack achieves leading performance over the state-of-the-art trackers and runs in real-time. The code is in https://github.com/VISION-SJTU/DiffusionTrack.
Cite
Text
Xie et al. "DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking." Conference on Computer Vision and Pattern Recognition, 2024. doi:10.1109/CVPR52733.2024.01808Markdown
[Xie et al. "DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking." Conference on Computer Vision and Pattern Recognition, 2024.](https://mlanthology.org/cvpr/2024/xie2024cvpr-diffusiontrack/) doi:10.1109/CVPR52733.2024.01808BibTeX
@inproceedings{xie2024cvpr-diffusiontrack,
title = {{DiffusionTrack: Point Set Diffusion Model for Visual Object Tracking}},
author = {Xie, Fei and Wang, Zhongdao and Ma, Chao},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2024},
pages = {19113-19124},
doi = {10.1109/CVPR52733.2024.01808},
url = {https://mlanthology.org/cvpr/2024/xie2024cvpr-diffusiontrack/}
}