MAST: A Memory-Augmented Self-Supervised Tracker

Abstract

Recent interest in self-supervised dense tracking has yielded rapid progress, but performance still remains far from supervised methods. We propose a dense tracking model trained on videos without any annotations that surpasses previous self-supervised methods on existing benchmarks by a significant margin (+15%), and achieves performance comparable to supervised methods. In this paper, we first reassess the traditional choices used for self-supervised training and reconstruction loss by conducting thorough experiments that finally elucidate the optimal choices. Second, we further improve on existing methods by augmenting our architecture with a crucial memory component. Third, we benchmark on large-scale semi-supervised video object segmentation (aka. dense tracking), and propose a new metric: generalizability. Our first two contributions yield a self-supervised network that for the first time is competitive with supervised methods on standard evaluation metrics of dense tracking. When measuring generalizability, we show self-supervised approaches are actually superior to the majority of supervised methods. We believe this new generalizability metric can better capture the real-world use-cases for dense tracking, and will spur new interest in this research direction.

Cite

Text

Lai et al. "MAST: A Memory-Augmented Self-Supervised Tracker." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00651

Markdown

[Lai et al. "MAST: A Memory-Augmented Self-Supervised Tracker." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/lai2020cvpr-mast/) doi:10.1109/CVPR42600.2020.00651

BibTeX

@inproceedings{lai2020cvpr-mast,
  title     = {{MAST: A Memory-Augmented Self-Supervised Tracker}},
  author    = {Lai, Zihang and Lu, Erika and Xie, Weidi},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00651},
  url       = {https://mlanthology.org/cvpr/2020/lai2020cvpr-mast/}
}