Self-Paced Learning for Long-Term Tracking

Abstract

We address the problem of long-term object tracking, where the object may become occluded or leave-the-view. In this setting, we show that an accurate appearance model is considerably more effective than a strong motion model. We develop simple but effective algorithms that alternate between tracking and learning a good appearance model given a track. We show that it is crucial to learn from the "right" frames, and use the formalism of self-paced curriculum learning to automatically select such frames. We leverage techniques from object detection for learning accurate appearance-based templates, demonstrating the importance of using a large negative training set (typically not used for tracking). We describe both an offline algorithm (that processes frames in batch) and a linear-time online (i.e. causal) algorithm that approaches real-time performance. Our models significantly outperform prior art, reducing the average error on benchmark videos by a factor of 4.

Cite

Text

Iii and Ramanan. "Self-Paced Learning for Long-Term Tracking." Conference on Computer Vision and Pattern Recognition, 2013. doi:10.1109/CVPR.2013.308

Markdown

[Iii and Ramanan. "Self-Paced Learning for Long-Term Tracking." Conference on Computer Vision and Pattern Recognition, 2013.](https://mlanthology.org/cvpr/2013/iii2013cvpr-selfpaced/) doi:10.1109/CVPR.2013.308

BibTeX

@inproceedings{iii2013cvpr-selfpaced,
  title     = {{Self-Paced Learning for Long-Term Tracking}},
  author    = {Iii, James S. Supancic and Ramanan, Deva},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2013},
  doi       = {10.1109/CVPR.2013.308},
  url       = {https://mlanthology.org/cvpr/2013/iii2013cvpr-selfpaced/}
}