Multi-Object Tracking via Constrained Sequential Labeling
Abstract
This paper presents a new approach to tracking people in crowded scenes, where people are subject to long-term (partial) occlusions and may assume varying postures and articulations. In such videos, detection-based trackers give poor performance since detecting people occurrences is not reliable, and common assumptions about locally smooth trajectories do not hold. Rather, we use temporal mid-level features (e.g., supervoxels or dense point trajectories) as a more coherent spatiotemporal basis for handling occlusion and pose variations.Thus, we formulate tracking as labeling mid-level features by object identifiers, and specify a new approach, called constrained sequential labeling (CSL), for performing this labeling. CSL uses a cost function to sequentially assign labels while respecting the implications of hard constraints computed via constraint propagation. A key feature of this approach is that it allows for the use of flexible cost functions and constraints that capture complex dependencies that cannot be represented in standard network-flow formulations. To exploit this flexibility we describe how to learn constraints and give a provably correct learning algorithms for cost functions that achieves finitetime convergence at a rate that improves with the strength of the constraints. Our experimental results indicate that CSL outperforms the state-of-the-art on challenging real-world videos of volleyball, basketball, and pedestrians walking.
Cite
Text
Chen et al. "Multi-Object Tracking via Constrained Sequential Labeling." Conference on Computer Vision and Pattern Recognition, 2014. doi:10.1109/CVPR.2014.148Markdown
[Chen et al. "Multi-Object Tracking via Constrained Sequential Labeling." Conference on Computer Vision and Pattern Recognition, 2014.](https://mlanthology.org/cvpr/2014/chen2014cvpr-multiobject/) doi:10.1109/CVPR.2014.148BibTeX
@inproceedings{chen2014cvpr-multiobject,
title = {{Multi-Object Tracking via Constrained Sequential Labeling}},
author = {Chen, Sheng and Fern, Alan and Todorovic, Sinisa},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2014},
doi = {10.1109/CVPR.2014.148},
url = {https://mlanthology.org/cvpr/2014/chen2014cvpr-multiobject/}
}