The Benefits of Evaluating Tracker Performance Using Pixel-Wise Segmentations
Abstract
For years, the ground truth data for evaluating object trackers consists of axis-aligned or oriented boxes. This greatly reduces the workload of labeling the datasets in the common benchmarks. Nevertheless, boxes are a very coarse approximation of an object and the approximation by a box has a large degree of ambiguity. Furthermore, tracking approaches that are not restricted to boxes cannot be evaluated within the benchmarks without adding a penalty to them. We present a simple extension to the VOT evaluation procedure that enables to include these approaches. Furthermore, we present upper bounds for trackers restricted to boxes. Moreover, we present a new measure that captures how well an approach can cope with scale changes without the need of frame-wise labels. We present a learning-based approach which helps to identify frames with heavy occlusion automatically. The framework is tested on the segmentations of the VOT2016 dataset.
Cite
Text
Böttger and Follmann. "The Benefits of Evaluating Tracker Performance Using Pixel-Wise Segmentations." IEEE/CVF International Conference on Computer Vision Workshops, 2017. doi:10.1109/ICCVW.2017.232Markdown
[Böttger and Follmann. "The Benefits of Evaluating Tracker Performance Using Pixel-Wise Segmentations." IEEE/CVF International Conference on Computer Vision Workshops, 2017.](https://mlanthology.org/iccvw/2017/bottger2017iccvw-benefits/) doi:10.1109/ICCVW.2017.232BibTeX
@inproceedings{bottger2017iccvw-benefits,
title = {{The Benefits of Evaluating Tracker Performance Using Pixel-Wise Segmentations}},
author = {Böttger, Tobias and Follmann, Patrick},
booktitle = {IEEE/CVF International Conference on Computer Vision Workshops},
year = {2017},
pages = {1983-1991},
doi = {10.1109/ICCVW.2017.232},
url = {https://mlanthology.org/iccvw/2017/bottger2017iccvw-benefits/}
}