Context-Aware Deep Feature Compression for High-Speed Visual Tracking
Abstract
We propose a new context-aware correlation filter based tracking framework to achieve both high computational speed and state-of-the-art performance among real-time trackers. The major contribution to the high computational speed lies in the proposed deep feature compression that is achieved by a context-aware scheme utilizing multiple expert auto-encoders; a context in our framework refers to the coarse category of the tracking target according to appearance patterns. In the pre-training phase, one expert auto-encoder is trained per category. In the tracking phase, the best expert auto-encoder is selected for a given target, and only this auto-encoder is used. To achieve high tracking performance with the compressed feature map, we introduce extrinsic denoising processes and a new orthogonality loss term for pre-training and fine-tuning of the expert auto-encoders. We validate the proposed context-aware framework through a number of experiments, where our method achieves a comparable performance to state-of-the-art trackers which cannot run in real-time, while running at a significantly fast speed of over 100 fps.
Cite
Text
Choi et al. "Context-Aware Deep Feature Compression for High-Speed Visual Tracking." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018. doi:10.1109/CVPR.2018.00057Markdown
[Choi et al. "Context-Aware Deep Feature Compression for High-Speed Visual Tracking." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018.](https://mlanthology.org/cvpr/2018/choi2018cvpr-contextaware/) doi:10.1109/CVPR.2018.00057BibTeX
@inproceedings{choi2018cvpr-contextaware,
title = {{Context-Aware Deep Feature Compression for High-Speed Visual Tracking}},
author = {Choi, Jongwon and Chang, Hyung Jin and Fischer, Tobias and Yun, Sangdoo and Lee, Kyuewang and Jeong, Jiyeoup and Demiris, Yiannis and Choi, Jin Young},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2018},
doi = {10.1109/CVPR.2018.00057},
url = {https://mlanthology.org/cvpr/2018/choi2018cvpr-contextaware/}
}