Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking

Abstract

Due to the complementary benefits of visible (RGB) and thermal infrared (T) data, RGB-T object tracking attracts more and more attention recently for boosting the performance under adverse illumination conditions. Existing RGB-T tracking methods usually localize a target object with a bounding box, in which the trackers or detectors is often affected by the inclusion of background clutter. To address this problem, this paper presents a novel approach to suppress background effects for RGB-T tracking. Our approach relies on a novel cross-modal manifold ranking algorithm. First, we integrate the soft cross-modality consistency into the ranking model which allows the sparse inconsistency to account for the different properties between these two modalities. Second, we propose an optimal query learning method to handle label noises of queries. In particular, we introduce an intermediate variable to represent the optimal labels, and formulate it as a $l_1$-optimization based sparse learning problem. Moreover, we propose a single unified optimization algorithm to solve the proposed model with stable and efficient convergence behavior. Finally, the ranking results are incorporated into the patch-based object features to address the background effects, and the structured SVM is then adopted to perform RGB-T tracking. Extensive experiments suggest that the proposed approach performs well against the state-of-the-art methods on large-scale benchmark datasets.

Cite

Text

Li et al. "Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking." Proceedings of the European Conference on Computer Vision (ECCV), 2018. doi:10.1007/978-3-030-01261-8_49

Markdown

[Li et al. "Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking." Proceedings of the European Conference on Computer Vision (ECCV), 2018.](https://mlanthology.org/eccv/2018/li2018eccv-crossmodal/) doi:10.1007/978-3-030-01261-8_49

BibTeX

@inproceedings{li2018eccv-crossmodal,
  title     = {{Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking}},
  author    = {Li, Chenglong and Zhu, Chengli and Huang, Yan and Tang, Jin and Wang, Liang},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2018},
  doi       = {10.1007/978-3-030-01261-8_49},
  url       = {https://mlanthology.org/eccv/2018/li2018eccv-crossmodal/}
}