Exploiting Temporal Coherence for Self-Supervised One-Shot Video Re-Identification
Abstract
While supervised techniques in re-identification are extremely effective, the need for large amounts of annotations makes them impractical for large camera networks. One-shot re-identification, which uses a singular labeled tracklet for each identity along with a pool of unlabeled tracklets, is a potential candidate towards reducing this labeling effort. Current one-shot re-identification methods function by modeling the inter-relationships amongst the labeled and the unlabeled data, but fail to fully exploit such relationships that exist within the pool of unlabeled data itself. In this paper, we propose a new framework named Temporal Consistency Progressive Learning, which uses temporal coherence as a novel self-supervised auxiliary task in the one-shot learning paradigm to capture such relationships amongst the unlabeled tracklets. Optimizing two new losses, which enforce consistency on a local and global scale, our framework can learn learn richer and more discriminative representations. Extensive experiments on two challenging video re-identification datasets - MARS and DukeMTMC-VideoReID - demonstrate that our proposed method is able to estimate the true labels of the unlabeled data more accurately by up to $8\%$, and obtain significantly better re-identification performance compared to the existing state-of-the-art techniques.
Cite
Text
Raychaudhuri and Roy-Chowdhury. "Exploiting Temporal Coherence for Self-Supervised One-Shot Video Re-Identification." Proceedings of the European Conference on Computer Vision (ECCV), 2020. doi:10.1007/978-3-030-58583-9_16Markdown
[Raychaudhuri and Roy-Chowdhury. "Exploiting Temporal Coherence for Self-Supervised One-Shot Video Re-Identification." Proceedings of the European Conference on Computer Vision (ECCV), 2020.](https://mlanthology.org/eccv/2020/raychaudhuri2020eccv-exploiting/) doi:10.1007/978-3-030-58583-9_16BibTeX
@inproceedings{raychaudhuri2020eccv-exploiting,
title = {{Exploiting Temporal Coherence for Self-Supervised One-Shot Video Re-Identification}},
author = {Raychaudhuri, Dripta S. and Roy-Chowdhury, Amit K.},
booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
year = {2020},
doi = {10.1007/978-3-030-58583-9_16},
url = {https://mlanthology.org/eccv/2020/raychaudhuri2020eccv-exploiting/}
}