Appearance and Motion Enhancement for Video-Based Person Re-Identification

Abstract

In this paper, we propose an Appearance and Motion Enhancement Model (AMEM) for video-based person re-identification to enrich the two kinds of information contained in the backbone network in a more interpretable way. Concretely, human attribute recognition under the supervision of pseudo labels is exploited in an Appearance Enhancement Module (AEM) to help enrich the appearance and semantic information. A Motion Enhancement Module (MEM) is designed to capture the identity-discriminative walking patterns through predicting future frames. Despite a complex model with several auxiliary modules during training, only the backbone model plus two small branches are kept for similarity evaluation which constitute a simple but effective final model. Extensive experiments conducted on three popular video-based person ReID benchmarks demonstrate the effectiveness of our proposed model and the state-of-the-art performance compared with existing methods.

Cite

Text

Li et al. "Appearance and Motion Enhancement for Video-Based Person Re-Identification." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I07.6802

Markdown

[Li et al. "Appearance and Motion Enhancement for Video-Based Person Re-Identification." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/li2020aaai-appearance/) doi:10.1609/AAAI.V34I07.6802

BibTeX

@inproceedings{li2020aaai-appearance,
  title     = {{Appearance and Motion Enhancement for Video-Based Person Re-Identification}},
  author    = {Li, Shuzhao and Yu, Huimin and Hu, Haoji},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {11394-11401},
  doi       = {10.1609/AAAI.V34I07.6802},
  url       = {https://mlanthology.org/aaai/2020/li2020aaai-appearance/}
}