STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification

Abstract

In this work, we propose a novel Spatial-Temporal Attention (STA) approach to tackle the large-scale person reidentification task in videos. Different from the most existing methods, which simply compute representations of video clips using frame-level aggregation (e.g. average pooling), the proposed STA adopts a more effective way for producing robust clip-level feature representation. Concretely, our STA fully exploits those discriminative parts of one target person in both spatial and temporal dimensions, which results in a 2-D attention score matrix via inter-frame regularization to measure the importances of spatial parts across different frames. Thus, a more robust clip-level feature representation can be generated according to a weighted sum operation guided by the mined 2-D attention score matrix. In this way, the challenging cases for video-based person re-identification such as pose variation and partial occlusion can be well tackled by the STA. We conduct extensive experiments on two large-scale benchmarks, i.e. MARS and DukeMTMCVideoReID. In particular, the mAP reaches 87.7% on MARS, which significantly outperforms the state-of-the-arts with a large margin of more than 11.6%.

Cite

Text

Fu et al. "STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33018287

Markdown

[Fu et al. "STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/fu2019aaai-sta/) doi:10.1609/AAAI.V33I01.33018287

BibTeX

@inproceedings{fu2019aaai-sta,
  title     = {{STA: Spatial-Temporal Attention for Large-Scale Video-Based Person Re-Identification}},
  author    = {Fu, Yang and Wang, Xiaoyang and Wei, Yunchao and Huang, Thomas S.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2019},
  pages     = {8287-8294},
  doi       = {10.1609/AAAI.V33I01.33018287},
  url       = {https://mlanthology.org/aaai/2019/fu2019aaai-sta/}
}