Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification

Abstract

Vehicle Re-Identification is to find images of the same vehicle from various views in the cross-camera scenario. The main challenges of this task are the large intra-instance distance caused by different views and the subtle inter-instance discrepancy caused by similar vehicles. In this paper, we propose a parsing-based view-aware embedding network (PVEN) to achieve the view-aware feature alignment and enhancement for vehicle ReID. First, we introduce a parsing network to parse a vehicle into four different views and then align the features by mask average pooling. Such alignment provides a fine-grained representation of the vehicle. Second, in order to enhance the view-aware features, we design a common-visible attention to focus on the common visible views, which not only shortens the distance among intra-instances, but also enlarges the discrepancy of inter-instances. The PVEN helps capture the stable discriminative information of vehicle under different views. The experiments conducted on three datasets show that our model outperforms state-of-the-art methods by a large margin.

Cite

Text

Meng et al. "Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020. doi:10.1109/CVPR42600.2020.00713

Markdown

[Meng et al. "Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020.](https://mlanthology.org/cvpr/2020/meng2020cvpr-parsingbased/) doi:10.1109/CVPR42600.2020.00713

BibTeX

@inproceedings{meng2020cvpr-parsingbased,
  title     = {{Parsing-Based View-Aware Embedding Network for Vehicle Re-Identification}},
  author    = {Meng, Dechao and Li, Liang and Liu, Xuejing and Li, Yadong and Yang, Shijie and Zha, Zheng-Jun and Gao, Xingyu and Wang, Shuhui and Huang, Qingming},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2020},
  doi       = {10.1109/CVPR42600.2020.00713},
  url       = {https://mlanthology.org/cvpr/2020/meng2020cvpr-parsingbased/}
}