Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification

Khalid, Wajahat; Liu, Bin; Li, Xulin; Waqas, Muhammad; Afgan, Muhammad Sher

Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification

Wajahat Khalid, Bin Liu, Xulin Li, Muhammad Waqas, Muhammad Sher Afgan

ICCV 2025 pp. 9749-9758

/iccv/2025/khalid2025iccv-bridging/

Abstract

Aerial-Ground Person Re-Identification (AG-ReID) is a practical yet challenging task that involves cross-platform matching between aerial and ground cameras. Existing person Re-Identification (Re-ID) methods are primarily designed for homogeneous camera settings, such as ground-to-ground or aerial-to-aerial matching. Therefore, these conventional Re-ID approaches underperform due to the significant viewpoint discrepancies introduced by cross-platform cameras in the AG-ReID task. To address this limitation, we propose a novel and efficient approach, termed View-Invariant Feature Learning for Aerial-Ground Person Re-Identification (VIF-AGReID), which explores view-invariant features without leveraging any auxiliary information. Our approach introduces two key components: (1) Patch-Level RotateMix (PLRM), an augmentation strategy that enhances rotational diversity within local regions of training samples, enabling the model to capture fine-grained view-invariant features, and (2) View-Invariant Angular Loss (VIAL), which mitigates the impact of perspective variations by imposing angular constraints that exponentially penalize large angular deviations, optimizing the similarity of positive pairs while enhancing dissimilarity for hard negatives. These components interact synergistically to drive view-invariant feature learning, enhancing robustness across diverse viewpoints. Extensive experiments on the CARGO, AG-ReIDv1, and AG-ReIDv2 benchmarks demonstrate the effectiveness of our method in addressing the AG-ReID task.

PDF ICCV Semantic Scholar

Cite

Text

Khalid et al. "Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification." International Conference on Computer Vision, 2025.

Markdown

[Khalid et al. "Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification." International Conference on Computer Vision, 2025.](https://mlanthology.org/iccv/2025/khalid2025iccv-bridging/)

BibTeX

@inproceedings{khalid2025iccv-bridging,
  title     = {{Bridging the Sky and Ground: Towards View-Invariant Feature Learning for Aerial-Ground Person Re-Identification}},
  author    = {Khalid, Wajahat and Liu, Bin and Li, Xulin and Waqas, Muhammad and Afgan, Muhammad Sher},
  booktitle = {International Conference on Computer Vision},
  year      = {2025},
  pages     = {9749-9758},
  url       = {https://mlanthology.org/iccv/2025/khalid2025iccv-bridging/}
}