AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-Based Person Re-Identification

Abstract

We introduce AG-VPReID, a new large-scale dataset for aerial-ground video-based person re-identification (ReID) that comprises 6,632 subjects, 32,321 tracklets and over 9.6 million frames captured by drones (altitudes ranging from 15-120m), CCTV, and wearable cameras. This dataset offers a real-world benchmark for evaluating the robustness to significant viewpoint changes, scale variations, and resolution differences in cross-platform aerial-ground settings. In addition, to address these challenges, we propose AG-VPReID-Net, an end-to-end framework composed of three complementary streams: (1) an Adapted Temporal-Spatial Stream addressing motion pattern inconsistencies and facilitating temporal feature learning, (2) a Normalized Appearance Stream leveraging physics-informed techniques to tackle resolution and appearance changes, and (3) a Multi-Scale Attention Stream handling scale variations across drone altitudes. We integrate visual-semantic cues from all streams to form a robust, viewpoint-invariant whole-body representation. Extensive experiments demonstrate that AG-VPReID-Net outperforms state-of-the-art approaches on both our new dataset and existing video-based ReID benchmarks, showcasing its effectiveness and generalizability. Nevertheless, the performance gap observed on AG-VPReID across all methods underscores the dataset's challenging nature. The dataset, code and trained models are available at https://github.com/agvpreid25/AG-VPReID-Net.

Cite

Text

Nguyen et al. "AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-Based Person Re-Identification." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.00124

Markdown

[Nguyen et al. "AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-Based Person Re-Identification." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/nguyen2025cvpr-agvpreid/) doi:10.1109/CVPR52734.2025.00124

BibTeX

@inproceedings{nguyen2025cvpr-agvpreid,
  title     = {{AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-Based Person Re-Identification}},
  author    = {Nguyen, Huy and Nguyen, Kien and Pemasiri, Akila and Liu, Feng and Sridharan, Sridha and Fookes, Clinton},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {1241-1251},
  doi       = {10.1109/CVPR52734.2025.00124},
  url       = {https://mlanthology.org/cvpr/2025/nguyen2025cvpr-agvpreid/}
}