Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReID
Abstract
Detecting and tracking multiple unmanned aerial vehicles (UAVs) in thermal infrared video is inherently challenging due to low contrast, environmental noise, and small target sizes. This paper provides a straightforward approach to address multi-UAV tracking in thermal infrared video, leveraging recent advances in detection and tracking. Instead of relying on the well-established YOLOv5 with DeepSORT combination, we present a tracking framework built on YOLOv12 and BoT-SORT, enhanced with tailored training and inference strategies. We evaluate our approach following the 4th Anti-UAV Challenge metrics and reach competitive performance. Notably, we achieved strong results without using contrast enhancement or temporal information fusion to enrich UAV features, highlighting our approach as a "Strong Baseline" for multi-UAV tracking tasks. We provide implementation details, in-depth experimental analysis, and a discussion of potential improvements. The code is available at https://github.com/wish44165/YOLOv12-BoT-SORT-ReID .
Cite
Text
Chen. "Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReID." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.Markdown
[Chen. "Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReID." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/chen2025cvprw-strong/)BibTeX
@inproceedings{chen2025cvprw-strong,
title = {{Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReID}},
author = {Chen, Yu-Hsi},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2025},
pages = {6573-6582},
url = {https://mlanthology.org/cvprw/2025/chen2025cvprw-strong/}
}