Self-Prompting Analogical Reasoning for UAV Object Detection
Abstract
Unmanned Aerial Vehicle Object Detection (UAVOD) presents unique challenges due to varying altitudes, dynamic backgrounds, and the small size of objects. Traditional detection methods often struggle with these challenges because they typically rely on visual features alone and fail to capture the semantic relations between objects. To address these limitations, we propose a novel approach named Self-Prompting Analogical Reasoning (SPAR). Our method uses a vision-language model (CLIP) to generate context-aware prompts from image features, providing rich semantic information that guides analogical reasoning. SPAR consists of two main modules: self-prompting and analogical reasoning. The self-prompting module, built on a learnable description and the CLIP text encoder, generates a context-aware prompt by incorporating image-specific features; an objectness prompt score map is then produced by computing the similarity between pixel-level features and the context-aware prompt. With this score map, multi-scale image features are enhanced and pixel-level features are selected for graph construction. In the analogical reasoning module, the graph nodes consist of category-level prompt nodes and pixel-level image feature nodes, and analogical inference is performed by graph convolution. Under the guidance of the category-level nodes, object features at different scales are enhanced, which helps achieve more accurate detection of challenging objects. Extensive experiments show that SPAR outperforms traditional methods, offering a more robust and accurate solution for UAVOD.
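The pipeline described in the abstract (score map, feature enhancement, node selection, graph convolution) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The abstract does not specify these details, so the following are assumptions made for illustration only: cosine similarity as the similarity measure, multiplicative enhancement by the score map, top-k selection of pixel nodes, a similarity-based adjacency matrix, and all function names. Random arrays stand in for CLIP image and text features.

```python
import numpy as np

def l2_normalize(x, axis=-1, eps=1e-8):
    """Scale vectors to unit length along `axis`."""
    return x / (np.linalg.norm(x, axis=axis, keepdims=True) + eps)

def prompt_score_map(pixel_feats, prompt):
    """Cosine similarity between each pixel feature (H, W, D) and a prompt (D,)."""
    return l2_normalize(pixel_feats) @ l2_normalize(prompt)

def select_pixel_nodes(pixel_feats, score_map, k):
    """Keep the k pixel features with the highest prompt score (assumed rule)."""
    d = pixel_feats.shape[-1]
    top = np.argsort(score_map.reshape(-1))[-k:]
    return pixel_feats.reshape(-1, d)[top], top

def graph_conv(nodes, weight, eps=1e-8):
    """One graph-convolution step, ReLU(A_hat @ X @ W), where A_hat is a
    row-normalized, non-negative cosine-similarity adjacency (assumed)."""
    normed = l2_normalize(nodes)
    adj = np.maximum(normed @ normed.T, 0.0)  # diagonal acts as self-loops
    adj = adj / (adj.sum(axis=1, keepdims=True) + eps)
    return np.maximum(adj @ nodes @ weight, 0.0)

# Stand-in data: random arrays in place of real CLIP features.
rng = np.random.default_rng(0)
H, W, D, C, K = 8, 8, 16, 3, 10
pixel_feats = rng.normal(size=(H, W, D))     # pixel-level image features
context_prompt = rng.normal(size=D)          # context-aware prompt embedding
category_prompts = rng.normal(size=(C, D))   # category-level prompt nodes

score = prompt_score_map(pixel_feats, context_prompt)
enhanced = pixel_feats * (1.0 + score[..., None])  # assumed enhancement rule
pixel_nodes, idx = select_pixel_nodes(pixel_feats, score, K)

# Graph nodes: category-level prompt nodes followed by pixel-level nodes.
nodes = np.concatenate([category_prompts, pixel_nodes], axis=0)
updated = graph_conv(nodes, rng.normal(size=(D, D)) * 0.1)
print(score.shape, enhanced.shape, nodes.shape, updated.shape)
```

In this sketch the category-level nodes share one graph with the selected pixel nodes, so each pixel node's update mixes in prompt information in proportion to its similarity to each category node. A real system would use learned weights and the detector's multi-scale feature maps in place of the random arrays.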
Cite
Text
Li et al. "Self-Prompting Analogical Reasoning for UAV Object Detection." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I17.34026
Markdown
[Li et al. "Self-Prompting Analogical Reasoning for UAV Object Detection." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/li2025aaai-self/) doi:10.1609/AAAI.V39I17.34026
BibTeX
@inproceedings{li2025aaai-self,
title = {{Self-Prompting Analogical Reasoning for UAV Object Detection}},
author = {Li, Nianxin and Ye, Mao and Zhou, Lihua and Tang, Song and Gan, Yan and Liang, Zizhuo and Zhu, Xiatian},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2025},
pages = {18412-18420},
doi = {10.1609/AAAI.V39I17.34026},
url = {https://mlanthology.org/aaai/2025/li2025aaai-self/}
}