Scattering Prompt Tuning: A Fine-Tuned Foundation Model for SAR Object Recognition
Abstract
Synthetic Aperture Radar (SAR) serves as a vital tool in various earth observation applications, providing robust imaging under challenging weather conditions. While the fine-tuned foundation models excel in many downstream tasks, they struggle with SAR object recognition because of SAR's unique imaging and scattering characteristics. In this study, we propose a novel approach named Scattering Prompt Tuning (SPT) based vision foundation model. It uses SAR image scattering information as a prompt and integrates learnable parameters into the pre-trained model's input space to help learn SAR's unique information. We also employ a lightweight Residual AdapterMLP for fine-tuning, design a Sequential Feature Aggregation (SFA) to selectively fuse features from different transformer blocks effectively, and develop a Dynamic Distributional Contrast loss (DCLoss) to maintain the proper distance between different objects in feature space. Additionally, a four-stage training strategy, incorporating semi-supervised learning, is deployed to enhance SAR object recognition performance further Our approach reaches a Top-1 accuracy of 37.9% and an AUROC of 0.83 on the final dataset, winning the first place in the SAR Classification track of PBVS 2024 Multi-modal Aerial View Object Classification Challenge, which is better than the latest advanced fine-tuned foundation models.
Cite
Text
Guo et al. "Scattering Prompt Tuning: A Fine-Tuned Foundation Model for SAR Object Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00311Markdown
[Guo et al. "Scattering Prompt Tuning: A Fine-Tuned Foundation Model for SAR Object Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/guo2024cvprw-scattering/) doi:10.1109/CVPRW63382.2024.00311BibTeX
@inproceedings{guo2024cvprw-scattering,
title = {{Scattering Prompt Tuning: A Fine-Tuned Foundation Model for SAR Object Recognition}},
author = {Guo, Weilong and Li, Shengyang and Yang, Jian},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2024},
pages = {3056-3065},
doi = {10.1109/CVPRW63382.2024.00311},
url = {https://mlanthology.org/cvprw/2024/guo2024cvprw-scattering/}
}