Active Transferability Estimation
Abstract
As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine-tuning. Inspired by active learning techniques, we propose ACT (ACtive Transferability), a new strategy to improve the performance of transferability estimation methods, by leveraging an informative subset of the target data. By leveraging the model’s internal and output representations, we introduce two techniques – class-agnostic and class-aware – to identify informative subsets and show that ACT can be applied to any existing transferability metric to improve their performance and reliability. Our experimental results across multiple source model architectures, target datasets, and transfer learning tasks show that ACT metrics are consistently better or on par with the state-of-the-art transferability metrics.
Cite
Text
Menta et al. "Active Transferability Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024. doi:10.1109/CVPRW63382.2024.00272Markdown
[Menta et al. "Active Transferability Estimation." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2024.](https://mlanthology.org/cvprw/2024/menta2024cvprw-active/) doi:10.1109/CVPRW63382.2024.00272BibTeX
@inproceedings{menta2024cvprw-active,
title = {{Active Transferability Estimation}},
author = {Menta, Tarun Ram and Jandial, Surgan and Patil, Akash and Bachu, Saketh and Vimal, K. B. and Krishnamurthy, Balaji and Balasubramanian, Vineeth N. and Sarkar, Mausoom and Agarwal, Chirag},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2024},
pages = {2659-2670},
doi = {10.1109/CVPRW63382.2024.00272},
url = {https://mlanthology.org/cvprw/2024/menta2024cvprw-active/}
}