Which Model to Transfer? Finding the Needle in the Growing Haystack
Abstract
Transfer learning has been recently popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks where it provides a remarkably solid baseline. The emergence of rich model repositories, such as TensorFlow Hub, enables the practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these repositories keep growing exponentially, efficiently selecting a good model for the task at hand becomes paramount. We provide a formalization of this problem through a familiar notion of regret and introduce the predominant strategies, namely task-agnostic (e.g. ranking models by their ImageNet performance) and task-aware search strategies (such as linear or kNN evaluation). We conduct a large-scale empirical study and show that both task-agnostic and task-aware methods can yield high regret. We then propose a simple and computationally efficient hybrid search strategy which outperforms the existing approaches. We highlight the practical benefits of the proposed solution on a set of 19 diverse vision tasks.
Cite
Text
Renggli et al. "Which Model to Transfer? Finding the Needle in the Growing Haystack." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00899Markdown
[Renggli et al. "Which Model to Transfer? Finding the Needle in the Growing Haystack." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/renggli2022cvpr-model/) doi:10.1109/CVPR52688.2022.00899BibTeX
@inproceedings{renggli2022cvpr-model,
title = {{Which Model to Transfer? Finding the Needle in the Growing Haystack}},
author = {Renggli, Cedric and Pinto, André Susano and Rimanic, Luka and Puigcerver, Joan and Riquelme, Carlos and Zhang, Ce and Lučić, Mario},
booktitle = {Conference on Computer Vision and Pattern Recognition},
year = {2022},
pages = {9205-9214},
doi = {10.1109/CVPR52688.2022.00899},
url = {https://mlanthology.org/cvpr/2022/renggli2022cvpr-model/}
}