Task Adaptive Parameter Sharing for Multi-Task Learning

Matthew Wallingford, Hao Li, Alessandro Achille, Avinash Ravichandran, Charless Fowlkes, Rahul Bhotika, Stefano Soatto

CVPR 2022 pp. 7561-7570

doi:10.1109/CVPR52688.2022.00741 /cvpr/2022/wallingford2022cvpr-task/

Abstract

Adapting pre-trained models with broad capabilities has become standard practice for learning a wide range of downstream tasks. The typical approach of fine-tuning different models for each task is performant, but incurs a substantial memory cost. To efficiently learn multiple downstream tasks we introduce Task Adaptive Parameter Sharing (TAPS), a simple method for tuning a base model to a new task by adaptively modifying a small, task-specific subset of layers. This enables multi-task learning while minimizing the resources used and avoids catastrophic forgetting and competition between tasks. TAPS solves a joint optimization problem which determines both the layers that are shared with the base model and the value of the task-specific weights. Further, a sparsity penalty on the number of active layers promotes weight sharing with the base model. Compared to other methods, TAPS retains a high accuracy on the target tasks while still introducing only a small number of task-specific parameters. Moreover, TAPS is agnostic to the particular architecture used and requires only minor changes to the training scheme. We evaluate our method on a suite of fine-tuning tasks and architectures (ResNet, DenseNet, ViT) and show that it achieves state-of-the-art performance while being simple to implement.

PDF CVPR Semantic Scholar

Cite

Text

Wallingford et al. "Task Adaptive Parameter Sharing for Multi-Task Learning." Conference on Computer Vision and Pattern Recognition, 2022. doi:10.1109/CVPR52688.2022.00741

Markdown

[Wallingford et al. "Task Adaptive Parameter Sharing for Multi-Task Learning." Conference on Computer Vision and Pattern Recognition, 2022.](https://mlanthology.org/cvpr/2022/wallingford2022cvpr-task/) doi:10.1109/CVPR52688.2022.00741

BibTeX

@inproceedings{wallingford2022cvpr-task,
  title     = {{Task Adaptive Parameter Sharing for Multi-Task Learning}},
  author    = {Wallingford, Matthew and Li, Hao and Achille, Alessandro and Ravichandran, Avinash and Fowlkes, Charless and Bhotika, Rahul and Soatto, Stefano},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2022},
  pages     = {7561-7570},
  doi       = {10.1109/CVPR52688.2022.00741},
  url       = {https://mlanthology.org/cvpr/2022/wallingford2022cvpr-task/}
}