Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Yu, Yu-Chu; Huang, Chi-Pin; Chen, Jr-Jen; Chang, Kai-Po; Lai, Yung-Hsuan; Yang, Fu-En; Wang, Yu-Chiang Frank

doi:10.1007/978-3-031-73347-5_13

Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models

Yu-Chu Yu, Chi-Pin Huang, Jr-Jen Chen, Kai-Po Chang, Yung-Hsuan Lai, Fu-En Yang, Yu-Chiang Frank Wang

ECCV 2024

doi:10.1007/978-3-031-73347-5_13 /eccv/2024/yu2024eccv-select/

Abstract

Large-scale vision-language models (VLMs) have shown a strong zero-shot generalization capability on unseen-domain data. However, adapting pre-trained VLMs to a sequence of downstream tasks often leads to the forgetting of previously learned knowledge and a reduction in zero-shot classification performance. To tackle this problem, we propose a unique Selective Dual-Teacher Knowledge Transfer framework that leverages the most recent fine-tuned and the original pre-trained VLMs as dual teachers to preserve the previously learned knowledge and zero-shot capabilities, respectively. With only access to an unlabeled reference dataset, our proposed framework performs a selective knowledge distillation mechanism by measuring the feature discrepancy from the dual-teacher VLMs. Consequently, our selective dual-teacher knowledge distillation mitigates catastrophic forgetting of previously learned knowledge while preserving the zero-shot capabilities of pre-trained VLMs. Extensive experiments on benchmark datasets demonstrate that our framework is favorable against state-of-the-art continual learning approaches for preventing catastrophic forgetting and zero-shot degradation. Project page: https://chuyu.org/research/snd.

PDF ECCV Semantic Scholar

Cite

Text

Yu et al. "Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models." Proceedings of the European Conference on Computer Vision (ECCV), 2024. doi:10.1007/978-3-031-73347-5_13

Markdown

[Yu et al. "Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models." Proceedings of the European Conference on Computer Vision (ECCV), 2024.](https://mlanthology.org/eccv/2024/yu2024eccv-select/) doi:10.1007/978-3-031-73347-5_13

BibTeX

@inproceedings{yu2024eccv-select,
  title     = {{Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models}},
  author    = {Yu, Yu-Chu and Huang, Chi-Pin and Chen, Jr-Jen and Chang, Kai-Po and Lai, Yung-Hsuan and Yang, Fu-En and Wang, Yu-Chiang Frank},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2024},
  doi       = {10.1007/978-3-031-73347-5_13},
  url       = {https://mlanthology.org/eccv/2024/yu2024eccv-select/}
}