Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition

Yang, Chuanguang; Yu, Xinqiang; Yang, Han; An, Zhulin; Yu, Chengqing; Huang, Libo; Xu, Yongjun

doi:10.1609/AAAI.V39I9.32990

Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition

Chuanguang Yang, Xinqiang Yu, Han Yang, Zhulin An, Chengqing Yu, Libo Huang, Yongjun Xu

AAAI 2025 pp. 9148-9156

doi:10.1609/AAAI.V39I9.32990 /aaai/2025/yang2025aaai-multi/

Abstract

Multi-teacher Knowledge Distillation (KD) transfers diverse knowledge from a teacher pool to a student network. The core problem of multi-teacher KD is how to balance distillation strengths among various teachers. Most existing methods often develop weighting strategies from an individual perspective of teacher performance or teacher-student gaps, lacking comprehensive information for guidance. This paper proposes Multi-Teacher Knowledge Distillation with Reinforcement Learning (MTKD-RL) to optimize multi-teacher weights. In this framework, we construct both teacher performance and teacher-student gaps as state information to an agent. The agent outputs the teacher weight and can be updated by the return reward from the student. MTKD-RL reinforces the interaction between the student and teacher using an agent in an RL-based decision mechanism, achieving better matching capability with more meaningful weights. Experimental results on visual recognition tasks, including image classification, object detection, and semantic segmentation tasks, demonstrate that MTKD-RL achieves state-of-the-art performance compared to the existing multi-teacher KD works.

PDF AAAI Semantic Scholar

Cite

Text

Yang et al. "Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I9.32990

Markdown

[Yang et al. "Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/yang2025aaai-multi/) doi:10.1609/AAAI.V39I9.32990

BibTeX

@inproceedings{yang2025aaai-multi,
  title     = {{Multi-Teacher Knowledge Distillation with Reinforcement Learning for Visual Recognition}},
  author    = {Yang, Chuanguang and Yu, Xinqiang and Yang, Han and An, Zhulin and Yu, Chengqing and Huang, Libo and Xu, Yongjun},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {9148-9156},
  doi       = {10.1609/AAAI.V39I9.32990},
  url       = {https://mlanthology.org/aaai/2025/yang2025aaai-multi/}
}