Make Continual Learning Stronger via C-Flat

Bian, Ang; Li, Wei; Yuan, Hangjie; Yu, Chengrong; Wang, Mang; Zhao, Zixiang; Lu, Aojun; Ji, Pengliang; Feng, Tao

doi:10.52202/079017-0244

Make Continual Learning Stronger via C-Flat

Ang Bian, Wei Li, Hangjie Yuan, Chengrong Yu, Mang Wang, Zixiang Zhao, Aojun Lu, Pengliang Ji, Tao Feng

NeurIPS 2024

doi:10.52202/079017-0244 /neurips/2024/bian2024neurips-make/

Abstract

How to balance the learning ’sensitivity-stability’ upon new task training and memory preserving is critical in CL to resolve catastrophic forgetting. Improving model generalization ability within each learning phase is one solution to help CL learning overcome the gap in the joint knowledge space. Zeroth-order loss landscape sharpness-aware minimization is a strong training regime improving model generalization in transfer learning compared with optimizer like SGD. It has also been introduced into CL to improve memory representation or learning efficiency. However, zeroth-order sharpness alone could favors sharper over flatter minima in certain scenarios, leading to a rather sensitive minima rather than a global optima. To further enhance learning stability, we propose a Continual Flatness (C-Flat) method featuring a flatter loss landscape tailored for CL. C-Flat could be easily called with only one line of code and is plug-and-play to any CL methods. A general framework of C-Flat applied to all CL categories and a thorough comparison with loss minima optimizer and flat minima based CL approaches is presented in this paper, showing that our method can boost CL performance in almost all cases. Code is available at https://github.com/WanNaa/C-Flat.

PDF NeurIPS OpenReview Semantic Scholar

Cite

Text

Bian et al. "Make Continual Learning Stronger via C-Flat." Neural Information Processing Systems, 2024. doi:10.52202/079017-0244

Markdown

[Bian et al. "Make Continual Learning Stronger via C-Flat." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/bian2024neurips-make/) doi:10.52202/079017-0244

BibTeX

@inproceedings{bian2024neurips-make,
  title     = {{Make Continual Learning Stronger via C-Flat}},
  author    = {Bian, Ang and Li, Wei and Yuan, Hangjie and Yu, Chengrong and Wang, Mang and Zhao, Zixiang and Lu, Aojun and Ji, Pengliang and Feng, Tao},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-0244},
  url       = {https://mlanthology.org/neurips/2024/bian2024neurips-make/}
}