Understanding the Forgetting of (Replay-Based) Continual Learning via Feature Learning: Angle Matters

Abstract

Continual learning (CL) is crucial for advancing human-level intelligence, but its theoretical understanding, especially regarding factors influencing forgetting, is still relatively limited. This work aims to build a unified theoretical framework for understanding CL using feature learning theory. Different from most existing studies that analyze forgetting under linear regression model or lazy training, we focus on a more practical two-layer convolutional neural network (CNN) with polynomial ReLU activation for sequential tasks within a signal-noise data model. Specifically, we theoretically reveal how the angle between task signal vectors influences forgetting that: acute or small obtuse angles lead to benign forgetting, whereas larger obtuse angles result in harmful forgetting. Furthermore, we demonstrate that the replay method alleviates forgetting by expanding the range of angles corresponding to benign forgetting. Our theoretical results suggest that mid-angle sampling, which selects examples with moderate angles to the prototype, can enhance the replay method’s ability to mitigate forgetting. Experiments on synthetic and real-world datasets confirm our theoretical results and highlight the effectiveness of our mid-angle sampling strategy.

Cite

Text

Wan et al. "Understanding the Forgetting of (Replay-Based) Continual Learning via Feature Learning: Angle Matters." Proceedings of the 42nd International Conference on Machine Learning, 2025.

Markdown

[Wan et al. "Understanding the Forgetting of (Replay-Based) Continual Learning via Feature Learning: Angle Matters." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/wan2025icml-understanding/)

BibTeX

@inproceedings{wan2025icml-understanding,
  title     = {{Understanding the Forgetting of (Replay-Based) Continual Learning via Feature Learning: Angle Matters}},
  author    = {Wan, Hongyi and Ren, Shiyuan and Huang, Wei and Zhang, Miao and Deng, Xiang and Bao, Yixin and Nie, Liqiang},
  booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
  year      = {2025},
  pages     = {61956-62019},
  volume    = {267},
  url       = {https://mlanthology.org/icml/2025/wan2025icml-understanding/}
}