Quantized Gradient Projection for Memory-Efficient Continual Learning

Abstract

Real-world deployment of machine learning models requires the ability to continually learn from non-stationary data while preserving prior knowledge and user privacy. Therefore, storing knowledge acquired from past data in a resource- and privacy-friendly manner is a crucial consideration in determining their viability. We introduce Quantized Gradient Projection Memory (QGPM), a systematic framework for continual learning that compresses and preserves the previous gradient subspace. QGPM integrates three key components: (i) distribution-aware, basis-wise quantization to minimize storage overhead, (ii) a Quantization Error-Aware (QEA) gradient projection that selectively relaxes orthogonality to mitigate gradient drift caused by accumulated quantization noise, and (iii) an on-the-fly sparse sketching strategy that improves runtime memory and computational efficiency. Experiments across multiple benchmarks demonstrate that QGPM achieves state-of-the-art performance under fixed memory budgets, highlighting its effectiveness in scalable, privacy-preserving continual learning.

Cite

Text

Kim et al. "Quantized Gradient Projection for Memory-Efficient Continual Learning." International Conference on Learning Representations, 2026.

Markdown

[Kim et al. "Quantized Gradient Projection for Memory-Efficient Continual Learning." International Conference on Learning Representations, 2026.](https://mlanthology.org/iclr/2026/kim2026iclr-quantized/)

BibTeX

@inproceedings{kim2026iclr-quantized,
  title     = {{Quantized Gradient Projection for Memory-Efficient Continual Learning}},
  author    = {Kim, Dongjun and Cha, Seohyeon and Chen, Huancheng and Wang, Chianing and Vikalo, Haris},
  booktitle = {International Conference on Learning Representations},
  year      = {2026},
  url       = {https://mlanthology.org/iclr/2026/kim2026iclr-quantized/}
}