Dars : Robust Sparse Fine-Tuning with Regularized Subspace Disalignment

Abstract

Recent works have identified the alignment, which measures a layerwise weight correlation, as a novel yet crucial mechanism for feature learning. We investigate an underlying connection between the alignment learning and the structural fitting of a network to the training data span. Based on this insight, we further demonstrate that fine-tuning on out-of-distribution (OOD) data disrupts this well-aligned structure fitted during the pre-training phase, degrading generalization performance. To address this, we propose DARS, DisAlignment-Regularized Sparse fine-tuning, a novel sparse fine-tuning approach that mitigates disalignment by letting the gradient update to be partially constrained within the principal subspace of the pre-trained network, constructed based on the in-distribution (ID) data used for its pre-training. Specifically, we define the two disjoint subsets of trainable parameters for sparse channel unfreezing: i) a random subset and ii) a subset with higher gradient projections onto the principal subspace. The latter serves as a disalignment regularizer during fine-tuning, while the random subset ensures a minimal bias in parameter selection. By adjusting the ratio between the two subsets, we can control the strength of subspace regularization, thereby balancing the trade-off between generalization capacity and strong fitting to new downstream tasks. By employing DARS, we achieved SOTA performance on various benchmarks, including commonsense and arithmetic reasoning tasks, across LLaMA-7B and LLaMA2-7B.

Cite

Text

Park and Park. "Dars : Robust Sparse Fine-Tuning with Regularized Subspace Disalignment." ICLR 2025 Workshops: SCOPE, 2025.

Markdown

[Park and Park. "Dars : Robust Sparse Fine-Tuning with Regularized Subspace Disalignment." ICLR 2025 Workshops: SCOPE, 2025.](https://mlanthology.org/iclrw/2025/park2025iclrw-dars/)

BibTeX

@inproceedings{park2025iclrw-dars,
  title     = {{Dars : Robust Sparse Fine-Tuning with Regularized Subspace Disalignment}},
  author    = {Park, Sumin and Park, Noseong},
  booktitle = {ICLR 2025 Workshops: SCOPE},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/park2025iclrw-dars/}
}