Less Is More: Efficient Model Merging with Binary Task Switch

Abstract

As an effective approach to equip models with multi-task capabilities without additional training, model merging has garnered significant attention. However, existing merging methods face challenges of redundant parameter conflicts and the excessive storage burden of fine-tuned parameters. In this work, through controlled experiments, we reveal that for fine-tuned task vectors, only those parameters with magnitudes above a certain threshold contribute positively to the task, exhibiting a pulse-like characteristic. We then attempt leveraging this pulse-like characteristic to binarize the task vectors and reduce storage overhead. Further controlled experiments show that the binarized task vectors incur almost no decrease in fine-tuning and merging performance, and even exhibit stronger performance improvements as the proportion of redundant parameters increases. Based on these insights, we propose Task Switch (T-Switch), which decomposes task vectors into three components: 1) an activation switch instantiated by a binarized mask vector, 2) a polarity switch instantiated by a binarized sign vector, and 3) a scaling knob instantiated by a scalar coefficient. By storing task vectors in a binarized form, T-Switch alleviates parameter conflicts while ensuring efficient task parameter storage. Furthermore, to enable automated switch combination in T-Switch, we further introduce Auto-Switch, which enables training-free switch combination via retrieval from a small query set. Experiments indicate that our methods achieve significant performance improvements over existing baselines, requiring only 1-3% of the storage space of full-precision parameters.

Cite

Text

Qi et al. "Less Is More: Efficient Model Merging with Binary Task Switch." Conference on Computer Vision and Pattern Recognition, 2025. doi:10.1109/CVPR52734.2025.01422

Markdown

[Qi et al. "Less Is More: Efficient Model Merging with Binary Task Switch." Conference on Computer Vision and Pattern Recognition, 2025.](https://mlanthology.org/cvpr/2025/qi2025cvpr-less/) doi:10.1109/CVPR52734.2025.01422

BibTeX

@inproceedings{qi2025cvpr-less,
  title     = {{Less Is More: Efficient Model Merging with Binary Task Switch}},
  author    = {Qi, Biqing and Li, Fangyuan and Wang, Zhen and Gao, Junqi and Li, Dong and Ye, Peng and Zhou, Bowen},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2025},
  pages     = {15265-15274},
  doi       = {10.1109/CVPR52734.2025.01422},
  url       = {https://mlanthology.org/cvpr/2025/qi2025cvpr-less/}
}