Semi-Ensemble: A Simple Approach Over-Parameterize Model Interpolation

Abstract

We develop a unified framework for interpolating two models with various degrees of over-parameterization, having model merging and model ensemble as special cases. Instead of directly interpolating models in their original parameter space, the proposed Semi-Ensemble interpolates the over-parameterized versions of the models in a higher-dimensional joint parameter space. Here, the over-parameterizations recover each endpoint model when projected to some low-dimensional subspace spanned by a fraction of bases. By carefully constructing the joint parameter space, the interpolated model can achieve a smooth tradeoff between the total number of parameters and the model accuracy, outperforming existing baselines. Intriguingly, we show that Semi-ensembles can sometimes achieve a better performance than vanilla ensembles, even with a slightly smaller number of parameters.

Cite

Text

Lee and Lee. "Semi-Ensemble: A Simple Approach Over-Parameterize Model Interpolation." NeurIPS 2023 Workshops: UniReps, 2023.

Markdown

[Lee and Lee. "Semi-Ensemble: A Simple Approach Over-Parameterize Model Interpolation." NeurIPS 2023 Workshops: UniReps, 2023.](https://mlanthology.org/neuripsw/2023/lee2023neuripsw-semiensemble/)

BibTeX

@inproceedings{lee2023neuripsw-semiensemble,
  title     = {{Semi-Ensemble: A Simple Approach Over-Parameterize Model Interpolation}},
  author    = {Lee, Jiwoon and Lee, Jaeho},
  booktitle = {NeurIPS 2023 Workshops: UniReps},
  year      = {2023},
  url       = {https://mlanthology.org/neuripsw/2023/lee2023neuripsw-semiensemble/}
}