Model Fusion via Neuron Transplantation

Abstract

Ensemble learning is a widespread technique to improve the prediction performance of neural networks. However, it comes at the price of increased memory and inference time. In this work we propose a novel model fusion technique called \emph{Neuron Transplantation (NT)} in which we fuse an ensemble of models by transplanting important neurons from all ensemble members into the vacant space obtained by pruning insignificant neurons. An initial loss in performance post-transplantation can be quickly recovered via fine-tuning, consistently outperforming individual ensemble members of the same model capacity and architecture. Furthermore, NT enables all the ensemble members to be jointly pruned and jointly trained in a combined model. Comparing it to alignment-based averaging (like Optimal-Transport-fusion), it requires less fine-tuning than the corresponding OT-fused model, the fusion itself is faster and requires less memory, while the resulting model performance is comparable or better. The code is available under the following link: https://github.com/masterbaer/neuron-transplantation.

Cite

Text

Öz et al. "Model Fusion via Neuron Transplantation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024. doi:10.1007/978-3-031-70359-1_1

Markdown

[Öz et al. "Model Fusion via Neuron Transplantation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2024.](https://mlanthology.org/ecmlpkdd/2024/oz2024ecmlpkdd-model/) doi:10.1007/978-3-031-70359-1_1

BibTeX

@inproceedings{oz2024ecmlpkdd-model,
  title     = {{Model Fusion via Neuron Transplantation}},
  author    = {Öz, Muhammed and Kiefer, Nicholas and Debus, Charlotte and Hörter, Jasmin and Streit, Achim and Götz, Markus},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2024},
  pages     = {3-19},
  doi       = {10.1007/978-3-031-70359-1_1},
  url       = {https://mlanthology.org/ecmlpkdd/2024/oz2024ecmlpkdd-model/}
}