Equivariant Architectures for Learning in Deep Weight Spaces
Abstract
Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very challenging. If successful, such architectures would be capable of performing a wide range of intriguing tasks, from adapting a pre-trained network to a new domain to editing objects represented as functions (INRs or NeRFs). As a first step towards this goal, we present here a novel network architecture for learning in deep weight spaces. It takes as input a concatenation of weights and biases of a pre-trained MLP and processes it using a composition of layers that are equivariant to the natural permutation symmetry of the MLP’s weights: Changing the order of neurons in intermediate layers of the MLP does not affect the function it represents. We provide a full characterization of all affine equivariant and invariant layers for these symmetries and show how these layers can be implemented using three basic operations: pooling, broadcasting, and fully connected layers applied to the input in an appropriate manner. We demonstrate the effectiveness of our architecture and its advantages over natural baselines in a variety of learning tasks.
Cite
Text
Navon et al. "Equivariant Architectures for Learning in Deep Weight Spaces." International Conference on Machine Learning, 2023.Markdown
[Navon et al. "Equivariant Architectures for Learning in Deep Weight Spaces." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/navon2023icml-equivariant/)BibTeX
@inproceedings{navon2023icml-equivariant,
title = {{Equivariant Architectures for Learning in Deep Weight Spaces}},
author = {Navon, Aviv and Shamsian, Aviv and Achituve, Idan and Fetaya, Ethan and Chechik, Gal and Maron, Haggai},
booktitle = {International Conference on Machine Learning},
year = {2023},
pages = {25790-25816},
volume = {202},
url = {https://mlanthology.org/icml/2023/navon2023icml-equivariant/}
}