Equivariant Architectures for Learning in Deep Weight Spaces

Abstract

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very challenging. If successful, such architectures would be capable of performing a wide range of intriguing tasks, from adapting a pre-trained network to a new domain to editing objects represented as functions (INRs or NeRFs). As a first step towards this goal, we present here a novel network architecture for learning in deep weight spaces. It takes as input a concatenation of weights and biases of a pre-trained MLP and processes it using a composition of layers that are equivariant to the natural permutation symmetry of the MLP’s weights: Changing the order of neurons in intermediate layers of the MLP does not affect the function it represents. We provide a full characterization of all affine equivariant and invariant layers for these symmetries and show how these layers can be implemented using three basic operations: pooling, broadcasting, and fully connected layers applied to the input in an appropriate manner. We demonstrate the effectiveness of our architecture and its advantages over natural baselines in a variety of learning tasks.

Cite

Text

Navon et al. "Equivariant Architectures for Learning in Deep Weight Spaces." International Conference on Machine Learning, 2023.

Markdown

[Navon et al. "Equivariant Architectures for Learning in Deep Weight Spaces." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/navon2023icml-equivariant/)

BibTeX

@inproceedings{navon2023icml-equivariant,
  title     = {{Equivariant Architectures for Learning in Deep Weight Spaces}},
  author    = {Navon, Aviv and Shamsian, Aviv and Achituve, Idan and Fetaya, Ethan and Chechik, Gal and Maron, Haggai},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {25790-25816},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/navon2023icml-equivariant/}
}