GradMetaNet: An Equivariant Architecture for Learning on Gradients

Abstract

Gradients of neural networks encode valuable information for optimization, editing, and analysis of models. Therefore, practitioners often treat gradients as inputs to task-specific algorithms, e.g., using gradient statistics for pruning or optimization. Recent works explore *learning* algorithms that operate directly on gradients but use architectures that are not specifically designed for gradient processing, hindering their applicability. In this paper, we present a principled approach for designing architectures that process gradients. Our approach is guided by three principles: (1) equivariant design that preserves neuron permutation symmetries, (2) processing sets of gradients across multiple data points to capture curvature information, and (3) efficient gradient representation through rank-1 decomposition. Based on these principles, we introduce GradMetaNet, a novel architecture for learning on gradients, constructed from simple equivariant blocks. We prove universality results for GradMetaNet, and show that previous approaches cannot approximate natural gradient-based functions that GradMetaNet can. We then demonstrate GradMetaNet's effectiveness on a diverse set of gradient-based tasks for *MLPs* and *transformers*, such as learned optimization, INR editing, and loss landscape curvature estimation.

Cite

Text

Gelberg et al. "GradMetaNet: An Equivariant Architecture for Learning on Gradients." Advances in Neural Information Processing Systems, 2025.

Markdown

[Gelberg et al. "GradMetaNet: An Equivariant Architecture for Learning on Gradients." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/gelberg2025neurips-gradmetanet/)

BibTeX

@inproceedings{gelberg2025neurips-gradmetanet,
  title     = {{GradMetaNet: An Equivariant Architecture for Learning on Gradients}},
  author    = {Gelberg, Yoav and Eitan, Yam and Navon, Aviv and Shamsian, Aviv and Putterman, Theo and Bronstein, Michael M. and Maron, Haggai},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/gelberg2025neurips-gradmetanet/}
}