Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions

Abstract

Multiplication layers are a key component in various influential neural network modules, including self-attention and hypernetwork layers. In this paper, we investigate the approximation capabilities of deep neural networks with intermediate neurons connected by simple multiplication operations. We consider two classes of target functions: generalized bandlimited functions, which are frequently used to model real-world signals with finite bandwidth, and Sobolev-Type balls, which are embedded in the Sobolev Space $\mathcal{W}^{r,2}$. Our results demonstrate that multiplicative neural networks can approximate these functions with significantly fewer layers and neurons compared to standard ReLU neural networks, with respect to both input dimension and approximation error. These findings suggest that multiplicative gates can outperform standard feed-forward layers and have potential for improving neural network design.

Cite

Text

Ben-Shaul et al. "Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions." Transactions on Machine Learning Research, 2023.

Markdown

[Ben-Shaul et al. "Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions." Transactions on Machine Learning Research, 2023.](https://mlanthology.org/tmlr/2023/benshaul2023tmlr-exploring/)

BibTeX

@article{benshaul2023tmlr-exploring,
  title     = {{Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions}},
  author    = {Ben-Shaul, Ido and Galanti, Tomer and Dekel, Shai},
  journal   = {Transactions on Machine Learning Research},
  year      = {2023},
  url       = {https://mlanthology.org/tmlr/2023/benshaul2023tmlr-exploring/}
}