Separable Operator Networks

Yu, Xinling; Hooten, Sean; Liu, Ziyue; Zhao, Yequan; Fiorentino, Marco; Van Vaerenbergh, Thomas; Zhang, Zheng

Separable Operator Networks

Xinling Yu, Sean Hooten, Ziyue Liu, Yequan Zhao, Marco Fiorentino, Thomas Van Vaerenbergh, Zheng Zhang

TMLR 2024

/tmlr/2024/yu2024tmlr-separable/

Abstract

Operator learning has become a powerful tool in machine learning for modeling complex physical systems governed by partial differential equations (PDEs). Although Deep Operator Networks (DeepONet) show promise, they require extensive data acquisition. Physics-informed DeepONets (PI-DeepONet) mitigate data scarcity but suffer from inefficient training processes. We introduce Separable Operator Networks (SepONet), a novel framework that significantly enhances the efficiency of physics-informed operator learning. SepONet uses independent trunk networks to learn basis functions separately for different coordinate axes, enabling faster and more memory-efficient training via forward-mode automatic differentiation. We provide a universal approximation theorem for SepONet proving the existence of a separable approximation to any nonlinear continuous operator. Then, we comprehensively benchmark its representational capacity and computational performance against PI-DeepONet. Our results demonstrate SepONet's superior performance across various nonlinear and inseparable PDEs, with SepONet's advantages increasing with problem complexity, dimension, and scale. For 1D time-dependent PDEs, SepONet achieves up to 112× faster training and 82× reduction in GPU memory usage compared to PI-DeepONet, while maintaining comparable accuracy. For the 2D time-dependent nonlinear diffusion equation, SepONet efficiently handles the complexity, achieving a 6.44% mean relative $\ell_{2}$ test error, while PI-DeepONet fails due to memory constraints. This work paves the way for extreme-scale learning of continuous mappings between infinite-dimensional function spaces. Open source code is available at https://github.com/HewlettPackard/separable-operator-networks.

PDF TMLR Code Semantic Scholar

Cite

Text

Yu et al. "Separable Operator Networks." Transactions on Machine Learning Research, 2024.

Markdown

[Yu et al. "Separable Operator Networks." Transactions on Machine Learning Research, 2024.](https://mlanthology.org/tmlr/2024/yu2024tmlr-separable/)

BibTeX

@article{yu2024tmlr-separable,
  title     = {{Separable Operator Networks}},
  author    = {Yu, Xinling and Hooten, Sean and Liu, Ziyue and Zhao, Yequan and Fiorentino, Marco and Van Vaerenbergh, Thomas and Zhang, Zheng},
  journal   = {Transactions on Machine Learning Research},
  year      = {2024},
  url       = {https://mlanthology.org/tmlr/2024/yu2024tmlr-separable/}
}