FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression

Abstract

Functional data play a pivotal role across science and engineering, yet their infinite-dimensional nature makes representation learning challenging. Conventional statistical models depend on pre-chosen basis expansions or kernels, limiting the flexibility of data-driven discovery, while many deep-learning pipelines treat functions as fixed-grid vectors, ignoring inherent continuity. In this paper, we introduce Functional Attention with a Mixture-of-Experts (FAME), an end-to-end, fully data-driven framework for function-on-function regression. FAME forms continuous attention by coupling a bidirectional neural controlled differential equation with MoE-driven vector fields to capture intra-functional continuity, and further fuses change to inter-functional dependencies via multi-head cross attention. Extensive experiments on synthetic and real-world functional regression benchmarks show that FAME achieves state-of-the-art accuracy and strong robustness to arbitrarily sampled discrete observations of functions.

Cite

Text

Gao et al. "FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression." Advances in Neural Information Processing Systems, 2025.

Markdown

[Gao et al. "FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/gao2025neurips-fame/)

BibTeX

@inproceedings{gao2025neurips-fame,
  title     = {{FAME: Adaptive Functional Attention with Expert Routing for Function-on-Function Regression}},
  author    = {Gao, Yifei and Chen, Yong and Zhang, Chen},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/gao2025neurips-fame/}
}