GAMformer: Exploring In-Context Learning for Generalized Additive Models

Abstract

Generalized Additive Models (GAMs) are widely recognized for their ability to create fully interpretable machine learning models for tabular data. Traditionally, training GAMs involves iterative learning algorithms, such as splines, boosted trees, or neural networks, which refine the additive components through repeated error reduction. In this paper, we introduce *GAMformer*, the first method to leverage in-context learning to estimate shape functions of a GAM in a single forward pass, representing a significant departure from the conventional iterative approaches to GAM fitting. Building on previous research applying in-context learning to tabular data, we exclusively use complex, synthetic data to train GAMformer, yet find it to extrapolate well to real-world data. Our experiments show that GAMformer performs on par with other leading GAMs across various classification benchmarks while generating highly interpretable shape functions. Source code is available under this [link](https://anonymous.4open.science/r/900265cb0d5e51efb612/README.md.).

Cite

Text

Mueller et al. "GAMformer: Exploring In-Context Learning for Generalized Additive Models." NeurIPS 2024 Workshops: InterpretableAI, 2024.

Markdown

[Mueller et al. "GAMformer: Exploring In-Context Learning for Generalized Additive Models." NeurIPS 2024 Workshops: InterpretableAI, 2024.](https://mlanthology.org/neuripsw/2024/mueller2024neuripsw-gamformer/)

BibTeX

@inproceedings{mueller2024neuripsw-gamformer,
  title     = {{GAMformer: Exploring In-Context Learning for Generalized Additive Models}},
  author    = {Mueller, Andreas C and Siems, Julien and Nori, Harsha and Caruana, Rich and Hutter, Frank},
  booktitle = {NeurIPS 2024 Workshops: InterpretableAI},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/mueller2024neuripsw-gamformer/}
}