GAMformer: Exploring In-Context Learning for Generalized Additive Models
Abstract
Generalized Additive Models (GAMs) are widely recognized for their ability to create fully interpretable machine learning models for tabular data. Traditionally, training GAMs involves iterative learning algorithms, such as splines, boosted trees, or neural networks, which refine the additive components through repeated error reduction. In this paper, we introduce GAMformer, the first method to leverage in-context learning to estimate shape functions of a GAM in a single forward pass, representing a significant departure from the conventional iterative approaches to GAM fitting. Building on previous research applying in-context learning to tabular data, we exclusively use complex, synthetic data to train GAMformer, yet find it to extrapolate well to real-world data. Our experiments show that GAMformer performs on par with other leading GAMs across various classification benchmarks while generating highly interpretable shape functions.
Cite
Text
Mueller et al. "GAMformer: Exploring In-Context Learning for Generalized Additive Models." NeurIPS 2024 Workshops: TRL, 2024.Markdown
[Mueller et al. "GAMformer: Exploring In-Context Learning for Generalized Additive Models." NeurIPS 2024 Workshops: TRL, 2024.](https://mlanthology.org/neuripsw/2024/mueller2024neuripsw-gamformer-a/)BibTeX
@inproceedings{mueller2024neuripsw-gamformer-a,
title = {{GAMformer: Exploring In-Context Learning for Generalized Additive Models}},
author = {Mueller, Andreas C and Siems, Julien and Nori, Harsha and Salinas, David and Zela, Arber and Caruana, Rich and Hutter, Frank},
booktitle = {NeurIPS 2024 Workshops: TRL},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/mueller2024neuripsw-gamformer-a/}
}