Invariant Random Forest: Tree-Based Model Solution for OOD Generalization

Abstract

Out-Of-Distribution (OOD) generalization is an essential topic in machine learning. However, recent research is only focusing on the corresponding methods for neural networks. This paper introduces a novel and effective solution for OOD generalization of decision tree models, named Invariant Decision Tree (IDT). IDT enforces a penalty term with regard to the unstable/varying behavior of a split across different environments during the growth of the tree. Its ensemble version, the Invariant Random Forest (IRF), is constructed. Our proposed method is motivated by a theoretical result under mild conditions, and validated by numerical tests with both synthetic and real datasets. The superior performance compared to non-OOD tree models implies that considering OOD generalization for tree models is absolutely necessary and should be given more attention.

Cite

Text

Liao et al. "Invariant Random Forest: Tree-Based Model Solution for OOD Generalization." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I12.29283

Markdown

[Liao et al. "Invariant Random Forest: Tree-Based Model Solution for OOD Generalization." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liao2024aaai-invariant/) doi:10.1609/AAAI.V38I12.29283

BibTeX

@inproceedings{liao2024aaai-invariant,
  title     = {{Invariant Random Forest: Tree-Based Model Solution for OOD Generalization}},
  author    = {Liao, Yufan and Wu, Qi and Yan, Xing},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {13772-13781},
  doi       = {10.1609/AAAI.V38I12.29283},
  url       = {https://mlanthology.org/aaai/2024/liao2024aaai-invariant/}
}