Invariant Random Forest: Tree-Based Model Solution for OOD Generalization
Abstract
Out-Of-Distribution (OOD) generalization is an essential topic in machine learning. However, recent research is only focusing on the corresponding methods for neural networks. This paper introduces a novel and effective solution for OOD generalization of decision tree models, named Invariant Decision Tree (IDT). IDT enforces a penalty term with regard to the unstable/varying behavior of a split across different environments during the growth of the tree. Its ensemble version, the Invariant Random Forest (IRF), is constructed. Our proposed method is motivated by a theoretical result under mild conditions, and validated by numerical tests with both synthetic and real datasets. The superior performance compared to non-OOD tree models implies that considering OOD generalization for tree models is absolutely necessary and should be given more attention.
Cite
Text
Liao et al. "Invariant Random Forest: Tree-Based Model Solution for OOD Generalization." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I12.29283Markdown
[Liao et al. "Invariant Random Forest: Tree-Based Model Solution for OOD Generalization." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/liao2024aaai-invariant/) doi:10.1609/AAAI.V38I12.29283BibTeX
@inproceedings{liao2024aaai-invariant,
title = {{Invariant Random Forest: Tree-Based Model Solution for OOD Generalization}},
author = {Liao, Yufan and Wu, Qi and Yan, Xing},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2024},
pages = {13772-13781},
doi = {10.1609/AAAI.V38I12.29283},
url = {https://mlanthology.org/aaai/2024/liao2024aaai-invariant/}
}