Trees with Attention for Set Prediction Tasks

Abstract

In many machine learning applications, each record represents a set of items. For example, when making predictions from medical records, the medications prescribed to a patient are a set whose size is not fixed and whose order is arbitrary. However, most machine learning algorithms are not designed to handle set structures and are limited to processing records of fixed size. Set-Tree, presented in this work, extends the support for sets to tree-based models, such as Random-Forest and Gradient-Boosting, by introducing an attention mechanism and set-compatible split criteria. We evaluate the new method empirically on a wide range of problems ranging from making predictions on sub-atomic particle jets to estimating the redshift of galaxies. The new method outperforms existing tree-based methods consistently and significantly. Moreover, it is competitive and often outperforms Deep Learning. We also discuss the theoretical properties of Set-Trees and explain how they enable item-level explainability.

Cite

Text

Hirsch and Gilad-Bachrach. "Trees with Attention for Set Prediction Tasks." International Conference on Machine Learning, 2021.

Markdown

[Hirsch and Gilad-Bachrach. "Trees with Attention for Set Prediction Tasks." International Conference on Machine Learning, 2021.](https://mlanthology.org/icml/2021/hirsch2021icml-trees/)

BibTeX

@inproceedings{hirsch2021icml-trees,
  title     = {{Trees with Attention for Set Prediction Tasks}},
  author    = {Hirsch, Roy and Gilad-Bachrach, Ran},
  booktitle = {International Conference on Machine Learning},
  year      = {2021},
  pages     = {4250-4261},
  volume    = {139},
  url       = {https://mlanthology.org/icml/2021/hirsch2021icml-trees/}
}