Fully-Dynamic Decision Trees

Abstract

We develop the first fully dynamic algorithm that maintains a decision tree over an arbitrary sequence of insertions and deletions of labeled examples. Given ε>0 our algorithm guarantees that, at every point in time, every node of the decision tree uses a split with Gini gain within an additive ε of the optimum. For real-valued features the algorithm has an amortized running time per insertion/deletion of O((d·log³n)/ε²), which improves to O((d·log²n)/ε) for binary or categorical features, while it uses space O(n·d), where n is the maximum number of examples at any point in time and d is the number of features. Our algorithm is nearly optimal, as we show that any algorithm with similar guarantees requires amortized running time Ω(d) and space Ω(n·d/polylog(nd)). We complement our theoretical results with an extensive experimental evaluation on real-world data, showing the effectiveness of our algorithm.

Cite

Text

Bressan et al. "Fully-Dynamic Decision Trees." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I6.25838

Markdown

[Bressan et al. "Fully-Dynamic Decision Trees." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/bressan2023aaai-fully/) doi:10.1609/AAAI.V37I6.25838

BibTeX

@inproceedings{bressan2023aaai-fully,
  title     = {{Fully-Dynamic Decision Trees}},
  author    = {Bressan, Marco and Damay, Gabriel and Sozio, Mauro},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {6842-6849},
  doi       = {10.1609/AAAI.V37I6.25838},
  url       = {https://mlanthology.org/aaai/2023/bressan2023aaai-fully/}
}