Adversarial Random Forests for Density Estimation and Generative Modeling

Abstract

We propose methods for density estimation and data synthesis using a novel form of unsupervised random forests. Inspired by generative adversarial networks, we implement a recursive procedure in which trees gradually learn structural properties of the data through alternating rounds of generation and discrimination. The method is provably consistent under minimal assumptions. Unlike classic tree-based alternatives, our approach provides smooth (un)conditional densities and allows for fully synthetic data generation. We achieve comparable or superior performance to state-of-the-art probabilistic circuits and deep learning models on various tabular data benchmarks while executing about two orders of magnitude faster on average. An accompanying $R$ package, $arf$, is available on $CRAN$.

Cite

Text

Watson et al. "Adversarial Random Forests for Density Estimation and Generative Modeling." Artificial Intelligence and Statistics, 2023.

Markdown

[Watson et al. "Adversarial Random Forests for Density Estimation and Generative Modeling." Artificial Intelligence and Statistics, 2023.](https://mlanthology.org/aistats/2023/watson2023aistats-adversarial/)

BibTeX

@inproceedings{watson2023aistats-adversarial,
  title     = {{Adversarial Random Forests for Density Estimation and Generative Modeling}},
  author    = {Watson, David S. and Blesch, Kristin and Kapar, Jan and Wright, Marvin N.},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2023},
  pages     = {5357-5375},
  volume    = {206},
  url       = {https://mlanthology.org/aistats/2023/watson2023aistats-adversarial/}
}