Random Forest Autoencoders for Guided Representation Learning

Abstract

Extensive research has produced robust methods for unsupervised data visualization. Yet supervised visualization—where expert labels guide representations—remains underexplored, as most supervised approaches prioritize classification over visualization. Recently, RF-PHATE, a diffusion-based manifold learning method leveraging random forests and information geometry, marked significant progress in supervised visualization. However, its lack of an explicit mapping function limits scalability and its application to unseen data, posing challenges for large datasets and label-scarce scenarios. To overcome these limitations, we introduce Random Forest Autoencoders (RF-AE), a neural network-based framework for out-of-sample kernel extension that combines the flexibility of autoencoders with the supervised learning strengths of random forests and the geometry captured by RF-PHATE. RF-AE enables efficient out-of-sample supervised visualization and outperforms existing methods, including RF-PHATE's standard kernel extension, in both accuracy and interpretability. Additionally, RF-AE is modality-agnostic, demonstrates strong robustness to hyperparameters, supports both classification and regression, and natively accommodates missing feature values. Our code is available at https://github.com/JakeSRhodesLab/RF-AE.

Cite

Text

Aumon et al. "Random Forest Autoencoders for Guided Representation Learning." Advances in Neural Information Processing Systems, 2025.

Markdown

[Aumon et al. "Random Forest Autoencoders for Guided Representation Learning." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/aumon2025neurips-random/)

BibTeX

@inproceedings{aumon2025neurips-random,
  title     = {{Random Forest Autoencoders for Guided Representation Learning}},
  author    = {Aumon, Adrien and Ni, Shuang and Lizotte, Myriam and Wolf, Guy and Moon, Kevin R. and Rhodes, Jake Slater},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/aumon2025neurips-random/}
}