A Transductive Forest for Anomaly Detection with Few Labels

Abstract

Extensive labeled training data for anomaly detection is enormously expensive and often unavailable in data-sensitive applications due to privacy constraints. We propose TransForest, a transductive forest for anomaly detection, in the semi-supervised setting where few labels are available. Guided by little label information, TransForest pushes classification boundaries toward sensitive areas where abnormal and normal points are located, increasing learning capacity. Empirically, TransForest is competitive with other unsupervised and semi-supervised representative detectors given a small number of labeled points. TransForest also offers a feature importance ranking consistent with the rankings provided by popular supervised forests on low-dimensional data sets. Our code is available at https://github.com/jzha968/transForest .

Cite

Text

Zhang et al. "A Transductive Forest for Anomaly Detection with Few Labels." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023. doi:10.1007/978-3-031-43412-9_17

Markdown

[Zhang et al. "A Transductive Forest for Anomaly Detection with Few Labels." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2023.](https://mlanthology.org/ecmlpkdd/2023/zhang2023ecmlpkdd-transductive/) doi:10.1007/978-3-031-43412-9_17

BibTeX

@inproceedings{zhang2023ecmlpkdd-transductive,
  title     = {{A Transductive Forest for Anomaly Detection with Few Labels}},
  author    = {Zhang, Jingrui and Pham, Ninh and Dobbie, Gillian},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2023},
  pages     = {286-301},
  doi       = {10.1007/978-3-031-43412-9_17},
  url       = {https://mlanthology.org/ecmlpkdd/2023/zhang2023ecmlpkdd-transductive/}
}