AutoML for Outlier Detection with Optimal Transport Distances

Abstract

Automated machine learning (AutoML) has been widely researched and adopted for supervised problems, but progress in unsupervised settings has been limited. We propose `"LOTUS", a novel framework to automate outlier detection based on meta-learning. Our premise is that the selection of the optimal outlier detection technique depends on the inherent properties of the data distribution. We leverage optimal transport to find the dataset with the most similar underlying distribution, and then apply the outlier detection techniques that proved to work best for that data distribution. We evaluate the robustness of our framework and find that it outperforms all state-of-the-art automated outlier detection tools. This approach can also be easily generalized to automate other unsupervised settings.

Cite

Text

Singh and Vanschoren. "AutoML for Outlier Detection with Optimal Transport Distances." International Joint Conference on Artificial Intelligence, 2023. doi:10.24963/IJCAI.2023/843

Markdown

[Singh and Vanschoren. "AutoML for Outlier Detection with Optimal Transport Distances." International Joint Conference on Artificial Intelligence, 2023.](https://mlanthology.org/ijcai/2023/singh2023ijcai-automl/) doi:10.24963/IJCAI.2023/843

BibTeX

@inproceedings{singh2023ijcai-automl,
  title     = {{AutoML for Outlier Detection with Optimal Transport Distances}},
  author    = {Singh, Prabhant and Vanschoren, Joaquin},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {7175-7178},
  doi       = {10.24963/IJCAI.2023/843},
  url       = {https://mlanthology.org/ijcai/2023/singh2023ijcai-automl/}
}