Beyond Outlier Detection: LookOut for Pictorial Explanation
Abstract
Why is a given point in a dataset marked as an outlier by an off-the-shelf detection algorithm? Which feature(s) explain it the best? What is the best way to convince a human analyst that the point is indeed an outlier? We provide succinct, interpretable, and simple pictorial explanations of outlying behavior in multi-dimensional real-valued datasets while respecting the limited attention of human analysts. Specifically, we propose to output a few focus-plots, i.e., pairwise feature plots, from a few, carefully chosen feature sub-spaces. The proposed LookOut makes four contributions: (a) problem formulation: we introduce an “analyst-centered” problem formulation for explaining outliers via focus-plots, (b) explanation algorithm: we propose a plot-selection objective and the LookOut algorithm to approximate it with optimality guarantees, (c) generality: our explanation algorithm is both domain- and detector-agnostic, and (d) scalability: LookOut scales linearly with the size of input outliers to explain and the explanation budget. Our experiments show that LookOut performs near-ideally in terms of maximizing explanation objective on several real datasets, while producing visually interpretable and intuitive results in explaining groundtruth outliers. Code related to this paper is available at: https://github.com/NikhilGupta1997/Lookout .
Cite
Text
Gupta et al. "Beyond Outlier Detection: LookOut for Pictorial Explanation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018. doi:10.1007/978-3-030-10925-7_8Markdown
[Gupta et al. "Beyond Outlier Detection: LookOut for Pictorial Explanation." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2018.](https://mlanthology.org/ecmlpkdd/2018/gupta2018ecmlpkdd-beyond/) doi:10.1007/978-3-030-10925-7_8BibTeX
@inproceedings{gupta2018ecmlpkdd-beyond,
title = {{Beyond Outlier Detection: LookOut for Pictorial Explanation}},
author = {Gupta, Nikhil and Eswaran, Dhivya and Shah, Neil and Akoglu, Leman and Faloutsos, Christos},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2018},
pages = {122-138},
doi = {10.1007/978-3-030-10925-7_8},
url = {https://mlanthology.org/ecmlpkdd/2018/gupta2018ecmlpkdd-beyond/}
}