Ranking Interesting Subgroups

Abstract

Subgroup discovery is the task of identifying the top k patterns in a database with most significant deviation in the distribution of a target attribute Y . Subgroup discovery is a popular approach for identifying interesting patterns in data, because it effectively combines statistical significance with an understandable representation of patterns as a logical formula. However, it is often a problem that some subgroups, even if they are statistically highly significant, are not interesting to the user for some reason. In this paper, we present an approach based on the work on ranking Support Vector Machines that ranks subgroups with respect to the user's concept of interestingness, and finds subgroups that are interesting to the user. It will be shown that this approach can significantly increase the quality of the subgroups.

Cite

Text

Rüping. "Ranking Interesting Subgroups." International Conference on Machine Learning, 2009. doi:10.1145/1553374.1553491

Markdown

[Rüping. "Ranking Interesting Subgroups." International Conference on Machine Learning, 2009.](https://mlanthology.org/icml/2009/ruping2009icml-ranking/) doi:10.1145/1553374.1553491

BibTeX

@inproceedings{ruping2009icml-ranking,
  title     = {{Ranking Interesting Subgroups}},
  author    = {Rüping, Stefan},
  booktitle = {International Conference on Machine Learning},
  year      = {2009},
  pages     = {913-920},
  doi       = {10.1145/1553374.1553491},
  url       = {https://mlanthology.org/icml/2009/ruping2009icml-ranking/}
}