Itemset Based Sequence Classification

Abstract

Sequence classification is an important task in data mining. We address the problem of sequence classification using rules composed of interesting itemsets found in a dataset of sequences with accompanying class labels. We measure the interestingness of an itemset in a given class of sequences by combining the cohesion and the support of the itemset. We use the discovered itemsets to generate confident classification rules, and present two different ways of building a classifier. The first classifier is based on the CBA (Classification Based on Associations) method, but uses a new ranking strategy for the generated rules, which achieves better results. The second classifier ranks the rules by first measuring their value specific to the new data object. Experimental results show that our classifiers outperform existing comparable classifiers in terms of accuracy and stability, while maintaining a computational advantage over sequential pattern based classification.
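
As a rough illustration of the interestingness measure mentioned above, the following Python sketch scores an itemset within one class of sequences. The abstract only says that cohesion and support are combined; the concrete definitions used here are assumptions made for illustration: support is the fraction of sequences containing every item, cohesion is the itemset size divided by the mean length of the shortest window covering the itemset, and the two are multiplied. The function names `shortest_window` and `interestingness` are hypothetical.

```python
from typing import Hashable, Iterable, Optional, Sequence


def shortest_window(seq: Sequence[Hashable], itemset: Iterable[Hashable]) -> Optional[int]:
    """Length of the shortest contiguous window of seq that contains every
    item of itemset, or None if some item never occurs."""
    need = set(itemset)
    if not need:
        return None
    counts = {}
    covered = 0  # number of distinct needed items currently inside the window
    best = None
    left = 0
    for right, item in enumerate(seq):
        if item in need:
            counts[item] = counts.get(item, 0) + 1
            if counts[item] == 1:
                covered += 1
        # Shrink from the left while the window still covers the itemset.
        while covered == len(need):
            width = right - left + 1
            if best is None or width < best:
                best = width
            out = seq[left]
            if out in need:
                counts[out] -= 1
                if counts[out] == 0:
                    covered -= 1
            left += 1
    return best


def interestingness(sequences: Sequence[Sequence[Hashable]], itemset: Iterable[Hashable]) -> float:
    """Assumed combination: support * cohesion (illustrative, not taken from the abstract)."""
    items = set(itemset)
    windows = [w for w in (shortest_window(s, items) for s in sequences) if w is not None]
    if not windows:
        return 0.0
    support = len(windows) / len(sequences)               # share of sequences containing the itemset
    cohesion = len(items) / (sum(windows) / len(windows))  # 1.0 when the items are always adjacent
    return support * cohesion


# Toy class of three sequences; the itemset {a, b} occurs in the first two.
toy_class = [list("abxc"), list("acb"), list("xyz")]
score = interestingness(toy_class, {"a", "b"})
```

Under these assumed definitions, an itemset scores highly only if it appears in many sequences of the class and its items tend to occur close together, which is the intuition the abstract describes.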

Cite

Text

Zhou et al. "Itemset Based Sequence Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2013. doi:10.1007/978-3-642-40988-2_23

Markdown

[Zhou et al. "Itemset Based Sequence Classification." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2013.](https://mlanthology.org/ecmlpkdd/2013/zhou2013ecmlpkdd-itemset/) doi:10.1007/978-3-642-40988-2_23

BibTeX

@inproceedings{zhou2013ecmlpkdd-itemset,
  title     = {{Itemset Based Sequence Classification}},
  author    = {Zhou, Cheng and Cule, Boris and Goethals, Bart},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2013},
  pages     = {353--368},
  doi       = {10.1007/978-3-642-40988-2_23},
  url       = {https://mlanthology.org/ecmlpkdd/2013/zhou2013ecmlpkdd-itemset/}
}