Semigeometric Tiling of Event Sequences
Abstract
Event sequences are ubiquitous, e.g., in finance, medicine, and social media. Often the same underlying phenomenon, such as television advertisements during Superbowl, is reflected in independent event sequences, like different Twitter users. It is hence of interest to find combinations of temporal segments and subsets of sequences where an event of interest, like a particular hashtag, has an increased occurrence probability. Such patterns allow exploration of the event sequences in terms of their evolving temporal dynamics, and provide more fine-grained insights to the data than what for example straightforward clustering can reveal. We formulate the task of finding such patterns as a novel matrix tiling problem, and propose two algorithms for solving it. Our first algorithm is a greedy set-cover heuristic, while in the second approach we view the problem as time-series segmentation. We apply the algorithms on real and artificial datasets and obtain promising results. The software related to this paper is available at https://github.com/bwrc/semigeom-r .
Cite
Text
Henelius et al. "Semigeometric Tiling of Event Sequences." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016. doi:10.1007/978-3-319-46128-1_21Markdown
[Henelius et al. "Semigeometric Tiling of Event Sequences." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2016.](https://mlanthology.org/ecmlpkdd/2016/henelius2016ecmlpkdd-semigeometric/) doi:10.1007/978-3-319-46128-1_21BibTeX
@inproceedings{henelius2016ecmlpkdd-semigeometric,
title = {{Semigeometric Tiling of Event Sequences}},
author = {Henelius, Andreas and Karlsson, Isak and Papapetrou, Panagiotis and Ukkonen, Antti and Puolamäki, Kai},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
year = {2016},
pages = {329-344},
doi = {10.1007/978-3-319-46128-1_21},
url = {https://mlanthology.org/ecmlpkdd/2016/henelius2016ecmlpkdd-semigeometric/}
}