Learning Process Models with Missing Data

Abstract

In this paper, we review the task of inductive process modeling, which uses domain knowledge to compose explanatory models of continuous dynamic systems. Next we discuss approaches to learning with missing values in time series, noting that these efforts are typically applied for descriptive modeling tasks that use little background knowledge. We also point out that these methods assume that data are missing at random—a condition that may not hold in scientific domains. Using experiments with synthetic and natural data, we compare an expectation maximization approach with one that simply ignores the missing data. Results indicate that expectation maximization leads to more accurate models in most cases, even though its basic assumptions are unmet. We conclude by discussing the implications of our findings along with directions for future work.

Cite

Text

Bridewell et al. "Learning Process Models with Missing Data." European Conference on Machine Learning, 2006. doi:10.1007/11871842_52

Markdown

[Bridewell et al. "Learning Process Models with Missing Data." European Conference on Machine Learning, 2006.](https://mlanthology.org/ecmlpkdd/2006/bridewell2006ecml-learning/) doi:10.1007/11871842_52

BibTeX

@inproceedings{bridewell2006ecml-learning,
  title     = {{Learning Process Models with Missing Data}},
  author    = {Bridewell, Will and Langley, Pat and Racunas, Steve and Borrett, Stuart R.},
  booktitle = {European Conference on Machine Learning},
  year      = {2006},
  pages     = {557-565},
  doi       = {10.1007/11871842_52},
  url       = {https://mlanthology.org/ecmlpkdd/2006/bridewell2006ecml-learning/}
}