Reducing Overfitting in Process Model Induction
Abstract
In this paper, we review the paradigm of inductive process modeling, which uses background knowledge about possible component processes to construct quantitative models of dynamical systems. We note that previous methods for this task tend to overfit the training data, which suggests ensemble learning as a likely response. However, such techniques combine models in ways that reduce comprehensibility, making their output much less accessible to domain scientists. As an alternative, we introduce a new approach that induces a set of process models from different samples of the training data and uses them to guide a final search through the space of model structures. Experiments with synthetic and natural data suggest this method reduces error and decreases the chance of including unnecessary processes in the model. We conclude by discussing related work and suggesting directions for additional research.
Cite
Text
Bridewell et al. "Reducing Overfitting in Process Model Induction." International Conference on Machine Learning, 2005. doi:10.1145/1102351.1102362Markdown
[Bridewell et al. "Reducing Overfitting in Process Model Induction." International Conference on Machine Learning, 2005.](https://mlanthology.org/icml/2005/bridewell2005icml-reducing/) doi:10.1145/1102351.1102362BibTeX
@inproceedings{bridewell2005icml-reducing,
title = {{Reducing Overfitting in Process Model Induction}},
author = {Bridewell, Will and Asadi, Narges Bani and Langley, Pat and Todorovski, Ljupco},
booktitle = {International Conference on Machine Learning},
year = {2005},
pages = {81-88},
doi = {10.1145/1102351.1102362},
url = {https://mlanthology.org/icml/2005/bridewell2005icml-reducing/}
}