The Minimum Transfer Cost Principle for Model-Order Selection

Abstract

The goal of model-order selection is to select a model variant that generalizes best from training data to unseen test data. In unsupervised learning, where no labels are available, computing the generalization error of a solution poses a conceptual problem, which we address in this paper. We formulate the principle of “minimum transfer costs” for model-order selection. This principle renders the concept of cross-validation applicable to unsupervised learning problems. As a substitute for labels, we introduce a mapping from objects of the training set to objects of the test set, which enables the transfer of training solutions. Our method is explained and investigated by applying it to well-known problems such as singular-value decomposition, correlation clustering, Gaussian mixture models, and k-means clustering. Our principle finds the optimal model complexity in controlled experiments and in real-world problems such as image denoising, role mining, and detection of misconfigurations in access-control data.
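
The transfer idea described in the abstract can be illustrated with a minimal sketch for k-means clustering. This is not the authors' implementation. It assumes that the mapping sends each test object to its nearest training object in Euclidean distance, that the transferred solution is scored with the usual k-means cost on the test set, and that costs are averaged over random half splits. The function names `kmeans`, `transfer_cost`, and `select_order` are hypothetical.

```python
import numpy as np

def kmeans(X, k, rng, n_iter=50):
    """Plain Lloyd's algorithm; returns (centroids, labels)."""
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old centroid if a cluster empties
                centroids[j] = members.mean(axis=0)
    # Recompute labels so they match the final centroids.
    dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)

def transfer_cost(X_train, X_test, k, rng):
    """Cost of the training solution after transferring it to the test set."""
    centroids, train_labels = kmeans(X_train, k, rng)
    # Assumed mapping: each test object maps to its nearest training object.
    d = ((X_test[:, None, :] - X_train[None, :, :]) ** 2).sum(axis=2)
    nearest_train = d.argmin(axis=1)
    # Transfer: a test object inherits the cluster of its mapped training object.
    transferred = train_labels[nearest_train]
    # Score the transferred assignment with the k-means cost on the test data.
    return float(((X_test - centroids[transferred]) ** 2).sum(axis=1).mean())

def select_order(X, candidate_ks, n_splits=5, seed=0):
    """Return the k with the lowest mean transfer cost, plus all mean costs."""
    rng = np.random.default_rng(seed)
    half = len(X) // 2
    mean_costs = {}
    for k in candidate_ks:
        costs = []
        for _ in range(n_splits):
            perm = rng.permutation(len(X))
            train, test = X[perm[:half]], X[perm[half:]]
            costs.append(transfer_cost(train, test, k, rng))
        mean_costs[k] = float(np.mean(costs))
    best_k = min(mean_costs, key=mean_costs.get)
    return best_k, mean_costs
```

Calling `select_order(X, [1, 2, 3, 4])` on an `(n, d)` array returns the candidate with the lowest average transfer cost together with the per-candidate costs. Because test objects inherit assignments from the training solution rather than being re-assigned to the closest centroid, a solution that overfits the training half tends to be penalized on the test half.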

Cite

Text

Frank et al. "The Minimum Transfer Cost Principle for Model-Order Selection." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2011. doi:10.1007/978-3-642-23780-5_37

BibTeX

@inproceedings{frank2011ecmlpkdd-minimum,
  title     = {{The Minimum Transfer Cost Principle for Model-Order Selection}},
  author    = {Frank, Mario and Chehreghani, Morteza Haghir and Buhmann, Joachim M.},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2011},
  pages     = {423--438},
  doi       = {10.1007/978-3-642-23780-5_37},
  url       = {https://mlanthology.org/ecmlpkdd/2011/frank2011ecmlpkdd-minimum/}
}