Pool-Based Agnostic Experiment Design in Linear Regression

Abstract

We address the problem of batch active learning (or experiment design) in regression scenarios, where the best input points to label are chosen from a ‘pool’ of unlabeled input samples. Existing active learning methods often assume that the model is correctly specified, i.e., that the unknown learning target function is included in the model at hand. However, this assumption may not hold in practice (the agnostic case), and existing methods then do not work well. In this paper, we propose a new active learning method that is robust against model misspecification. Simulations with various benchmark datasets, as well as a real application to wafer alignment in semiconductor exposure apparatus, illustrate the usefulness of the proposed method.

Cite

Text

Sugiyama and Nakajima. "Pool-Based Agnostic Experiment Design in Linear Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008. doi:10.1007/978-3-540-87481-2_27

Markdown

[Sugiyama and Nakajima. "Pool-Based Agnostic Experiment Design in Linear Regression." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2008.](https://mlanthology.org/ecmlpkdd/2008/sugiyama2008ecmlpkdd-poolbased/) doi:10.1007/978-3-540-87481-2_27

BibTeX

@inproceedings{sugiyama2008ecmlpkdd-poolbased,
  title     = {{Pool-Based Agnostic Experiment Design in Linear Regression}},
  author    = {Sugiyama, Masashi and Nakajima, Shinichi},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2008},
  pages     = {406--422},
  doi       = {10.1007/978-3-540-87481-2_27},
  url       = {https://mlanthology.org/ecmlpkdd/2008/sugiyama2008ecmlpkdd-poolbased/}
}