Online Active Linear Regression via Thresholding

Abstract

We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model. Our main contribution is a novel threshold-based algorithm for selection of most informative observations; we characterize its performance and fundamental lower bounds. We extend the algorithm and its guarantees to sparse linear regression in high-dimensional settings. Simulations suggest the algorithm is remarkably robust: it provides significant benefits over passive random sampling in real-world datasets that exhibit high nonlinearity and high dimensionality — significantly reducing both the mean and variance of the squared error.

Cite

Text

Riquelme et al. "Online Active Linear Regression via Thresholding." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10859

Markdown

[Riquelme et al. "Online Active Linear Regression via Thresholding." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/riquelme2017aaai-online/) doi:10.1609/AAAI.V31I1.10859

BibTeX

@inproceedings{riquelme2017aaai-online,
  title     = {{Online Active Linear Regression via Thresholding}},
  author    = {Riquelme, Carlos and Johari, Ramesh and Zhang, Baosen},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2017},
  pages     = {2506-2512},
  doi       = {10.1609/AAAI.V31I1.10859},
  url       = {https://mlanthology.org/aaai/2017/riquelme2017aaai-online/}
}