Online Active Linear Regression via Thresholding
Abstract
We consider the problem of online active learning to collect data for regression modeling. Specifically, we consider a decision maker with a limited experimentation budget who must efficiently learn an underlying linear population model. Our main contribution is a novel threshold-based algorithm for selection of most informative observations; we characterize its performance and fundamental lower bounds. We extend the algorithm and its guarantees to sparse linear regression in high-dimensional settings. Simulations suggest the algorithm is remarkably robust: it provides significant benefits over passive random sampling in real-world datasets that exhibit high nonlinearity and high dimensionality — significantly reducing both the mean and variance of the squared error.
Cite
Text
Riquelme et al. "Online Active Linear Regression via Thresholding." AAAI Conference on Artificial Intelligence, 2017. doi:10.1609/AAAI.V31I1.10859Markdown
[Riquelme et al. "Online Active Linear Regression via Thresholding." AAAI Conference on Artificial Intelligence, 2017.](https://mlanthology.org/aaai/2017/riquelme2017aaai-online/) doi:10.1609/AAAI.V31I1.10859BibTeX
@inproceedings{riquelme2017aaai-online,
title = {{Online Active Linear Regression via Thresholding}},
author = {Riquelme, Carlos and Johari, Ramesh and Zhang, Baosen},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2017},
pages = {2506-2512},
doi = {10.1609/AAAI.V31I1.10859},
url = {https://mlanthology.org/aaai/2017/riquelme2017aaai-online/}
}