Fast Distribution to Real Regression

Abstract

We study the problem of distribution to real-value regression, where one aims to regress a mapping $f$ that takes in a distribution input covariate $P\in \mathcal{I}$ (for a non-parametric family of distributions $\mathcal{I}$) and outputs a real-valued response $Y=f(P) + \epsilon$. This setting was recently studied, and a "Kernel-Kernel" estimator was introduced and shown to have a polynomial rate of convergence. However, evaluating a new prediction with the Kernel-Kernel estimator scales as $\Omega(N)$. This causes the difficult situation where a large amount of data may be necessary for a low estimation risk, but the computation cost of estimation becomes infeasible when the data-set is too large. To this end, we propose the Double-Basis estimator, which looks to alleviate this big data problem in two ways: first, the Double-Basis estimator is shown to have a computation complexity that is independent of the number of of instances $N$ when evaluating new predictions after training; secondly, the Double-Basis estimator is shown to have a fast rate of convergence for a general class of mappings $f\in\mathcal{F}$.

Cite

Text

Oliva et al. "Fast Distribution to Real Regression." International Conference on Artificial Intelligence and Statistics, 2014.

Markdown

[Oliva et al. "Fast Distribution to Real Regression." International Conference on Artificial Intelligence and Statistics, 2014.](https://mlanthology.org/aistats/2014/oliva2014aistats-fast/)

BibTeX

@inproceedings{oliva2014aistats-fast,
  title     = {{Fast Distribution to Real Regression}},
  author    = {Oliva, Junier B. and Neiswanger, Willie and Póczos, Barnabás and Schneider, Jeff G. and Xing, Eric P.},
  booktitle = {International Conference on Artificial Intelligence and Statistics},
  year      = {2014},
  pages     = {706-714},
  url       = {https://mlanthology.org/aistats/2014/oliva2014aistats-fast/}
}