Efficient Linear Bandits Through Matrix Sketching

Abstract

We prove that two popular linear contextual bandit algorithms, OFUL and Thompson Sampling, can be made efficient using Frequent Directions, a deterministic online sketching technique. More precisely, we show that a sketch of size $m$ allows a $\mathcal{O}(md)$ update time for both algorithms, as opposed to $\Omega(d^2)$ required by their non-sketched versions in general (where $d$ is the dimension of context vectors). This computational speedup is accompanied by regret bounds of order $(1+\varepsilon_m)^{3/2}d\sqrt{T}$ for OFUL and of order $\big((1+\varepsilon_m)d\big)^{3/2}\sqrt{T}$ for Thompson Sampling, where $\varepsilon_m$ is bounded by the sum of the tail eigenvalues not covered by the sketch. In particular, when the selected contexts span a subspace of dimension at most $m$, our algorithms have a regret bound matching that of their slower, non-sketched counterparts. Experiments on real-world datasets corroborate our theoretical results.

Cite

Text

Kuzborskij et al. "Efficient Linear Bandits Through Matrix Sketching." Artificial Intelligence and Statistics, 2019.

Markdown

[Kuzborskij et al. "Efficient Linear Bandits Through Matrix Sketching." Artificial Intelligence and Statistics, 2019.](https://mlanthology.org/aistats/2019/kuzborskij2019aistats-efficient/)

BibTeX

@inproceedings{kuzborskij2019aistats-efficient,
  title     = {{Efficient Linear Bandits Through Matrix Sketching}},
  author    = {Kuzborskij, Ilja and Cella, Leonardo and Cesa-Bianchi, Nicolò},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2019},
  pages     = {177-185},
  volume    = {89},
  url       = {https://mlanthology.org/aistats/2019/kuzborskij2019aistats-efficient/}
}