Learning the Switching Rate by Discretising Bernoulli Sources Online

Abstract

The expert tracking algorithm Fixed-Share depends on a parameter alpha, called the switching rate. If the final number of outcomes $T$ is known in advance, then the switching rate can be learned with regret $\frac{1}{2} \log T + O(1)$ bits. The current fastest method that achieves this, Learn-alpha, is based on optimal discretisation of the Bernoulli distributions into $O(\sqrt{T})$ bins and runs in $(T\sqrt{T})$ time; however the exact locations of these points have to be determined algorithmically. This paper introduces a new discretisation scheme with the same regret bound for known $T$, that specifies the number and positions of the discretisation points explicitly. The scheme is especially useful when $T$ is not known in advance: a new fully online algorithm, Refine-Online, is presented, which runs in $O(T \sqrt{T} \log T)$ time and achieves a regret of $\frac{1}{2} \log 3 \log T + O(\log \log T)$ bits.

Cite

Text

Rooij and Erven. "Learning the Switching Rate by Discretising Bernoulli Sources Online." Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009.

Markdown

[Rooij and Erven. "Learning the Switching Rate by Discretising Bernoulli Sources Online." Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009.](https://mlanthology.org/aistats/2009/rooij2009aistats-learning/)

BibTeX

@inproceedings{rooij2009aistats-learning,
  title     = {{Learning the Switching Rate by Discretising Bernoulli Sources Online}},
  author    = {Rooij, Steven and Erven, Tim},
  booktitle = {Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics},
  year      = {2009},
  pages     = {432-439},
  volume    = {5},
  url       = {https://mlanthology.org/aistats/2009/rooij2009aistats-learning/}
}