Mode Estimation with Partial Feedback

Abstract

The combination of lightly supervised pre-training and online fine-tuning has played a key role in recent AI developments. These new learning pipelines call for new theoretical frameworks. In this paper, we formalize key aspects of weakly supervised and active learning with a simple problem: the estimation of the mode of a distribution with partial feedback. We showcase how entropy coding allows for optimal information acquisition from partial feedback, develop coarse sufficient statistics for mode identification, and adapt bandit algorithms to our new setting. Finally, we combine those contributions into a statistically and computationally efficient solution to our original problem.

Cite

Text

Arnal et al. "Mode Estimation with Partial Feedback." Conference on Learning Theory, 2024.

Markdown

[Arnal et al. "Mode Estimation with Partial Feedback." Conference on Learning Theory, 2024.](https://mlanthology.org/colt/2024/arnal2024colt-mode/)

BibTeX

@inproceedings{arnal2024colt-mode,
  title     = {{Mode Estimation with Partial Feedback}},
  author    = {Arnal, Charles and Cabannes, Vivien and Perchet, Vianney},
  booktitle = {Conference on Learning Theory},
  year      = {2024},
  pages     = {219-220},
  volume    = {247},
  url       = {https://mlanthology.org/colt/2024/arnal2024colt-mode/}
}