Expectation Maximization for Weakly Labeled Data
Abstract
We call data weakly labeled if it has no exact label but rather a numerical indication of correctness o the label "guessed" by the learning algo rithm - a situatio n cofi;)?v enco( tered inpro?(:fi o reinfo rcement learning. The term emphasizes similarities o o r approfi h to the kno wn techniqueso fso lving unsupervised and transductive pro blems. In this paper we present an on-line algoj thm that casts the pro blem as a multi-arm bandit with hidden state and so lves it iteratively within the Expectatio n-Maximizatio framewo7fi The hidden state is represented by a parameterized pro bability distributio no ver states tied to the reward. The parameterizatio n isfo rmally justified, allowing for smooth blending between likeliho o d- and reward-based costs.
Cite
Text
Ivanov et al. "Expectation Maximization for Weakly Labeled Data." International Conference on Machine Learning, 2001.Markdown
[Ivanov et al. "Expectation Maximization for Weakly Labeled Data." International Conference on Machine Learning, 2001.](https://mlanthology.org/icml/2001/ivanov2001icml-expectation/)BibTeX
@inproceedings{ivanov2001icml-expectation,
title = {{Expectation Maximization for Weakly Labeled Data}},
author = {Ivanov, Yuri A. and Blumberg, Bruce and Pentland, Alex},
booktitle = {International Conference on Machine Learning},
year = {2001},
pages = {218-225},
url = {https://mlanthology.org/icml/2001/ivanov2001icml-expectation/}
}