A Two-Stage Method for Active Learning of Statistical Grammars

Abstract

Active learning reduces the amount of manually annotated sentences necessary when training state-ofthe-art statistical parsers. One popular method, uncertainty sampling, selects sentences for which the parser exhibits low certainty. However, this method does not quantify confidence about the current statistical model itself. In particular, we should be less confident about selection decisions based on low frequency events. We present a novel twostage method which first targets sentences which cannot be reliably selected using uncertainty sampling, and then applies standard uncertainty sampling to the remaining sentences. An evaluation shows that this method performs better than pure uncertainty sampling, and better than an ensemble method based on bagged ensemble members only. 1

Cite

Text

Becker and Osborne. "A Two-Stage Method for Active Learning of Statistical Grammars." International Joint Conference on Artificial Intelligence, 2005.

Markdown

[Becker and Osborne. "A Two-Stage Method for Active Learning of Statistical Grammars." International Joint Conference on Artificial Intelligence, 2005.](https://mlanthology.org/ijcai/2005/becker2005ijcai-two/)

BibTeX

@inproceedings{becker2005ijcai-two,
  title     = {{A Two-Stage Method for Active Learning of Statistical Grammars}},
  author    = {Becker, Markus and Osborne, Miles},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2005},
  pages     = {991-996},
  url       = {https://mlanthology.org/ijcai/2005/becker2005ijcai-two/}
}