Embracing Data Abundance

Abstract

There is a practically unlimited amount of natural language data available. Still, recent work in text comprehension has focused on datasets which are small relative to current computing possibilities. This article is making a case for the community to move to larger data and is offering the BookTest dataset as a step in that direction.

Cite

Text

Bajgar et al. "Embracing Data Abundance." International Conference on Learning Representations, 2017.

Markdown

[Bajgar et al. "Embracing Data Abundance." International Conference on Learning Representations, 2017.](https://mlanthology.org/iclr/2017/bajgar2017iclr-embracing/)

BibTeX

@inproceedings{bajgar2017iclr-embracing,
  title     = {{Embracing Data Abundance}},
  author    = {Bajgar, Ondrej and Kadlec, Rudolf and Kleindienst, Jan},
  booktitle = {International Conference on Learning Representations},
  year      = {2017},
  url       = {https://mlanthology.org/iclr/2017/bajgar2017iclr-embracing/}
}