Sentence-Based Image Description with Scalable, Explicit Models

Abstract

Associating photographs with complete sentences that describe what is depicted in them is a challenging problem. This paper examines how an approach that is inspired by image tagging techniques which can scale to very large data sets performs on this much harder task, and examines some of the linguistic difficulties that this bag-of-words model faces.

Cite

Text

Hodosh and Hockenmaier. "Sentence-Based Image Description with Scalable, Explicit Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2013. doi:10.1109/CVPRW.2013.51

Markdown

[Hodosh and Hockenmaier. "Sentence-Based Image Description with Scalable, Explicit Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2013.](https://mlanthology.org/cvprw/2013/hodosh2013cvprw-sentencebased/) doi:10.1109/CVPRW.2013.51

BibTeX

@inproceedings{hodosh2013cvprw-sentencebased,
  title     = {{Sentence-Based Image Description with Scalable, Explicit Models}},
  author    = {Hodosh, Micah and Hockenmaier, Julia},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2013},
  pages     = {294-300},
  doi       = {10.1109/CVPRW.2013.51},
  url       = {https://mlanthology.org/cvprw/2013/hodosh2013cvprw-sentencebased/}
}