Sentence-Based Image Description with Scalable, Explicit Models
Abstract
Associating photographs with complete sentences that describe what is depicted in them is a challenging problem. This paper examines how an approach that is inspired by image tagging techniques which can scale to very large data sets performs on this much harder task, and examines some of the linguistic difficulties that this bag-of-words model faces.
Cite
Text
Hodosh and Hockenmaier. "Sentence-Based Image Description with Scalable, Explicit Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2013. doi:10.1109/CVPRW.2013.51Markdown
[Hodosh and Hockenmaier. "Sentence-Based Image Description with Scalable, Explicit Models." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2013.](https://mlanthology.org/cvprw/2013/hodosh2013cvprw-sentencebased/) doi:10.1109/CVPRW.2013.51BibTeX
@inproceedings{hodosh2013cvprw-sentencebased,
title = {{Sentence-Based Image Description with Scalable, Explicit Models}},
author = {Hodosh, Micah and Hockenmaier, Julia},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2013},
pages = {294-300},
doi = {10.1109/CVPRW.2013.51},
url = {https://mlanthology.org/cvprw/2013/hodosh2013cvprw-sentencebased/}
}