Order-Embeddings of Images and Language

Abstract

Hypernymy, textual entailment, and image captioning can be seen as special cases of a single visual-semantic hierarchy over words, sentences, and images. In this paper we advocate for explicitly modeling the partial order structure of this hierarchy. Towards this goal, we introduce a general method for learning ordered representations, and show how it can be applied to a variety of tasks involving images and language. We show that the resulting representations improve performance over current approaches for hypernym prediction and image-caption retrieval.

Cite

Text

Vendrov et al. "Order-Embeddings of Images and Language." International Conference on Learning Representations, 2016.

Markdown

[Vendrov et al. "Order-Embeddings of Images and Language." International Conference on Learning Representations, 2016.](https://mlanthology.org/iclr/2016/vendrov2016iclr-order/)

BibTeX

@inproceedings{vendrov2016iclr-order,
  title     = {{Order-Embeddings of Images and Language}},
  author    = {Vendrov, Ivan and Kiros, Ryan and Fidler, Sanja and Urtasun, Raquel},
  booktitle = {International Conference on Learning Representations},
  year      = {2016},
  url       = {https://mlanthology.org/iclr/2016/vendrov2016iclr-order/}
}