Latent Embeddings for Zero-Shot Classification

Abstract

We present a novel latent embedding model for learning a compatibility function between image and class embeddings, in the context of zero-shot classification. The proposed method augments the state-of-the-art bilinear compatibility model by incorporating latent variables. Instead of learning a single bilinear map, it learns a collection of maps with the selection, of which map to use, being a latent variable for the current image-class pair. We train the model with a ranking based objective function which penalizes incorrect rankings of the true class for a given image. We empirically demonstrate that our model improves the state-of-the-art for various class embeddings consistently on three challenging publicly available datasets for the zero-shot setting. Moreover, our method leads to visually highly interpretable results with clear clusters of different fine-grained object properties that correspond to different latent variable maps.

Cite

Text

Xian et al. "Latent Embeddings for Zero-Shot Classification." Conference on Computer Vision and Pattern Recognition, 2016. doi:10.1109/CVPR.2016.15

Markdown

[Xian et al. "Latent Embeddings for Zero-Shot Classification." Conference on Computer Vision and Pattern Recognition, 2016.](https://mlanthology.org/cvpr/2016/xian2016cvpr-latent/) doi:10.1109/CVPR.2016.15

BibTeX

@inproceedings{xian2016cvpr-latent,
  title     = {{Latent Embeddings for Zero-Shot Classification}},
  author    = {Xian, Yongqin and Akata, Zeynep and Sharma, Gaurav and Nguyen, Quynh and Hein, Matthias and Schiele, Bernt},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2016},
  doi       = {10.1109/CVPR.2016.15},
  url       = {https://mlanthology.org/cvpr/2016/xian2016cvpr-latent/}
}