Zero-Shot Recognition via Structured Prediction

Abstract

We develop a novel method for zero shot learning (ZSL) based on test-time adaptation of similarity functions learned using training data. Existing methods exclusively employ source-domain side information for recognizing unseen classes during test time. We show that for batch-mode applications, accuracy can be significantly improved by adapting these predictors to the observed test-time target-domain ensemble. We develop a novel structured prediction method for maximum a posteriori (MAP) estimation, where parameters account for test-time domain shift from what is predicted primarily using source domain information. We propose a Gaussian parameterization for the MAP problem and derive an efficient structure prediction algorithm. Empirically we test our method on four popular benchmark image datasets for ZSL, and show significant improvement over the state-of-the-art, on average, by 11.50 % and 30.12 % in terms of accuracy for recognition and mean average precision (mAP) for retrieval, respectively.

Cite

Text

Zhang and Saligrama. "Zero-Shot Recognition via Structured Prediction." European Conference on Computer Vision, 2016. doi:10.1007/978-3-319-46478-7_33

Markdown

[Zhang and Saligrama. "Zero-Shot Recognition via Structured Prediction." European Conference on Computer Vision, 2016.](https://mlanthology.org/eccv/2016/zhang2016eccv-zero/) doi:10.1007/978-3-319-46478-7_33

BibTeX

@inproceedings{zhang2016eccv-zero,
  title     = {{Zero-Shot Recognition via Structured Prediction}},
  author    = {Zhang, Ziming and Saligrama, Venkatesh},
  booktitle = {European Conference on Computer Vision},
  year      = {2016},
  pages     = {533-548},
  doi       = {10.1007/978-3-319-46478-7_33},
  url       = {https://mlanthology.org/eccv/2016/zhang2016eccv-zero/}
}