Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

Sadovnik, Amir; Gallagher, Andrew; Parikh, Devi; Chen, Tsuhan

doi:10.1109/ICCV.2013.268

Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing

Amir Sadovnik, Andrew Gallagher, Devi Parikh, Tsuhan Chen

ICCV 2013

doi:10.1109/ICCV.2013.268 /iccv/2013/sadovnik2013iccv-spoken/

Abstract

In recent years, there has been a great deal of progress in describing objects with attributes. Attributes have proven useful for object recognition, image search, face verification, image description, and zero-shot learning. Typically, attributes are either binary or relative: they describe either the presence or absence of a descriptive characteristic, or the relative magnitude of the characteristic when comparing two exemplars. However, prior work fails to model the actual way in which humans use these attributes in descriptive statements of images. Specifically, it does not address the important interactions between the binary and relative aspects of an attribute. In this work we propose a spoken attribute classifier which models a more natural way of using an attribute in a description. For each attribute we train a classifier which captures the specific way this attribute should be used. We show that as a result of using this model, we produce descriptions about images of people that are more natural and specific than past systems.

PDF ICCV Semantic Scholar

Cite

Text

Sadovnik et al. "Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.268

Markdown

[Sadovnik et al. "Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/sadovnik2013iccv-spoken/) doi:10.1109/ICCV.2013.268

BibTeX

@inproceedings{sadovnik2013iccv-spoken,
  title     = {{Spoken Attributes: Mixing Binary and Relative Attributes to Say the Right Thing}},
  author    = {Sadovnik, Amir and Gallagher, Andrew and Parikh, Devi and Chen, Tsuhan},
  booktitle = {International Conference on Computer Vision},
  year      = {2013},
  doi       = {10.1109/ICCV.2013.268},
  url       = {https://mlanthology.org/iccv/2013/sadovnik2013iccv-spoken/}
}