Multi-Attribute Spaces: Calibration for Attribute Fusion and Similarity Search
Abstract
Recent work has shown that visual attributes are a powerful approach for applications such as recognition, image description and retrieval. However, fusing multiple attribute scores - as required during multi-attribute queries or similarity searches - presents a significant challenge. Scores from different attribute classifiers cannot be combined in a simple way; the same score for different attributes can mean different things. In this work, we show how to construct normalized “multi-attribute spaces” from raw classifier outputs, using techniques based on the statistical Extreme Value Theory. Our method calibrates each raw score to a probability that the given attribute is present in the image. We describe how these probabilities can be fused in a simple way to perform more accurate multiattribute searches, as well as enable attribute-based similarity searches. A significant advantage of our approach is that the normalization is done after-the-fact, requiring neither modification to the attribute classification system nor ground truth attribute annotations. We demonstrate results on a large data set of nearly 2 million face images and show significant improvements over prior work. We also show that perceptual similarity of search results increases by using contextual attributes.
Cite
Text
Scheirer et al. "Multi-Attribute Spaces: Calibration for Attribute Fusion and Similarity Search." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012. doi:10.1109/CVPR.2012.6248021Markdown
[Scheirer et al. "Multi-Attribute Spaces: Calibration for Attribute Fusion and Similarity Search." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2012.](https://mlanthology.org/cvpr/2012/scheirer2012cvpr-multi/) doi:10.1109/CVPR.2012.6248021BibTeX
@inproceedings{scheirer2012cvpr-multi,
title = {{Multi-Attribute Spaces: Calibration for Attribute Fusion and Similarity Search}},
author = {Scheirer, Walter J. and Kumar, Neeraj and Belhumeur, Peter N. and Boult, Terrance E.},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2012},
pages = {2933-2940},
doi = {10.1109/CVPR.2012.6248021},
url = {https://mlanthology.org/cvpr/2012/scheirer2012cvpr-multi/}
}