A Joint Appearance-Spatial Distance for Kernel-Based Image Categorization

Abstract

The goal of image categorization is to classify a collection of unlabeled images into a set of predefined classes to support semantic-level image retrieval. The distance measures used in most existing approaches either ignored the spatial structures or used them in a separate step. As a result, these distance measures achieved only limited success. To address these difficulties, in this paper, we propose a new distance measure that integrates joint appearance-spatial image features. Such a distance measure is computed as an upper bound of an information-theoretic discrimination, and can be computed efficiently in a recursive formulation that scales well to image size. In addition, the upper bound approximation can be further tightened via adaption learning from a universal reference model. Extensive experiments on two widely-used data sets show that the proposed approach significantly outperforms the state-of-the-art approaches.

Cite

Text

Qi et al. "A Joint Appearance-Spatial Distance for Kernel-Based Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587379

Markdown

[Qi et al. "A Joint Appearance-Spatial Distance for Kernel-Based Image Categorization." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/qi2008cvpr-joint/) doi:10.1109/CVPR.2008.4587379

BibTeX

@inproceedings{qi2008cvpr-joint,
  title     = {{A Joint Appearance-Spatial Distance for Kernel-Based Image Categorization}},
  author    = {Qi, Guo-Jun and Hua, Xian-Sheng and Rui, Yong and Tang, Jinhui and Zha, Zheng-Jun and Zhang, Hong-Jiang},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587379},
  url       = {https://mlanthology.org/cvpr/2008/qi2008cvpr-joint/}
}