Large-Scale Image Categorization with Explicit Data Embedding

Abstract

Kernel machines rely on an implicit mapping of the data such that non-linear classification in the original space corresponds to linear classification in the new space. As kernel machines are difficult to scale to large training sets, it has been proposed to perform an explicit mapping of the data and to learn directly linear classifiers in the new space. In this paper, we consider the problem of learning image categorizers on large image sets (e.g. > 100k images) using bag-of-visual-words (BOV) image representations and Support Vector Machine classifiers. We experiment with three approaches to BOV embedding: 1) kernel PCA (kPCA), 2) a modified kPCA we propose for additive kernels and 3) random projections for shift-invariant kernels. We report experiments on 3 datasets: Cal-tech101, VOC07 and ImageNet. An important conclusion is that simply square-rooting BOV vectors - which corresponds to an exact mapping for the Bhattacharyya kernel - already leads to large improvements, often quite close to the best results obtained with additive kernels. Another conclusion is that, although it is possible to go beyond additive kernels, the embedding comes at a much higher cost.

Cite

Text

Perronnin et al. "Large-Scale Image Categorization with Explicit Data Embedding." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010. doi:10.1109/CVPR.2010.5539914

Markdown

[Perronnin et al. "Large-Scale Image Categorization with Explicit Data Embedding." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2010.](https://mlanthology.org/cvpr/2010/perronnin2010cvpr-large-a/) doi:10.1109/CVPR.2010.5539914

BibTeX

@inproceedings{perronnin2010cvpr-large-a,
  title     = {{Large-Scale Image Categorization with Explicit Data Embedding}},
  author    = {Perronnin, Florent and Sánchez, Jorge and Liu, Yan},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2010},
  pages     = {2297-2304},
  doi       = {10.1109/CVPR.2010.5539914},
  url       = {https://mlanthology.org/cvpr/2010/perronnin2010cvpr-large-a/}
}