Kernel Descriptors for Visual Recognition

Abstract

The design of low-level image features is critical for computer vision algorithms. Orientation histograms, such as those in SIFT~\cite{Lowe2004Distinctive} and HOG~\cite{Dalal2005Histograms}, are the most successful and popular features for visual object and scene recognition. We highlight the kernel view of orientation histograms, and show that they are equivalent to a certain type of match kernels over image patches. This novel view allows us to design a family of kernel descriptors which provide a unified and principled framework to turn pixel attributes (gradient, color, local binary pattern, \etc) into compact patch-level features. In particular, we introduce three types of match kernels to measure similarities between image patches, and construct compact low-dimensional kernel descriptors from these match kernels using kernel principal component analysis (KPCA)~\cite{Scholkopf1998Nonlinear}. Kernel descriptors are easy to design and can turn any type of pixel attribute into patch-level features. They outperform carefully tuned and sophisticated features including SIFT and deep belief networks. We report superior performance on standard image classification benchmarks: Scene-15, Caltech-101, CIFAR10 and CIFAR10-ImageNet.

Cite

Text

Bo et al. "Kernel Descriptors for Visual Recognition." Neural Information Processing Systems, 2010.

Markdown

[Bo et al. "Kernel Descriptors for Visual Recognition." Neural Information Processing Systems, 2010.](https://mlanthology.org/neurips/2010/bo2010neurips-kernel/)

BibTeX

@inproceedings{bo2010neurips-kernel,
  title     = {{Kernel Descriptors for Visual Recognition}},
  author    = {Bo, Liefeng and Ren, Xiaofeng and Fox, Dieter},
  booktitle = {Neural Information Processing Systems},
  year      = {2010},
  pages     = {244-252},
  url       = {https://mlanthology.org/neurips/2010/bo2010neurips-kernel/}
}