Max-Margin Dictionary Learning for Multiclass Image Categorization

Abstract

Visual dictionary learning and base (binary) classifier training are two basic problems for the recently most popular image categorization framework, which is based on the bag-of-visual-terms (BOV) models and multiclass SVM classifiers. In this paper, we study new algorithms to improve performance of this framework from these two aspects. Typically SVM classifiers are trained with dictionaries fixed, and as a result the traditional loss function can only be minimized with respect to hyperplane parameters ( w and b ). We propose a novel loss function for a binary classifier, which links the hinge-loss term with dictionary learning. By doing so, we can further optimize the loss function with respect to the dictionary parameters. Thus, this framework is able to further increase margins of binary classifiers, and consequently decrease the error bound of the aggregated classifier. On two benchmark dataset, Graz [1] and the fifteen scene category dataset [2], our experiment results significantly outperformed state-of-the-art works.

Cite

Text

Lian et al. "Max-Margin Dictionary Learning for Multiclass Image Categorization." European Conference on Computer Vision, 2010. doi:10.1007/978-3-642-15561-1_12

Markdown

[Lian et al. "Max-Margin Dictionary Learning for Multiclass Image Categorization." European Conference on Computer Vision, 2010.](https://mlanthology.org/eccv/2010/lian2010eccv-max/) doi:10.1007/978-3-642-15561-1_12

BibTeX

@inproceedings{lian2010eccv-max,
  title     = {{Max-Margin Dictionary Learning for Multiclass Image Categorization}},
  author    = {Lian, Xiao-Chen and Li, Zhiwei and Lu, Bao-Liang and Zhang, Lei},
  booktitle = {European Conference on Computer Vision},
  year      = {2010},
  pages     = {157-170},
  doi       = {10.1007/978-3-642-15561-1_12},
  url       = {https://mlanthology.org/eccv/2010/lian2010eccv-max/}
}