Learning Graph Structure for Multi-Label Image Classification via Clique Generation

Abstract

Exploiting label dependency for multi-label image classification can significantly improve classification performance. Probabilistic Graphical Models are one of the primary methods for representing such dependencies. The structure of graphical models, however, is either determined heuristically or learned from very limited information. Moreover, neither of these approaches scales well to large or complex graphs. We propose a principled way to learn the structure of a graphical model by considering input features and labels, together with loss functions. We formulate this problem into a max-margin framework initially, and then transform it into a convex programming problem. Finally, we propose a highly scalable procedure that activates a set of cliques iteratively. Our approach exhibits both strong theoretical properties and a significant performance improvement over state-of-the-art methods on both synthetic and real-world data sets.

Cite

Text

Tan et al. "Learning Graph Structure for Multi-Label Image Classification via Clique Generation." Conference on Computer Vision and Pattern Recognition, 2015. doi:10.1109/CVPR.2015.7299037

Markdown

[Tan et al. "Learning Graph Structure for Multi-Label Image Classification via Clique Generation." Conference on Computer Vision and Pattern Recognition, 2015.](https://mlanthology.org/cvpr/2015/tan2015cvpr-learning/) doi:10.1109/CVPR.2015.7299037

BibTeX

@inproceedings{tan2015cvpr-learning,
  title     = {{Learning Graph Structure for Multi-Label Image Classification via Clique Generation}},
  author    = {Tan, Mingkui and Shi, Qinfeng and van den Hengel, Anton and Shen, Chunhua and Gao, Junbin and Hu, Fuyuan and Zhang, Zhen},
  booktitle = {Conference on Computer Vision and Pattern Recognition},
  year      = {2015},
  doi       = {10.1109/CVPR.2015.7299037},
  url       = {https://mlanthology.org/cvpr/2015/tan2015cvpr-learning/}
}