MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking
Abstract
In this work, we propose a novel Weakly Supervised Learning (WSL) framework dedicated to learn discriminative part detectors from images annotated with a global label. Our WSL method encompasses three main contributions. Firstly, we introduce a new structured output latent variable model, Minimum mAximum lateNt sTRucturAl SVM (MANTRA), which prediction relies on a pair of latent variables: h^+ (resp. h^-) provides positive (resp. negative) evidence for a given output y. Secondly, we instantiate MANTRA for two different visual recognition tasks: multi-class classification and ranking. For ranking, we propose efficient solutions to exactly solve the inference and the loss-augmented problems. Finally, extensive experiments highlight the relevance of the proposed method: MANTRA outperforms state-of-the art results on five different datasets.
Cite
Text
Durand et al. "MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking." International Conference on Computer Vision, 2015. doi:10.1109/ICCV.2015.311Markdown
[Durand et al. "MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking." International Conference on Computer Vision, 2015.](https://mlanthology.org/iccv/2015/durand2015iccv-mantra/) doi:10.1109/ICCV.2015.311BibTeX
@inproceedings{durand2015iccv-mantra,
title = {{MANTRA: Minimum Maximum Latent Structural SVM for Image Classification and Ranking}},
author = {Durand, Thibaut and Thome, Nicolas and Cord, Matthieu},
booktitle = {International Conference on Computer Vision},
year = {2015},
doi = {10.1109/ICCV.2015.311},
url = {https://mlanthology.org/iccv/2015/durand2015iccv-mantra/}
}