C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection

Abstract

Weakly supervised object detection (WSOD) is a challenging task when provided with image category supervision but required to simultaneously learn object locations and object detectors. Many WSOD approaches adopt multiple instance learning (MIL) and have non-convex loss functions which are prone to get stuck into local minima (falsely localize object parts) while missing full object extent during training. In this paper, we introduce a continuation optimization method into MIL and thereby creating continuation multiple instance learning (C-MIL), with the intention of alleviating the non-convexity problem in a systematic way. We partition instances into spatially related and class related subsets, and approximate the original loss function with a series of smoothed loss functions defined within the subsets. Optimizing smoothed loss functions prevents the training procedure falling prematurely into local minima and facilitates the discovery of Stable Semantic Extremal Regions (SSERs) which indicate full object extent. On the PASCAL VOC 2007 and 2012 datasets, C-MIL improves the state-of-the-art of weakly supervised object detection and weakly supervised object localization with large margins.

Cite

Text

Wan et al. "C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00230

Markdown

[Wan et al. "C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/wan2019cvpr-cmil/) doi:10.1109/CVPR.2019.00230

BibTeX

@inproceedings{wan2019cvpr-cmil,
  title     = {{C-MIL: Continuation Multiple Instance Learning for Weakly Supervised Object Detection}},
  author    = {Wan, Fang and Liu, Chang and Ke, Wei and Ji, Xiangyang and Jiao, Jianbin and Ye, Qixiang},
  booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2019},
  doi       = {10.1109/CVPR.2019.00230},
  url       = {https://mlanthology.org/cvpr/2019/wan2019cvpr-cmil/}
}