Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization

Singh, Krishna Kumar; Lee, Yong Jae

doi:10.1109/ICCV.2017.381

Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization

Krishna Kumar Singh, Yong Jae Lee

ICCV 2017

doi:10.1109/ICCV.2017.381 /iccv/2017/singh2017iccv-hideandseek/

Abstract

We propose 'Hide-and-Seek', a weakly-supervised framework that aims to improve object localization in images and action localization in videos. Most existing weakly-supervised methods localize only the most discriminative parts of an object rather than all relevant parts, which leads to suboptimal performance. Our key idea is to hide patches in a training image randomly, forcing the network to seek other relevant parts when the most discriminative part is hidden. Our approach only needs to modify the input image and can work with any network designed for object localization. During testing, we do not need to hide any patches. Our Hide-and-Seek approach obtains superior performance compared to previous methods for weakly-supervised object localization on the ILSVRC dataset. We also demonstrate that our framework can be easily extended to weakly-supervised action localization.

PDF ICCV Semantic Scholar

Cite

Text

Singh and Lee. "Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization." International Conference on Computer Vision, 2017. doi:10.1109/ICCV.2017.381

Markdown

[Singh and Lee. "Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization." International Conference on Computer Vision, 2017.](https://mlanthology.org/iccv/2017/singh2017iccv-hideandseek/) doi:10.1109/ICCV.2017.381

BibTeX

@inproceedings{singh2017iccv-hideandseek,
  title     = {{Hide-and-Seek: Forcing a Network to Be Meticulous for Weakly-Supervised Object and Action Localization}},
  author    = {Singh, Krishna Kumar and Lee, Yong Jae},
  booktitle = {International Conference on Computer Vision},
  year      = {2017},
  doi       = {10.1109/ICCV.2017.381},
  url       = {https://mlanthology.org/iccv/2017/singh2017iccv-hideandseek/}
}