Weakly-Supervised Spatial Context Networks
Abstract
We explore the power of spatial context as a self-supervisory signal for learning visual representations. In particular, we propose spatial context networks that learn to predict a representation of one image patch from another image patch, within the same image, conditioned on their real-valued relative spatial offset. Unlike auto-encoders, that aim to encode and reconstruct original image patches, our network aims to encode and reconstruct intermediate representations of the spatially offset patches. As such, the network learns a spatially conditioned contextual representation. By testing performance with various patch selection mechanisms we show that focusing on object-centric patches is important, and that using object proposal as a patch selection mechanism leads to the highest improvement in performance. Further, unlike auto-encoders, context encoders [21], or other forms of unsupervised feature learning, we illustrate that contextual supervision (with pre-trained model initialization) can improve on existing pre-trained model performance. We build our spatial context networks on top of standard VGG_19 and CNN_M architectures and, among other things, show that we can achieve improvements (with no additional explicit supervision) over the original ImageNet pre-trained VGG_19 and CNN_M models in object categorization and detection on VOC2007.
Cite
Text
Wu et al. "Weakly-Supervised Spatial Context Networks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2019. doi:10.1109/WACV.2019.00138Markdown
[Wu et al. "Weakly-Supervised Spatial Context Networks." IEEE/CVF Winter Conference on Applications of Computer Vision, 2019.](https://mlanthology.org/wacv/2019/wu2019wacv-weakly/) doi:10.1109/WACV.2019.00138BibTeX
@inproceedings{wu2019wacv-weakly,
title = {{Weakly-Supervised Spatial Context Networks}},
author = {Wu, Zuxuan and Davis, Larry and Sigal, Leonid},
booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
year = {2019},
pages = {1253-1261},
doi = {10.1109/WACV.2019.00138},
url = {https://mlanthology.org/wacv/2019/wu2019wacv-weakly/}
}