Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection

Abstract

Detecting all visual relationships is posed as the most fundamental task towards the ultimate semantic reasoning. However, due to the rich context embedded in the image and diverse language ambiguities, it is unrealistic to annotate and list all possible relationships for providing a noise-free supervised setting. All prior approaches simply adopt the traditional fully-supervised detection pipeline and ignore the effect of incomplete annotations on model convergence, resulting in the unstable optimization and unsatisfactory performance. In this work, we make the first attempt to address this critical incomplete annotations issue and reformulate this task via the Soft Transfer Learning (STL), which aims to transfer knowledge learned from the annotations in hand into the uncertain pairs in a self-supervised way. The knowledge transfer process is inferred from a principled gradient diagnosis. Extensive experiments on VRD and the large-scale VG benchmarks demonstrate the superiority of our STL method.

Cite

Text

Chen et al. "Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection." IEEE/CVF Winter Conference on Applications of Computer Vision, 2019. doi:10.1109/WACV.2019.00124

Markdown

[Chen et al. "Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection." IEEE/CVF Winter Conference on Applications of Computer Vision, 2019.](https://mlanthology.org/wacv/2019/chen2019wacv-soft/) doi:10.1109/WACV.2019.00124

BibTeX

@inproceedings{chen2019wacv-soft,
  title     = {{Soft Transfer Learning via Gradient Diagnosis for Visual Relationship Detection}},
  author    = {Chen, Diqi and Liang, Xiaodan and Wang, Yizhou and Gao, Wen},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2019},
  pages     = {1118-1126},
  doi       = {10.1109/WACV.2019.00124},
  url       = {https://mlanthology.org/wacv/2019/chen2019wacv-soft/}
}