MR-NET: Exploiting Mutual Relation for Visual Relationship Detection
Abstract
Inferring the interactions between objects, a.k.a visual relationship detection, is a crucial point for vision understanding, which captures more definite concepts than object detection. Most previous work that treats the interaction between a pair of objects as a one way fail to exploit the mutual relation between objects, which is essential to modern visual application. In this work, we propose a mutual relation net, dubbed MR-Net, to explore the mutual relation between paired objects for visual relationship detection. Specifically, we construct a mutual relation space to model the mutual interaction of paired objects, and employ linear constraint to optimize the mutual interaction, which is called mutual relation learning. Our mutual relation learning does not introduce any parameters, and can adapt to improve the performance of other methods. In addition, we devise a semantic ranking loss to discriminatively penalize predicates with semantic similarity, which is ignored by traditional loss function (e.g., cross entropy with softmax). Then, our MR-Net optimizes the mutual relation learning together with semantic ranking loss with a siamese network. The experimental results on two commonly used datasets (VG and VRD) demonstrate the superior performance of the proposed approach.
Cite
Text
Bin et al. "MR-NET: Exploiting Mutual Relation for Visual Relationship Detection." AAAI Conference on Artificial Intelligence, 2019. doi:10.1609/AAAI.V33I01.33018110Markdown
[Bin et al. "MR-NET: Exploiting Mutual Relation for Visual Relationship Detection." AAAI Conference on Artificial Intelligence, 2019.](https://mlanthology.org/aaai/2019/bin2019aaai-mr/) doi:10.1609/AAAI.V33I01.33018110BibTeX
@inproceedings{bin2019aaai-mr,
title = {{MR-NET: Exploiting Mutual Relation for Visual Relationship Detection}},
author = {Bin, Yi and Yang, Yang and Tao, Chaofan and Huang, Zi and Li, Jingjing and Shen, Heng Tao},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2019},
pages = {8110-8117},
doi = {10.1609/AAAI.V33I01.33018110},
url = {https://mlanthology.org/aaai/2019/bin2019aaai-mr/}
}