Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
Abstract
Despite the huge progress in scene graph generation in recent years, its long-tail distribution in object relationships remains a challenging and pestering issue. Existing methods largely rely on either external knowledge or statistical bias information to alleviate this problem. In this paper, we tackle this issue from another two aspects: (1) scene-object interaction aiming at learning specific knowledge from a scene via an additive attention mechanism; and (2) long-tail knowledge transfer which tries to transfer the rich knowledge learned from the head into the tail. Extensive experiments on the benchmark dataset Visual Genome on three tasks demonstrate that our method outperforms current state-of-the-art competitors. Our source code is available at https://github.com/htlsn/issg.
Cite
Text
He et al. "Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation." International Joint Conference on Artificial Intelligence, 2020. doi:10.24963/IJCAI.2020/82Markdown
[He et al. "Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation." International Joint Conference on Artificial Intelligence, 2020.](https://mlanthology.org/ijcai/2020/he2020ijcai-learning/) doi:10.24963/IJCAI.2020/82BibTeX
@inproceedings{he2020ijcai-learning,
title = {{Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation}},
author = {He, Tao and Gao, Lianli and Song, Jingkuan and Cai, Jianfei and Li, Yuan-Fang},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2020},
pages = {587-593},
doi = {10.24963/IJCAI.2020/82},
url = {https://mlanthology.org/ijcai/2020/he2020ijcai-learning/}
}