DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection

Abstract

Recent years, human-object interaction (HOI) detection has achieved impressive advances. However, conventional two-stage methods are usually slow in inference. On the other hand, existing one-stage methods mainly focus on the union regions of interactions, which introduce unnecessary visual information as disturbances to HOI detection. To tackle the problems above, we propose a novel one-stage HOI detection approach DIRV in this paper, based on a new concept called interaction region for the HOI problem. Unlike previous methods, our approach concentrates on the densely sampled interaction regions across different scales for each human-object pair, so as to capture the subtle visual features that is most essential to the interaction. Moreover, in order to compensate for the detection flaws of a single interaction region, we introduce a novel voting strategy that makes full use of those overlapped interaction regions in place of conventional Non-Maximal Suppression (NMS). Extensive experiments on two popular benchmarks: V-COCO and HICO-DET show that our approach outperforms existing state-of-the-arts by a large margin with the highest inference speed and lightest network architecture. Our code is publicly available at www.github.com/MVIG-SJTU/DIRV.

Cite

Text

Fang et al. "DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection." AAAI Conference on Artificial Intelligence, 2021. doi:10.1609/AAAI.V35I2.16217

Markdown

[Fang et al. "DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection." AAAI Conference on Artificial Intelligence, 2021.](https://mlanthology.org/aaai/2021/fang2021aaai-dirv/) doi:10.1609/AAAI.V35I2.16217

BibTeX

@inproceedings{fang2021aaai-dirv,
  title     = {{DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection}},
  author    = {Fang, Haoshu and Xie, Yichen and Shao, Dian and Lu, Cewu},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {1291-1299},
  doi       = {10.1609/AAAI.V35I2.16217},
  url       = {https://mlanthology.org/aaai/2021/fang2021aaai-dirv/}
}