Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection

Abstract

Human-Object Interaction (HOI) detection plays a vital role in scene understanding, which aims to predict the HOI triplet in the form of . Existing methods mainly extract multi-modal features (e.g., appearance, object semantics, human pose) and then fuse them together to directly predict HOI triplets. However, most of these methods focus on seeking for self-triplet aggregation, but ignore the potential cross-triplet dependencies, resulting in ambiguity of action prediction. In this work, we propose to explore Self- and Cross-Triplet Correlations (SCTC) for HOI detection. Specifically, we regard each triplet proposal as a graph where Human, Object represent nodes and Action indicates edge, to aggregate self-triplet correlation. Also, we try to explore cross-triplet dependencies by jointly considering instance-level, semantic-level, and layout-level relations. Besides, we leverage the CLIP model to assist our SCTC obtain interaction-aware feature by knowledge distillation, which provides useful action clues for HOI detection. Extensive experiments on HICO-DET and V-COCO datasets verify the effectiveness of our proposed SCTC.

Cite

Text

Jiang et al. "Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection." AAAI Conference on Artificial Intelligence, 2024. doi:10.1609/AAAI.V38I3.28031

Markdown

[Jiang et al. "Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection." AAAI Conference on Artificial Intelligence, 2024.](https://mlanthology.org/aaai/2024/jiang2024aaai-exploring/) doi:10.1609/AAAI.V38I3.28031

BibTeX

@inproceedings{jiang2024aaai-exploring,
  title     = {{Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection}},
  author    = {Jiang, Weibo and Ren, Weihong and Tian, Jiandong and Qu, Liangqiong and Wang, Zhiyong and Liu, Honghai},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {2543-2551},
  doi       = {10.1609/AAAI.V38I3.28031},
  url       = {https://mlanthology.org/aaai/2024/jiang2024aaai-exploring/}
}