Motion-Aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

Abstract

To equip artificial intelligence with a comprehensive understanding towards a temporal world, video and 4D panoptic scene graph generation abstracts visual data into nodes to represent entities and edges to capture temporal relations. Existing methods encode entity masks tracked across temporal dimensions (mask tubes), then predict their relations with temporal pooling operation, which does not fully utilize the motion indicative of the entities' relation. To overcome this limitation, we introduce a contrastive representation learning framework that focuses on motion pattern for temporal scene graph generation. Firstly, our framework encourages the model to learn close representations for mask tubes of similar subject-relation-object triplets. Secondly, we seek to push apart mask tubes from their temporally shuffled versions. Moreover, we also learn distant representations for mask tubes belonging to the same video but different triplets. Extensive experiments show that our motion-aware contrastive framework significantly improves state-of-the-art methods on both video and 4D datasets.

Cite

Text

Nguyen et al. "Motion-Aware Contrastive Learning for Temporal Panoptic Scene Graph Generation." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I6.32665

Markdown

[Nguyen et al. "Motion-Aware Contrastive Learning for Temporal Panoptic Scene Graph Generation." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/nguyen2025aaai-motion/) doi:10.1609/AAAI.V39I6.32665

BibTeX

@inproceedings{nguyen2025aaai-motion,
  title     = {{Motion-Aware Contrastive Learning for Temporal Panoptic Scene Graph Generation}},
  author    = {Nguyen, Thong Thanh and Wu, Xiaobao and Bin, Yi and Nguyen, Cong-Duy T. and Ng, See-Kiong and Luu, Anh Tuan},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {6218-6226},
  doi       = {10.1609/AAAI.V39I6.32665},
  url       = {https://mlanthology.org/aaai/2025/nguyen2025aaai-motion/}
}