GAIN: On the Generalization of Instructional Action Understanding

Abstract

Despite the great success achieved in instructional action understanding by deep learning and mountainous data, deploying trained models to the unseen environment still remains a great challenge, since it requires strong generalizability of models from in-distribution training data to out-of-distribution (OOD) data. In this paper, we introduce a benchmark, named GAIN, to analyze the GeneralizAbility of INstructional action understanding models. In GAIN, we reassemble steps of existing instructional video training datasets to construct the OOD tasks and then collect the corresponding videos. We evaluate the generalizability of models trained on in-distribution datasets with the performance on OOD videos and observe a significant performance drop. We further propose a simple yet effective approach, which cuts off the excessive contextual dependency of action steps by performing causal inference, to provide a potential direction for enhancing the OOD generalizability. In the experiments, we show that this simple approach can improve several baselines on both instructional action segmentation and detection tasks. We expect the introduction of the GAIN dataset will promote future in-depth research on the generalization of instructional video understanding.

Cite

Text

Li et al. "GAIN: On the Generalization of Instructional Action Understanding." International Conference on Learning Representations, 2023.

Markdown

[Li et al. "GAIN: On the Generalization of Instructional Action Understanding." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/li2023iclr-gain/)

BibTeX

@inproceedings{li2023iclr-gain,
  title     = {{GAIN: On the Generalization of Instructional Action Understanding}},
  author    = {Li, Junlong and Chen, Guangyi and Tang, Yansong and Bao, Jinan and Zhang, Kun and Zhou, Jie and Lu, Jiwen},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/li2023iclr-gain/}
}