What if We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation

Abstract

Egocentric action anticipation aims at predicting the near future based on past observation in first-person vision. While future actions may be wrongly predicted due to the dataset bias, we present a counterfactual analysis framework for egocentric action anticipation (CA-EAA) to enhance the capacity. In the factual case, we can predict the upcoming action based on visual features and semantic labels from past observation. Imagining one counterfactual situation where no visual representation had been observed, we would obtain a counterfactual predicted action only using past semantic labels. In this way, we can reduce the side-effect caused by semantic labels via a comparison between factual and counterfactual outcomes, which moves a step towards unbiased prediction for egocentric action anticipation. We conduct experiments on two large-scale egocentric video datasets. Qualitative and quantitative results validate the effectiveness of our proposed CA-EAA.

Cite

Text

Zhang et al. "What if We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/182

Markdown

[Zhang et al. "What if We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/zhang2021ijcai-we/) doi:10.24963/IJCAI.2021/182

BibTeX

@inproceedings{zhang2021ijcai-we,
  title     = {{What if We Could Not See? Counterfactual Analysis for Egocentric Action Anticipation}},
  author    = {Zhang, Tianyu and Min, Weiqing and Yang, Jiahao and Liu, Tao and Jiang, Shuqiang and Rui, Yong},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {1316-1322},
  doi       = {10.24963/IJCAI.2021/182},
  url       = {https://mlanthology.org/ijcai/2021/zhang2021ijcai-we/}
}