Causality Compensated Attention for Contextual Biased Visual Recognition
Abstract
Visual attention does not always capture the essential object representation desired for robust predictions. Attention modules tend to highlight not only the target object but also the commonly co-occurring context that the module deems helpful during training. The problem is rooted in the confounding effect of the context, which induces spurious causal links between objects and predictions and is further exacerbated by visual attention. In this paper, to learn causal object features robust to contextual bias, we propose a novel attention module named Interventional Dual Attention (IDA) for visual recognition. Specifically, IDA adopts two attention layers with a multiple-sampling intervention, which compensates the attention for the confounding context. Note that our method is model-agnostic and can thus be implemented on various backbones. Extensive experiments show that our model obtains significant improvements in classification and detection at lower computational cost. In particular, we achieve state-of-the-art results in multi-label classification on MS-COCO and PASCAL-VOC.
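As a rough illustration of the mechanism the abstract describes, the sketch below pairs a primary attention layer with a second, interventional attention layer that attends over a small dictionary of learned confounder contexts and adds the result back as compensation. This is a minimal sketch under assumptions: the class name, the learned confounder dictionary, and the additive combination are illustrative choices, not the authors' implementation.

```python
import torch
import torch.nn as nn

class InterventionalDualAttentionSketch(nn.Module):
    """Hypothetical sketch of the dual-attention idea in the abstract.

    A primary self-attention layer is compensated by a second attention
    layer that attends over sampled confounder contexts, approximating
    an expectation over interventions on the context. All details
    (confounder dictionary, additive compensation) are assumptions.
    """

    def __init__(self, dim: int, num_heads: int = 8, num_confounders: int = 16):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.intervention_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        # Stand-in confounder dictionary; in practice such prototypes might
        # be estimated from feature statistics of the training set.
        self.confounders = nn.Parameter(torch.randn(num_confounders, dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, tokens, dim) -- flattened spatial features.
        attn_out, _ = self.self_attn(x, x, x)  # possibly context-biased attention
        batch = x.size(0)
        ctx = self.confounders.unsqueeze(0).expand(batch, -1, -1)  # (batch, S, dim)
        # Attend from the features to the sampled confounders, standing in
        # for an average over context interventions.
        comp_out, _ = self.intervention_attn(x, ctx, ctx)
        # Compensate the biased attention with the interventional term.
        return attn_out + comp_out

if __name__ == "__main__":
    ida = InterventionalDualAttentionSketch(dim=256)
    feats = torch.randn(2, 49, 256)  # e.g. a 7x7 feature map per image
    print(ida(feats).shape)          # torch.Size([2, 49, 256])
```

Because the module maps (batch, tokens, dim) to the same shape, a sketch like this can be dropped after any backbone stage, consistent with the abstract's claim that the method is model-agnostic.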
Cite
Text
Liu et al. "Causality Compensated Attention for Contextual Biased Visual Recognition." International Conference on Learning Representations, 2023.
Markdown
[Liu et al. "Causality Compensated Attention for Contextual Biased Visual Recognition." International Conference on Learning Representations, 2023.](https://mlanthology.org/iclr/2023/liu2023iclr-causality/)
BibTeX
@inproceedings{liu2023iclr-causality,
  title     = {{Causality Compensated Attention for Contextual Biased Visual Recognition}},
  author    = {Liu, Ruyang and Huang, Jingjia and Li, Thomas H. and Li, Ge},
  booktitle = {International Conference on Learning Representations},
  year      = {2023},
  url       = {https://mlanthology.org/iclr/2023/liu2023iclr-causality/}
}