Adversarial Cross-Domain Action Recognition with Co-Attention

Abstract

Action recognition has been a widely studied topic with a heavy focus on supervised learning involving sufficient labeled videos. However, the problem of cross-domain action recognition, where training and testing videos are drawn from different underlying distributions, remains largely under-explored. Previous methods directly employ techniques for cross-domain image recognition, which tend to suffer from the severe temporal misalignment problem. This paper proposes a Temporal Co-attention Network (TCoN), which matches the distributions of temporally aligned action features between source and target domains using a novel cross-domain co-attention mechanism. Experimental results on three cross-domain action recognition datasets demonstrate that TCoN improves both previous single-domain and cross-domain methods significantly under the cross-domain setting.

Cite

Text

Pan et al. "Adversarial Cross-Domain Action Recognition with Co-Attention." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I07.6854

Markdown

[Pan et al. "Adversarial Cross-Domain Action Recognition with Co-Attention." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/pan2020aaai-adversarial/) doi:10.1609/AAAI.V34I07.6854

BibTeX

@inproceedings{pan2020aaai-adversarial,
  title     = {{Adversarial Cross-Domain Action Recognition with Co-Attention}},
  author    = {Pan, Boxiao and Cao, Zhangjie and Adeli, Ehsan and Niebles, Juan Carlos},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {11815-11822},
  doi       = {10.1609/AAAI.V34I07.6854},
  url       = {https://mlanthology.org/aaai/2020/pan2020aaai-adversarial/}
}