Recognizing Actions Using Object States

Abstract

Object-centric actions cause changes in object states, including their visual appearance and their immediate context. We propose a computational framework that uses only two object states, start and end, and learns to recognize the under-lying actions. Our approach has two modules that learn subtle changes induced by the action and suppress spurious correlations. We demonstrate that only two object states are sufficient to recognize object-centric actions. Our framework per-forms better than approaches that use multiple frames and a relatively large model.Moreover, our method generalizes to unseen objects and unseen video datasets

Cite

Text

Saini et al. "Recognizing Actions Using Object States." ICLR 2022 Workshops: OSC, 2022.

Markdown

[Saini et al. "Recognizing Actions Using Object States." ICLR 2022 Workshops: OSC, 2022.](https://mlanthology.org/iclrw/2022/saini2022iclrw-recognizing/)

BibTeX

@inproceedings{saini2022iclrw-recognizing,
  title     = {{Recognizing Actions Using Object States}},
  author    = {Saini, Nirat and He, Bo and Shrivastava, Gaurav and Rambhatla, Sai Saketh and Shrivastava, Abhinav},
  booktitle = {ICLR 2022 Workshops: OSC},
  year      = {2022},
  url       = {https://mlanthology.org/iclrw/2022/saini2022iclrw-recognizing/}
}