Alternative Semantic Representations for Zero-Shot Human Action Recognition

Abstract

A proper semantic representation for encoding side information is key to the success of zero-shot learning. In this paper, we explore two alternative semantic representations especially for zero-shot human action recognition: textual descriptions of human actions and deep features extracted from still images relevant to human actions. Such side information are accessible on Web with little cost, which paves a new way in gaining side information for large-scale zero-shot human action recognition. We investigate different encoding methods to generate semantic representations for human actions from such side information. Based on our zero-shot visual recognition method, we conducted experiments on UCF101 and HMDB51 to evaluate two proposed semantic representations. The results suggest that our proposed text- and image-based semantic representations outperform traditional attributes and word vectors considerably for zero-shot human action recognition. In particular, the image-based semantic representations yield the favourable performance even though the representation is extracted from a small number of images per class.

Cite

Text

Wang and Chen. "Alternative Semantic Representations for Zero-Shot Human Action Recognition." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017. doi:10.1007/978-3-319-71249-9_6

Markdown

[Wang and Chen. "Alternative Semantic Representations for Zero-Shot Human Action Recognition." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2017.](https://mlanthology.org/ecmlpkdd/2017/wang2017ecmlpkdd-alternative/) doi:10.1007/978-3-319-71249-9_6

BibTeX

@inproceedings{wang2017ecmlpkdd-alternative,
  title     = {{Alternative Semantic Representations for Zero-Shot Human Action Recognition}},
  author    = {Wang, Qian and Chen, Ke},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2017},
  pages     = {87-102},
  doi       = {10.1007/978-3-319-71249-9_6},
  url       = {https://mlanthology.org/ecmlpkdd/2017/wang2017ecmlpkdd-alternative/}
}