What Do You Do? Occupation Recognition in a Photo via Social Context

Abstract

In this paper, we investigate the problem of recognizing occupations of multiple people with arbitrary poses in a photo. Previous work utilizing single person's nearly frontal clothing information and fore/background context preliminarily proves that occupation recognition is computationally feasible in computer vision. However, in practice, multiple people with arbitrary poses are common in a photo, and recognizing their occupations is even more challenging. We argue that with appropriately built visual attributes, co-occurrence, and spatial configuration model that is learned through structure SVM, we can recognize multiple people's occupations in a photo simultaneously. To evaluate our method's performance, we conduct extensive experiments on a new well-labeled occupation database with 14 representative occupations and over 7K images. Results on this database validate our method's effectiveness and show that occupation recognition is solvable in a more general case.

Cite

Text

Shao et al. "What Do You Do? Occupation Recognition in a Photo via Social Context." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.451

Markdown

[Shao et al. "What Do You Do? Occupation Recognition in a Photo via Social Context." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/shao2013iccv-you/) doi:10.1109/ICCV.2013.451

BibTeX

@inproceedings{shao2013iccv-you,
  title     = {{What Do You Do? Occupation Recognition in a Photo via Social Context}},
  author    = {Shao, Ming and Li, Liangyue and Fu, Yun},
  booktitle = {International Conference on Computer Vision},
  year      = {2013},
  doi       = {10.1109/ICCV.2013.451},
  url       = {https://mlanthology.org/iccv/2013/shao2013iccv-you/}
}