Discovering Object Functionality

Abstract

Object functionality refers to the quality of an object that allows humans to perform some specific actions. It has been shown in psychology that functionality (affordance) is at least as essential as appearance in object recognition by humans. In computer vision, most previous work on functionality either assumes exactly one functionality for each object, or requires detailed annotation of human poses and objects. In this paper, we propose a weakly supervised approach to discover all possible object functionalities. Each object functionality is represented by a specific type of human-object interaction. Our method takes any possible human-object interaction into consideration, and evaluates image similarity in 3D rather than 2D in order to cluster human-object interactions more coherently. Experimental results on a dataset of people interacting with musical instruments show the effectiveness of our approach.

Cite

Text

Yao et al. "Discovering Object Functionality." International Conference on Computer Vision, 2013. doi:10.1109/ICCV.2013.312

Markdown

[Yao et al. "Discovering Object Functionality." International Conference on Computer Vision, 2013.](https://mlanthology.org/iccv/2013/yao2013iccv-discovering/) doi:10.1109/ICCV.2013.312

BibTeX

@inproceedings{yao2013iccv-discovering,
  title     = {{Discovering Object Functionality}},
  author    = {Yao, Bangpeng and Ma, Jiayuan and Fei-Fei, Li},
  booktitle = {International Conference on Computer Vision},
  year      = {2013},
  doi       = {10.1109/ICCV.2013.312},
  url       = {https://mlanthology.org/iccv/2013/yao2013iccv-discovering/}
}