A Latent Model of Discriminative Aspect

Abstract

Recognition using appearance features is confounded by phenomena that cause images of the same object to look different, or images of different objects to look the same. This may occur because the same object looks different from different viewing directions, or because two generally different objects have views from which they look similar. In this paper, we introduce the idea of discriminative aspect, a set of latent variables that encode these phenomena. Changes in view direction are one cause of changes in discriminative aspect, but others include changes in texture or lighting. However, images are not labelled with relevant discriminative aspect parameters. We describe a method to improve discrimination by inferring and then using latent discriminative aspect parameters. We apply our method to two parallel problems: object category recognition and human activity recognition. In each case, appearance features are powerful given appropriate training data, but traditionally fail badly under large changes in view. Our method can recognize an object quite reliably in a view for which it possesses no training example. Our method also reweights features to discount accidental similarities in appearance. We demonstrate that our method produces a significant improvement on the state of the art for both object and activity recognition.

Cite

Text

Farhadi et al. "A Latent Model of Discriminative Aspect." IEEE/CVF International Conference on Computer Vision, 2009. doi:10.1109/ICCV.2009.5459350

Markdown

[Farhadi et al. "A Latent Model of Discriminative Aspect." IEEE/CVF International Conference on Computer Vision, 2009.](https://mlanthology.org/iccv/2009/farhadi2009iccv-latent/) doi:10.1109/ICCV.2009.5459350

BibTeX

@inproceedings{farhadi2009iccv-latent,
  title     = {{A Latent Model of Discriminative Aspect}},
  author    = {Farhadi, Ali and Tabrizi, Mostafa Kamali and Endres, Ian and Forsyth, David A.},
  booktitle = {IEEE/CVF International Conference on Computer Vision},
  year      = {2009},
  pages     = {948-955},
  doi       = {10.1109/ICCV.2009.5459350},
  url       = {https://mlanthology.org/iccv/2009/farhadi2009iccv-latent/}
}