A Maximum-Likelihood Approach to Visual Event Classification

Abstract

This paper presents a novel framework, based on maximum likelihood, for training models to recognise simple spatial-motion events, such as those described by the verbs pick up, put down, push, pull, drop , and throw , and classifying novel observations into previously trained classes. The model that we employ does not presuppose prior recognition or tracking of 3D object pose, shape, or identity. We describe our general framework for using maximum-likelihood techniques for visual event classification, the details of the generative model that we use to characterise observations as instances of event types, and the implemented computational techniques used to support training and classification for this generative model. We conclude by illustrating the operation of our implementation on a small example.

Cite

Text

Siskind and Morris. "A Maximum-Likelihood Approach to Visual Event Classification." European Conference on Computer Vision, 1996. doi:10.1007/3-540-61123-1_152

Markdown

[Siskind and Morris. "A Maximum-Likelihood Approach to Visual Event Classification." European Conference on Computer Vision, 1996.](https://mlanthology.org/eccv/1996/siskind1996eccv-maximum/) doi:10.1007/3-540-61123-1_152

BibTeX

@inproceedings{siskind1996eccv-maximum,
  title     = {{A Maximum-Likelihood Approach to Visual Event Classification}},
  author    = {Siskind, Jeffrey Mark and Morris, Quaid},
  booktitle = {European Conference on Computer Vision},
  year      = {1996},
  pages     = {347-360},
  doi       = {10.1007/3-540-61123-1_152},
  url       = {https://mlanthology.org/eccv/1996/siskind1996eccv-maximum/}
}