Learning Multi-Modal Densities on Discriminative Temporal Interaction Manifold for Group Activity Recognition

Abstract

While video-based activity analysis and recognition has received much attention, existing body of work mostly deals with single object/person case. Coordinated multi-object activities, or group activities, present in a variety of applications such as surveillance, sports, and biological monitoring records, etc., are the main focus of this paper. Unlike earlier attempts which model the complex spatial temporal constraints among multiple objects with a parametric Bayesian network, we propose a Discriminative Temporal Interaction Manifold (DTIM) framework as a data-driven strategy to characterize the group motion pattern without employing specific domain knowledge. In particular, we establish probability densities on the DTIM, whose element, the discriminative temporal interaction matrix, compactly describes the coordination and interaction among multiple objects in a group activity. For each class of group activity we learn a multi-modal density function on the DTIM. A Maximum a Posteriori (MAP) classifier on the manifold is then designed for recognizing new activities. Experiments on football play recognition demonstrate the effectiveness of the approach.

Cite

Text

Li et al. "Learning Multi-Modal Densities on Discriminative Temporal Interaction Manifold for Group Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009. doi:10.1109/CVPR.2009.5206676

Markdown

[Li et al. "Learning Multi-Modal Densities on Discriminative Temporal Interaction Manifold for Group Activity Recognition." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2009.](https://mlanthology.org/cvpr/2009/li2009cvpr-learning/) doi:10.1109/CVPR.2009.5206676

BibTeX

@inproceedings{li2009cvpr-learning,
  title     = {{Learning Multi-Modal Densities on Discriminative Temporal Interaction Manifold for Group Activity Recognition}},
  author    = {Li, Ruonan and Chellappa, Rama and Zhou, Shaohua Kevin},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2009},
  pages     = {2450-2457},
  doi       = {10.1109/CVPR.2009.5206676},
  url       = {https://mlanthology.org/cvpr/2009/li2009cvpr-learning/}
}