P-Net: A Representation for Partially-Sequenced, Multi-Stream Activity

Abstract

In this paper, we devise a Propagation Net (P-Net) as a new mechanism for the representation and recognition of multi-stream activity. Most of daily activities can be represented by temporally partial ordered intervals where each interval has not only temporal constraint, i.e., before/after/duration, but also a logical relationship such as a and b both must happen. P-Net associates a node for each interval that is probabilistically triggered function dependent upon the state of its parent nodes. Each node is also associated with an observation distribution function that associates perceptual evidence. This evidence, generated by lower level vision modules, is a positive indicator of the elemental action. Using this architecture, we devise an iterative temporal sequencing algorithm that interprets a multi-dimensional observation sequence of visual evidence as a multi-stream propagation through the P-Net. Simple vision and motion-capture data experiments demonstrate the capabilities of our algorithm.

Cite

Text

Shi and Bobick. "P-Net: A Representation for Partially-Sequenced, Multi-Stream Activity." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2003. doi:10.1109/CVPRW.2003.10037

Markdown

[Shi and Bobick. "P-Net: A Representation for Partially-Sequenced, Multi-Stream Activity." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2003.](https://mlanthology.org/cvprw/2003/shi2003cvprw-pnet/) doi:10.1109/CVPRW.2003.10037

BibTeX

@inproceedings{shi2003cvprw-pnet,
  title     = {{P-Net: A Representation for Partially-Sequenced, Multi-Stream Activity}},
  author    = {Shi, Yifan and Bobick, Aaron F.},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2003},
  pages     = {40},
  doi       = {10.1109/CVPRW.2003.10037},
  url       = {https://mlanthology.org/cvprw/2003/shi2003cvprw-pnet/}
}