A Computational Approach to Semantic Event Detection

Abstract

We propose a three-level video event detection algorithm and apply it to animal hunt detection in wildlife documentaries. The first level extracts texture, color and motion features, and detects motion blobs. The mid-level employs a neural network to verify whether the motion blobs belong to objects of interest. This level also generates shot summaries in terms of intermediate-level descriptors which combine low-level features from the first level and contain results of mid-level, domain specific inferences made on the basis of shot features. The shot summaries are then used by a domain-specific inference process at the third level to detect the video segments that contain events of interest, e.g., hunts. Event based video indexing, summarization and browsing are among the applications of the proposed approach.

Cite

Text

Qian et al. "A Computational Approach to Semantic Event Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1999. doi:10.1109/CVPR.1999.786939

Markdown

[Qian et al. "A Computational Approach to Semantic Event Detection." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 1999.](https://mlanthology.org/cvpr/1999/qian1999cvpr-computational/) doi:10.1109/CVPR.1999.786939

BibTeX

@inproceedings{qian1999cvpr-computational,
  title     = {{A Computational Approach to Semantic Event Detection}},
  author    = {Qian, Richard J. and Haering, Niels C. and Sezan, M. Ibrahim},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {1999},
  pages     = {1200-1206},
  doi       = {10.1109/CVPR.1999.786939},
  url       = {https://mlanthology.org/cvpr/1999/qian1999cvpr-computational/}
}