Providing Video Annotations in Multimedia Containers for Visualization and Research

Abstract

There is an ever increasing amount of video data sets which comprise additional metadata, such as object labels, tagged events, or gaze data. Unfortunately, metadata are usually stored in separate files in custom-made data formats, which reduces accessibility even for experts and makes the data inaccessible for non-experts. Consequently, we still lack interfaces for many common use cases, such as visualization, streaming, data analysis, machine learning, high-level understanding and semantic web integration. To bridge this gap, we want to promote the use of existing multimedia container formats to establish a standardized method of incorporating content and metadata. This will facilitate visualization in standard multimedia players, streaming via the Internet, and easy use without conversion, as shown in the attached demonstration video and files. In two prototype implementations, we embed object labels, gaze data from eye-tracking and the corresponding video into a single multimedia container and visualize this data using a media player. Based on this prototype, we discuss the benefit of our approach as a possible standard. Finally, we argue for the inclusion of MPEG-7 in multimedia containers as a further improvement.

Cite

Text

Schöning et al. "Providing Video Annotations in Multimedia Containers for Visualization and Research." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017. doi:10.1109/WACV.2017.78

Markdown

[Schöning et al. "Providing Video Annotations in Multimedia Containers for Visualization and Research." IEEE/CVF Winter Conference on Applications of Computer Vision, 2017.](https://mlanthology.org/wacv/2017/schoning2017wacv-providing/) doi:10.1109/WACV.2017.78

BibTeX

@inproceedings{schoning2017wacv-providing,
  title     = {{Providing Video Annotations in Multimedia Containers for Visualization and Research}},
  author    = {Schöning, Julius and Faion, Patrick and Heidemann, Gunther and Krumnack, Ulf},
  booktitle = {IEEE/CVF Winter Conference on Applications of Computer Vision},
  year      = {2017},
  pages     = {650-659},
  doi       = {10.1109/WACV.2017.78},
  url       = {https://mlanthology.org/wacv/2017/schoning2017wacv-providing/}
}