Utilizing Semantic Word Similarity Measures for Video Retrieval

Abstract

This is a high level computer vision paper, which employs concepts from Natural Language Understanding in solving the video retrieval problem. Our main contribution is the utilization of the semantic word similarity measures (Lin and PMI-IR similarities) for video retrieval. In our approach, we use trained concept detectors, and the visual co-occurrence relations between such concepts. We propose two methods for content-based retrieval of videos: (1) A method for retrieving a <b xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">new</b> <b xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">concept</b> (a concept which is not known to the system, and no annotation is available) using semantic word similarity and visual co-occurrence. (2) A method for retrieval of videos based on their relevance to a user defined text query using the semantic word similarity and visual content of videos. For evaluation purposes, we have mainly used the automatic search and the high level feature extraction test set of TRECVIDpsila06 benchmark, and the automatic search test set of TRECVIDpsila07. These two data sets consist of 250 hours of multilingual news video captured from American, Arabic, German and Chinese TV channels. Although our method for retrieving a new concept is an unsupervised method, it outperforms the trained concept detectors (which are supervised) on 7 out of 20 test concepts, and overall it performs very close to the trained detectors. On the other hand, our visual content based semantic retrieval method performs 81% better than the text-based retrieval method. This shows that using visual content alone we can obtain significantly good retrieval results.

Cite

Text

Aytar et al. "Utilizing Semantic Word Similarity Measures for Video Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008. doi:10.1109/CVPR.2008.4587822

Markdown

[Aytar et al. "Utilizing Semantic Word Similarity Measures for Video Retrieval." IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2008.](https://mlanthology.org/cvpr/2008/aytar2008cvpr-utilizing/) doi:10.1109/CVPR.2008.4587822

BibTeX

@inproceedings{aytar2008cvpr-utilizing,
  title     = {{Utilizing Semantic Word Similarity Measures for Video Retrieval}},
  author    = {Aytar, Yusuf and Shah, Mubarak and Luo, Jiebo},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  year      = {2008},
  doi       = {10.1109/CVPR.2008.4587822},
  url       = {https://mlanthology.org/cvpr/2008/aytar2008cvpr-utilizing/}
}