Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding
Abstract
Recent years have witnessed the boom of online sharing media contents, which raise significant challenges in effective management and retrieval. Though a large amount of efforts have been made, precise retrieval on video shots with certain topics has been largely ignored. At the same time, due to the popularity of novel time-sync comments, or so-called "bullet-screen comments", video semantics could be now combined with timestamps to support further research on temporal video labeling. In this paper, we propose a novel video understanding framework to assign temporal labels on highlighted video shots. To be specific, due to the informal expression of bullet-screen comments, we first propose a temporal deep structured semantic model (T-DSSM) to represent comments into semantic vectors by taking advantage of their temporal correlation. Then, video highlights are recognized and labeled via semantic vectors in a supervised way. Extensive experiments on a real-world dataset prove that our framework could effectively label video highlights with a significant margin compared with baselines, which clearly validates the potential of our framework on video understanding, as well as bullet-screen comments interpretation.
Cite
Text
Lv et al. "Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding." AAAI Conference on Artificial Intelligence, 2016. doi:10.1609/AAAI.V30I1.10383Markdown
[Lv et al. "Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding." AAAI Conference on Artificial Intelligence, 2016.](https://mlanthology.org/aaai/2016/lv2016aaai-reading/) doi:10.1609/AAAI.V30I1.10383BibTeX
@inproceedings{lv2016aaai-reading,
title = {{Reading the Videos: Temporal Labeling for Crowdsourced Time-Sync Videos Based on Semantic Embedding}},
author = {Lv, Guangyi and Xu, Tong and Chen, Enhong and Liu, Qi and Zheng, Yi},
booktitle = {AAAI Conference on Artificial Intelligence},
year = {2016},
pages = {3000-3006},
doi = {10.1609/AAAI.V30I1.10383},
url = {https://mlanthology.org/aaai/2016/lv2016aaai-reading/}
}