Spatio-Temporal Video Re-Localization by Warp LSTM
Abstract
The need for efficiently finding the video content a user wants is increasing because of the erupting of user-generated videos on the Web. Existing keyword-based or content-based video retrieval methods usually determine what occurs in a video but not when and where. In this paper, we make an answer to the question of when and where by formulating a new task, namely spatio-temporal video re-localization. Specifically, given a query video and a reference video, spatio-temporal video re-localization aims to localize tubelets in the reference video such that the tubelets semantically correspond to the query. To accurately localize the desired tubelets in the reference video, we propose a novel warp LSTM network, which propagates the spatio-temporal information for a long period and thereby captures the corresponding long-term dependencies. Another issue for spatio-temporal video re-localization is the lack of properly labeled video datasets. Therefore, we reorganize the videos in the AVA dataset to form a new dataset for spatio-temporal video re-localization research. Extensive experimental results show that the proposed model achieves superior performances over the designed baselines on the spatio-temporal video re-localization task.
Cite
Text
Feng et al. "Spatio-Temporal Video Re-Localization by Warp LSTM." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019. doi:10.1109/CVPR.2019.00138Markdown
[Feng et al. "Spatio-Temporal Video Re-Localization by Warp LSTM." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2019.](https://mlanthology.org/cvpr/2019/feng2019cvpr-spatiotemporal/) doi:10.1109/CVPR.2019.00138BibTeX
@inproceedings{feng2019cvpr-spatiotemporal,
title = {{Spatio-Temporal Video Re-Localization by Warp LSTM}},
author = {Feng, Yang and Ma, Lin and Liu, Wei and Luo, Jiebo},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
year = {2019},
doi = {10.1109/CVPR.2019.00138},
url = {https://mlanthology.org/cvpr/2019/feng2019cvpr-spatiotemporal/}
}