Interactive Video Retrieval with Dialog
Abstract
Recording videos is now quick and easy, and the quantity and availability of videos continue to increase; an effective video retrieval method has therefore become important. To retrieve a target video from a large collection, a video retrieval system needs to obtain appropriate queries from a user. For a given sentence query, many similar videos may be related to the query, so the system requires more information than the sentence alone to distinguish the target video from the others. If the system actively collects more information about the target video, retrieval can be performed effectively. We therefore propose a system that retrieves videos by asking questions about their content and leveraging the user’s responses together with the dialog history. We confirm the usefulness of the proposed system through experiments on the AVSD dataset, which includes videos and dialogs about the videos.
Cite
Text
Maeoki et al. "Interactive Video Retrieval with Dialog." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020. doi:10.1109/CVPRW50498.2020.00484
Markdown
[Maeoki et al. "Interactive Video Retrieval with Dialog." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2020.](https://mlanthology.org/cvprw/2020/maeoki2020cvprw-interactive/) doi:10.1109/CVPRW50498.2020.00484
BibTeX
@inproceedings{maeoki2020cvprw-interactive,
title = {{Interactive Video Retrieval with Dialog}},
author = {Maeoki, Sho and Uehara, Kohei and Harada, Tatsuya},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2020},
pages = {4091-4099},
doi = {10.1109/CVPRW50498.2020.00484},
url = {https://mlanthology.org/cvprw/2020/maeoki2020cvprw-interactive/}
}