Open-Ended Long-Form Video Question Answering via Adaptive Hierarchical Reinforced Networks

Abstract

Open-ended long-form video question answering is challenging problem in visual information retrieval, which automatically generates the natural language answer from the referenced long-form video content according to the question. However, the existing video question answering works mainly focus on the short-form video question answering, due to the lack of modeling the semantic representation of long-form video contents. In this paper, we consider the problem of long-form video question answering from the viewpoint of adaptive hierarchical reinforced encoder-decoder network learning. We propose the adaptive hierarchical encoder network to learn the joint representation of the long-form video contents according to the question with adaptive video segmentation. we then develop the reinforced decoder network to generate the natural language answer for open-ended video question answering. We construct a large-scale long-form video question answering dataset. The extensive experiments show the effectiveness of our method.

Cite

Text

Zhao et al. "Open-Ended Long-Form Video Question Answering via Adaptive Hierarchical Reinforced Networks." International Joint Conference on Artificial Intelligence, 2018. doi:10.24963/IJCAI.2018/512

Markdown

[Zhao et al. "Open-Ended Long-Form Video Question Answering via Adaptive Hierarchical Reinforced Networks." International Joint Conference on Artificial Intelligence, 2018.](https://mlanthology.org/ijcai/2018/zhao2018ijcai-open/) doi:10.24963/IJCAI.2018/512

BibTeX

@inproceedings{zhao2018ijcai-open,
  title     = {{Open-Ended Long-Form Video Question Answering via Adaptive Hierarchical Reinforced Networks}},
  author    = {Zhao, Zhou and Zhang, Zhu and Xiao, Shuwen and Yu, Zhou and Yu, Jun and Cai, Deng and Wu, Fei and Zhuang, Yueting},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2018},
  pages     = {3683-3689},
  doi       = {10.24963/IJCAI.2018/512},
  url       = {https://mlanthology.org/ijcai/2018/zhao2018ijcai-open/}
}