Tian et al. "Audio-Visual Interpretable and Controllable Video Captioning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.
Markdown
[Tian et al. "Audio-Visual Interpretable and Controllable Video Captioning." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2019.](https://mlanthology.org/cvprw/2019/tian2019cvprw-audiovisual-a/)
BibTeX
@inproceedings{tian2019cvprw-audiovisual-a,
title = {{Audio-Visual Interpretable and Controllable Video Captioning}},
author = {Tian, Yapeng and Guan, Chenxiao and Goodman, Justin and Moore, Marc and Xu, Chenliang},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2019},
pages = {9-12},
url = {https://mlanthology.org/cvprw/2019/tian2019cvprw-audiovisual-a/}
}