Gia et al. "VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.
Markdown
[Gia et al. "VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/gia2025cvprw-vrag/)
BibTeX
@inproceedings{gia2025cvprw-vrag,
title = {{VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos}},
author = {Gia, Bao Tran and Le, Khiem and Do, Tien and Mai, Tien-Dung and Ngo, Thanh Duc and Le, Duy-Dinh and Satoh, Shin'ichi},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2025},
pages = {3689-3698},
url = {https://mlanthology.org/cvprw/2025/gia2025cvprw-vrag/}
}