VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos

Cite

Text

Gia et al. "VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Gia et al. "VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/gia2025cvprw-vrag/)

BibTeX

@inproceedings{gia2025cvprw-vrag,
  title     = {{VRAG: Retrieval-Augmented Video Question Answering for Long-Form Videos}},
  author    = {Gia, Bao Tran and Le, Khiem and Do, Tien and Mai, Tien-Dung and Ngo, Thanh Duc and Le, Duy-Dinh and Satoh, Shin'ichi},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {3689--3698},
  url       = {https://mlanthology.org/cvprw/2025/gia2025cvprw-vrag/}
}