Jang et al. "ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.
Markdown
[Jang et al. "ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/jang2025cvprw-ictqa/)
BibTeX
@inproceedings{jang2025cvprw-ictqa,
title = {{ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities}},
author = {Jang, Youngrok and Kong, Hyesoo and Kim, Gyeonghun and Lee, Yejin and Choi, Stanley Jungkyu and Bae, Kyunghoon},
booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
year = {2025},
pages = {138-148},
url = {https://mlanthology.org/cvprw/2025/jang2025cvprw-ictqa/}
}