ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities

Cite

Text

Jang et al. "ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Jang et al. "ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/jang2025cvprw-ictqa/)

BibTeX

@inproceedings{jang2025cvprw-ictqa,
  title     = {{ICT-QA: Question Answering over Multi-Modal Contexts Including Image, Chart, and Text Modalities}},
  author    = {Jang, Youngrok and Kong, Hyesoo and Kim, Gyeonghun and Lee, Yejin and Choi, Stanley Jungkyu and Bae, Kyunghoon},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {138--148},
  url       = {https://mlanthology.org/cvprw/2025/jang2025cvprw-ictqa/}
}