SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images

Cite

Text

Tanaka et al. "SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images." AAAI Conference on Artificial Intelligence, 2023. doi:10.1609/AAAI.V37I11.26598

Markdown

[Tanaka et al. "SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images." AAAI Conference on Artificial Intelligence, 2023.](https://mlanthology.org/aaai/2023/tanaka2023aaai-slidevqa/) doi:10.1609/AAAI.V37I11.26598

BibTeX

@inproceedings{tanaka2023aaai-slidevqa,
  title     = {{SlideVQA: A Dataset for Document Visual Question Answering on Multiple Images}},
  author    = {Tanaka, Ryota and Nishida, Kyosuke and Nishida, Kosuke and Hasegawa, Taku and Saito, Itsumi and Saito, Kuniko},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2023},
  pages     = {13636-13645},
  doi       = {10.1609/AAAI.V37I11.26598},
  url       = {https://mlanthology.org/aaai/2023/tanaka2023aaai-slidevqa/}
}