ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Cite

Text

Baechler et al. "ScreenAI: A Vision-Language Model for UI and Infographics Understanding." International Joint Conference on Artificial Intelligence, 2024.

Markdown

[Baechler et al. "ScreenAI: A Vision-Language Model for UI and Infographics Understanding." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/baechler2024ijcai-screenai/)

BibTeX

@inproceedings{baechler2024ijcai-screenai,
  title     = {{ScreenAI: A Vision-Language Model for UI and Infographics Understanding}},
  author    = {Baechler, Gilles and Sunkara, Srinivas and Wang, Maria and Zubach, Fedir and Mansoor, Hassan and Etter, Vincent and Carbune, Victor and Lin, Jason and Chen, Jindong and Sharma, Abhanshu},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {3058-3068},
  url       = {https://mlanthology.org/ijcai/2024/baechler2024ijcai-screenai/}
}