ScreenAgent: A Vision Language Model-Driven Computer Control Agent

Cite

Text

Niu et al. "ScreenAgent: A Vision Language Model-Driven Computer Control Agent." International Joint Conference on Artificial Intelligence, 2024.

Markdown

[Niu et al. "ScreenAgent: A Vision Language Model-Driven Computer Control Agent." International Joint Conference on Artificial Intelligence, 2024.](https://mlanthology.org/ijcai/2024/niu2024ijcai-screenagent/)

BibTeX

@inproceedings{niu2024ijcai-screenagent,
  title     = {{ScreenAgent: A Vision Language Model-Driven Computer Control Agent}},
  author    = {Niu, Runliang and Li, Jindong and Wang, Shiqi and Fu, Yali and Hu, Xiyu and Leng, Xueyuan and Kong, He and Chang, Yi and Wang, Qi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2024},
  pages     = {6433--6441},
  url       = {https://mlanthology.org/ijcai/2024/niu2024ijcai-screenagent/}
}