TRISHUL: Towards Region Identification and Screen Hierarchy Understanding for Large VLM Based GUI Agents

Cite

Text

Singh et al. "TRISHUL: Towards Region Identification and Screen Hierarchy Understanding for Large VLM Based GUI Agents." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.

Markdown

[Singh et al. "TRISHUL: Towards Region Identification and Screen Hierarchy Understanding for Large VLM Based GUI Agents." IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2025.](https://mlanthology.org/cvprw/2025/singh2025cvprw-trishul/)

BibTeX

@inproceedings{singh2025cvprw-trishul,
  title     = {{TRISHUL: Towards Region Identification and Screen Hierarchy Understanding for Large VLM Based GUI Agents}},
  author    = {Singh, Kunal and Singh, Shreyas and Khanna, Mukund},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops},
  year      = {2025},
  pages     = {170--179},
  url       = {https://mlanthology.org/cvprw/2025/singh2025cvprw-trishul/}
}