What’s Important Here?: Opportunities and Challenges of LLM in Retrieving Information from Web Interface

Abstract

Large language models (LLMs) that have been trained on large corpus of codes exhibit a remarkable ability to understand HTML code [1]. As web interfaces are mainly constructed using HTML, we designed an in-depth study to see how the code understanding ability of LLMs can be used to retrieve and locate important elements for a user given query (i.e. task description) in web interface. In contrast with prior works, which primarily focused on autonomous web navigation, we decompose the problem as an even atomic operation - Can LLMs find out the important information in the web page for a user given query? This decomposition enables us to scrutinize the current capabilities of LLMs and uncover the opportunities and challenges they present. Our empirical experiments show that the LLMs exhibit a reasonable level of competence, there is still a substantial room for improvement. We hope our investigation will inspire follow-up works in overcoming the current challenges in this domain.

Cite

Text

Huq et al. "What’s Important Here?: Opportunities and Challenges of LLM in Retrieving Information from Web Interface." NeurIPS 2023 Workshops: R0-FoMo, 2023.

Markdown

[Huq et al. "What’s Important Here?: Opportunities and Challenges of LLM in Retrieving Information from Web Interface." NeurIPS 2023 Workshops: R0-FoMo, 2023.](https://mlanthology.org/neuripsw/2023/huq2023neuripsw-whats/)

BibTeX

@inproceedings{huq2023neuripsw-whats,
  title     = {{What’s Important Here?: Opportunities and Challenges of LLM in Retrieving Information from Web Interface}},
  author    = {Huq, Faria and Bigham, Jeffrey P. and Martelaro, Nikolas},
  booktitle = {NeurIPS 2023 Workshops: R0-FoMo},
  year      = {2023},
  url       = {https://mlanthology.org/neuripsw/2023/huq2023neuripsw-whats/}
}