UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-Based Mobile GUI Agents

Abstract

In this paper, we introduce UI-Genie, a self-improving framework addressing two key challenges in GUI agents: verification of trajectory outcome is challenging and high-quality training data are not scalable. These challenges are addressed by a reward model and a self-improving pipeline, respectively. The reward model, UI-Genie-RM, features an image-text interleaved architecture that efficiently processes historical context and unifies action-level and task-level rewards. To support the training of UI-Genie-RM, we develop deliberately-designed data generation strategies including rule-based verification, controlled trajectory corruption, and hard negative mining. To address the second challenge, a self-improvement pipeline progressively expands solvable complex GUI tasks by enhancing both the agent and reward models through reward-guided exploration and outcome verification in dynamic environments. For training the model, we generate UI-Genie-RM-517k and UI-Genie-Agent-16k, establishing the first reward-specific dataset for GUI agents while demonstrating high-quality synthetic trajectory generation without manual annotation. Experimental results show that UI-Genie achieves state-of-the-art performance across multiple GUI agent benchmarks with three generations of data-model self-improvement. We open-source our complete framework implementation and generated datasets to facilitate further research in https://github.com/Euphoria16/UI-Genie.

Cite

Text

Xiao et al. "UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-Based Mobile GUI Agents." Advances in Neural Information Processing Systems, 2025.

Markdown

[Xiao et al. "UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-Based Mobile GUI Agents." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/xiao2025neurips-uigenie/)

BibTeX

@inproceedings{xiao2025neurips-uigenie,
  title     = {{UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-Based Mobile GUI Agents}},
  author    = {Xiao, Han and Wang, Guozhi and Chai, Yuxiang and Lu, Zimu and Lin, Weifeng and He, Hao and Fan, Lue and Bian, Liuyang and Hu, Rui and Liu, Liang and Ren, Shuai and Wen, Yafei and Chen, Xiaoxin and Zhou, Aojun and Li, Hongsheng},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/xiao2025neurips-uigenie/}
}