Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery for Foundation Model Internet Agents
Abstract
A generalist foundation model agent needs to have a large and diverse skill repertoire, such as finding directions between two travel locations and buying specific items from the Internet. If each skill needs to be specified manually through a fixed set of human-annotated instructions, the agent’s skill repertoire will necessarily be limited due to the scalability of human-annotated instructions. In this work, we address this challenge by proposing Proposer-Agent-Evaluator (PAE), an effective learning system that enables foundation model agents to autonomously discover and practice skills in the wild. After a context-aware task proposer generates instructions based on website information, the agent policy attempts those tasks in the real world with resulting trajectories evaluated by an autonomous VLM-based success evaluator. The success evaluation serves as the reward signal for the agent to refine its policies through RL. We validate PAE on challenging vision-based web navigation, using both real-world and selfhosted websites from WebVoyager and WebArena. Our results show that PAE significantly improves the zero-shot generalization capability of VLM Internet agents (around 50% relative improvement) to both unseen tasks and websites.
Cite
Text
Zhou et al. "Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery for Foundation Model Internet Agents." Proceedings of the 42nd International Conference on Machine Learning, 2025.Markdown
[Zhou et al. "Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery for Foundation Model Internet Agents." Proceedings of the 42nd International Conference on Machine Learning, 2025.](https://mlanthology.org/icml/2025/zhou2025icml-proposeragentevaluator/)BibTeX
@inproceedings{zhou2025icml-proposeragentevaluator,
title = {{Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery for Foundation Model Internet Agents}},
author = {Zhou, Yifei and Yang, Qianlan and Lin, Kaixiang and Bai, Min and Zhou, Xiong and Wang, Yu-Xiong and Levine, Sergey and Li, Li Erran},
booktitle = {Proceedings of the 42nd International Conference on Machine Learning},
year = {2025},
pages = {79490-79528},
volume = {267},
url = {https://mlanthology.org/icml/2025/zhou2025icml-proposeragentevaluator/}
}