Zhou, Yifei

19 publications

ICLR 2025 Digi-Q: Learning VLM Q-Value Functions for Training Device-Control Agents Hao Bai, Yifei Zhou, Li Erran Li, Sergey Levine, Aviral Kumar
ICML 2025 Proposer-Agent-Evaluator (PAE): Autonomous Skill Discovery for Foundation Model Internet Agents Yifei Zhou, Qianlan Yang, Kaixiang Lin, Min Bai, Xiong Zhou, Yu-Xiong Wang, Sergey Levine, Li Erran Li
NeurIPS 2025 Self-Challenging Language Model Agents Yifei Zhou, Sergey Levine, Jason E Weston, Xian Li, Sainbayar Sukhbaatar
NeurIPS 2025 Thinking vs. Doing: Improving Agent Reasoning by Scaling Test-Time Interaction Junhong Shen, Hao Bai, Lunjun Zhang, Yifei Zhou, Amrith Setlur, Shengbang Tong, Diego Caples, Nan Jiang, Tong Zhang, Ameet Talwalkar, Aviral Kumar
NeurIPS 2024 Aligning Large Language Models with Representation Editing: A Control Perspective Lingkai Kong, Haorui Wang, Wenhao Mu, Yuanqi Du, Yuchen Zhuang, Yifei Zhou, Yue Song, Rongzhi Zhang, Kai Wang, Chao Zhang
ICMLW 2024 Aligning Large Language Models with Representation Editing: A Control Perspective Lingkai Kong, Haorui Wang, Wenhao Mu, Yuanqi Du, Yuchen Zhuang, Yifei Zhou, Yue Song, Rongzhi Zhang, Kai Wang, Chao Zhang
ICML 2024 ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Yifei Zhou, Andrea Zanette, Jiayi Pan, Sergey Levine, Aviral Kumar
ICLRW 2024 ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Yifei Zhou, Andrea Zanette, Jiayi Pan, Aviral Kumar, Sergey Levine
NeurIPS 2024 DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
ICMLW 2024 DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Yifei Zhou, Hao Bai, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
ICMLW 2024 DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
ICMLW 2024 DigiRL: Training In-the-Wild Device-Control Agents with Autonomous Reinforcement Learning Hao Bai, Yifei Zhou, Mert Cemri, Jiayi Pan, Alane Suhr, Sergey Levine, Aviral Kumar
NeurIPS 2024 Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Yuexiang Zhai, Hao Bai, Zipeng Lin, Jiayi Pan, Shengbang Tong, Yifei Zhou, Alane Suhr, Saining Xie, Yann LeCun, Yi Ma, Sergey Levine
ICLR 2024 Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees Yifei Zhou, Ayush Sekhari, Yuda Song, Wen Sun
ICCV 2023 BT^2: Backward-Compatible Training with Basis Transformation Yifei Zhou, Zilu Li, Abhinav Shrivastava, Hengshuang Zhao, Antonio Torralba, Taipeng Tian, Ser-Nam Lim
ICLR 2023 Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient Yuda Song, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun
NeurIPS 2023 Test-Time Distribution Normalization for Contrastively Learned Visual-Language Models Yifei Zhou, Juntao Ren, Fengyu Li, Ramin Zabih, Ser Nam Lim
NeurIPS 2022 GAPX: Generalized Autoregressive Paraphrase-Identification X Yifei Zhou, Renyu Li, Hayden Housen, Ser Nam Lim
NeurIPSW 2022 Hybrid RL: Using Both Offline and Online Data Can Make RL Efficient Yuda Song, Yifei Zhou, Ayush Sekhari, Drew Bagnell, Akshay Krishnamurthy, Wen Sun