ML Anthology
Zhai, Yuanzhao
4 publications
AAAI 2025
Correcting Large Language Model Behavior via Influence Function
Han Zhang, Zhuo Zhang, Yi Zhang, Yuanzhao Zhai, Hanyang Peng, Yu Lei, Yue Yu, Hui Wang, Bin Liang, Lin Gui, Ruifeng Xu
AAAI 2025
Enhancing Decision-Making for LLM Agents via Step-Level Q-Value Models
Yuanzhao Zhai, Tingkai Yang, Kele Xu, Dawei Feng, Cheng Yang, Bo Ding, Huaimin Wang
ICML 2024
Iterative Regularized Policy Optimization with Imperfect Demonstrations
Gong Xudong, Feng Dawei, Kele Xu, Yuanzhao Zhai, Chengkang Yao, Weijia Wang, Bo Ding, Huaimin Wang
AAAI 2024
Optimistic Model Rollouts for Pessimistic Offline Policy Optimization
Yuanzhao Zhai, Yiying Li, Zijian Gao, Xudong Gong, Kele Xu, Dawei Feng, Bo Ding, Huaimin Wang