Zhou, Zhaoyi

5 publications

ICML 2025 Accelerating Unbiased LLM Evaluation via Synthetic Feedback Zhaoyi Zhou, Yuda Song, Andrea Zanette
ICLRW 2025 Accelerating Unbiased LLM Evaluation via Synthetic Feedback Zhaoyi Zhou, Yuda Song, Andrea Zanette
ICLR 2024 Free from Bellman Completeness: Trajectory Stitching via Model-Based Return-Conditioned Supervised Learning Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du
UAI 2023 Convergence Rates for Localized Actor-Critic in Networked Markov Potential Games Zhaoyi Zhou, Zaiwei Chen, Yiheng Lin, Adam Wierman
NeurIPSW 2023 Free from Bellman Completeness: Trajectory Stitching via Model-Based Return-Conditioned Supervised Learning Zhaoyi Zhou, Chuning Zhu, Runlong Zhou, Qiwen Cui, Abhishek Gupta, Simon Shaolei Du