Shi, Ruizhe

9 publications

ICLR 2025. The Crucial Role of Samplers in Online Direct Preference Optimization. Ruizhe Shi, Runlong Zhou, Simon Shaolei Du
NeurIPS 2024. Decoding-Time Language Model Alignment with Multiple Objectives. Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon S. Du
ICMLW 2024. Decoding-Time Language Model Alignment with Multiple Objectives. Ruizhe Shi, Yifang Chen, Yushi Hu, Alisa Liu, Hannaneh Hajishirzi, Noah A. Smith, Simon Shaolei Du
ICML 2024. Rethinking Transformers in Solving POMDPs. Chenhao Lu, Ruizhe Shi, Yuyao Liu, Kaizhe Hu, Simon Shaolei Du, Huazhe Xu
NeurIPSW 2024. The Crucial Role of Samplers in Online Direct Preference Optimization. Ruizhe Shi, Runlong Zhou, Simon Shaolei Du
NeurIPSW 2024. The Crucial Role of Samplers in Online Direct Preference Optimization. Ruizhe Shi, Runlong Zhou, Simon Shaolei Du
ICLR 2024. Unleashing the Power of Pre-Trained Language Models for Offline Reinforcement Learning. Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon Shaolei Du, Huazhe Xu
NeurIPS 2023. H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation. Yanjie Ze, Yuyao Liu, Ruizhe Shi, Jiaxin Qin, Zhecheng Yuan, Jiashun Wang, Huazhe Xu
NeurIPSW 2023. Unleashing the Power of Pre-Trained Language Models for Offline Reinforcement Learning. Ruizhe Shi, Yuyao Liu, Yanjie Ze, Simon Shaolei Du, Huazhe Xu