Wang, Huaijie

3 publications

JMLR 2025 BitNet: 1-Bit Pre-Training for Large Language Models Hongyu Wang, Shuming Ma, Lingxiao Ma, Lei Wang, Wenhui Wang, Li Dong, Shaohan Huang, Huaijie Wang, Jilong Xue, Ruiping Wang, Yi Wu, Furu Wei
ICLRW 2025 Offline Reinforcement Learning for LLM Multi-Step Reasoning Huaijie Wang, Shibo Hao, Hanze Dong, Shenao Zhang, Yilin Bao, Ziran Yang, Yi Wu
NeurIPS 2022 Grounded Reinforcement Learning: Learning to Win the Game Under Human Commands Shusheng Xu, Huaijie Wang, Yi Wu