ML Anthology
Wang, Huaijie
3 publications
JMLR
2025
BitNet: 1-Bit Pre-Training for Large Language Models
Hongyu Wang, Shuming Ma, Lingxiao Ma, Lei Wang, Wenhui Wang, Li Dong, Shaohan Huang, Huaijie Wang, Jilong Xue, Ruiping Wang, Yi Wu, Furu Wei
ICLRW
2025
Offline Reinforcement Learning for LLM Multi-Step Reasoning
Huaijie Wang, Shibo Hao, Hanze Dong, Shenao Zhang, Yilin Bao, Ziran Yang, Yi Wu
NeurIPS
2022
Grounded Reinforcement Learning: Learning to Win the Game Under Human Commands
Shusheng Xu, Huaijie Wang, Yi Wu