Wang, Qinghao

3 publications

ICLR 2026. RiskPO: Risk-Based Policy Optimization with Verifiable Reward for LLM Post-Training. Tao Ren, Jinyang Jiang, Hui Yang, Wan Tian, Minhao Zou, Guanghao Li, Zishi Zhang, Qinghao Wang, Shentao Qin, Yanjun Zhao, Rui Tao, Hui Shao, Yijie Peng
TMLR 2025. A Survey on Large Language Model-Based Social Agents in Game-Theoretic Scenarios. Xiachong Feng, Longxu Dou, Minzhi Li, Qinghao Wang, Yu Guo, Haochuan Wang, Chang Ma, Lingpeng Kong
NeurIPSW 2024. FPGA-Gym: An FPGA-Accelerated Reinforcement Learning Environment Simulation Framework. Jiayi Li, Hongxiao Zhao, Wenshuo Yue, Yihan Fu, Daijing Shi, Anjunyi Fan, Qinghao Wang, Yaodong Yang, Bonan Yan