Wu, Shuo

2 publications

NeurIPS 2025 Hierarchical Balance Packing: Towards Efficient Supervised Fine-Tuning for Long-Context LLM Yongqiang Yao, Jingru Tan, Kaihuan Liang, Feizhao Zhang, Jiahao Hu, Shuo Wu, Yazhe Niu, Ruihao Gong, Dahua Lin, Ningyi Xu
JAIR 2025 Robust Reward Design for Markov Decision Processes Shuo Wu, Haoxiang Ma, Jie Fu, Shuo Han