Niu, Yazhe

10 publications

ICLRW 2025 Empowering LLMs in Decision Games Through Algorithmic Data Synthesis Haolin Wang, Xueyan Li, Yazhe Niu, Shuai Hu, Hongsheng Li
NeurIPS 2025 Hierachical Balance Packing: Towards Efficient Supervised Fine-Tuning for Long-Context LLM Yongqiang Yao, Jingru Tan, Kaihuan Liang, Feizhao Zhang, Jiahao Hu, Shuo Wu, Yazhe Niu, Ruihao Gong, Dahua Lin, Ningyi Xu
ICML 2025 OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniverse Computation Balance Yongqiang Yao, Jingru Tan, Feizhao Zhang, Jiahao Hu, Yazhe Niu, Jin Xin, Bo Li, Pengfei Liu, Ruihao Gong, Dahua Lin, Ningyi Xu
ICCV 2025 Pretrained Reversible Generation as Unsupervised Visual Representation Learning Rongkun Xue, Jinouwen Zhang, Yazhe Niu, Dazhong Shen, Bingqi Ma, Yu Liu, Jing Yang
TMLR 2025 UniZero: Generalized and Efficient Planning with Scalable Latent World Models Yuan Pu, Yazhe Niu, Zhenjie Yang, Jiyuan Ren, Hongsheng Li, Yu Liu
AAAI 2024 A Perspective of Q-Value Estimation on Offline-to-Online Reinforcement Learning Yinmin Zhang, Jie Liu, Chuming Li, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang
AAAI 2023 ACE: Cooperative Multi-Agent Q-Learning with Bidirectional Action-Dependency Chuming Li, Jie Liu, Yinmin Zhang, Yuhong Wei, Yazhe Niu, Yaodong Yang, Yu Liu, Wanli Ouyang
ICLR 2023 GoBigger: A Scalable Platform for Cooperative-Competitive Multi-Agent Interactive Simulation Ming Zhang, Shenghan Zhang, Zhenjie Yang, Lekai Chen, Jinliang Zheng, Chao Yang, Chuming Li, Hang Zhou, Yazhe Niu, Yu Liu
NeurIPS 2023 LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios Yazhe Niu, Yuan Pu, Zhenjie Yang, Xueyan Li, Tong Zhou, Jiyuan Ren, Shuai Hu, Hongsheng Li, Yu Liu
ICCVW 2019 AIM 2019 Challenge on Constrained Super-Resolution: Methods and Results Kai Zhang, Nan Nan, Chenghua Li, Xueyi Zou, Ning Kang, Zhan Wang, Hang Xu, Chaofeng Wang, Zheng Li, Linlin Wang, Jun Shi, Shuhang Gu, Wenyu Sun, Zhiqiang Lang, Jiangtao Nie, Wei Wei, Lei Zhang, Yazhe Niu, Peijin Zhuo, Xiangzhen Kong, Long Sun, Wenhao Wang, Radu Timofte, Zheng Hui, Xiumei Wang, Xinbo Gao, Dongliang Xiong, Shuai Liu, Ruipeng Gang