Zhang, Junkai

10 publications

ICLR 2026 Chasing the Tail: Effective Rubric-Based Reward Modeling for Large Language Model Post-Training Junkai Zhang, Zihao Wang, Lin Gui, Swarnashree Mysore Sathyendra, Jaehwan Jeong, Victor Veitch, Wei Wang, Yunzhong He, Bing Liu, Lifeng Jin
ICLR 2026 WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization Zhengwei Tao, Jialong Wu, Wenbiao Yin, Pu Wu, Junkai Zhang, Baixuan Li, Haiyang Shen, Kuan Li, Liwen Zhang, Xinyu Wang, Wentao Zhang, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou
NeurIPS 2025 Bi-Level Knowledge Transfer for Multi-Task Multi-Agent Reinforcement Learning Junkai Zhang, Jinmin He, Yifan Zhang, Yifan Zang, Ning Xu, Jian Cheng
NeurIPS 2025 PUO-Bench: A Panel Understanding and Operation Benchmark with a Privacy-Preserving Framework Wei Lin, Yiwei Zhou, Junkai Zhang, Rui Shao, Zhiyuan Zhao, Junyu Gao, Antoni B. Chan, Xuelong Li
NeurIPS 2024 Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
AAAI 2024 Intrinsic Action Tendency Consistency for Cooperative Multi-Agent Reinforcement Learning Junkai Zhang, Yifan Zhang, Xi Sheryl Zhang, Yifan Zang, Jian Cheng
ICML 2024 Uncertainty-Aware Reward-Free Exploration with General Function Approximation Junkai Zhang, Weitong Zhang, Dongruo Zhou, Quanquan Gu
NeurIPSW 2023 Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-Agent Dynamical Systems Zijie Huang, Jeehyun Hwang, Junkai Zhang, Jinwoo Baik, Weitong Zhang, Dominik Wodarz, Yizhou Sun, Quanquan Gu, Wei Wang
ICML 2023 Optimal Horizon-Free Reward-Free Exploration for Linear Mixture MDPs Junkai Zhang, Weitong Zhang, Quanquan Gu
NeurIPS 2023 Why Does Sharpness-Aware Minimization Generalize Better than SGD? Zixiang Chen, Junkai Zhang, Yiwen Kou, Xiangning Chen, Cho-Jui Hsieh, Quanquan Gu