Guanjunjiang

3 publications

ICLR 2026 Bringing Stability to Diffusion: Decomposing and Reducing Variance of Training Masked Diffusion Models Mengni Jia, Mengyu Zhou, Yihao Liu, Xiaoxi Jiang, Guanjunjiang
ICLR 2026 Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance Zhuo Li, Pengyu Cheng, Zhechao Yu, FeifeiTong, Anningzhe Gao, Tsung-Hui Chang, Xiang Wan, Erchao.Zec, Xiaoxi Jiang, Guanjunjiang
ICLR 2026 Search Self-Play: Pushing the Frontier of Agent Capability Without Supervision Hongliang Lu, Yuhang Wen, Pengyu Cheng, Ruijin Ding, Jiaqi Guo, Haotian Xu, Chutian Wang, Haonan Chen, Xiaoxi Jiang, Guanjunjiang