Sun, Zexu

9 publications

ICLR 2026 CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs Yongcheng Zeng, Zexu Sun, Bokai Ji, Erxue Min, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Haifeng Zhang, Xu Chen, Jun Wang
ICLR 2026 Prompt and Parameter Co-Optimization for Large Language Models Xiaohe Bo, Rui Li, Zexu Sun, Quanyu Dai, Zeyu Zhang, Zihang Tian, Xu Chen, Zhenhua Dong
ICLR 2026 Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents Heyang Gao, Zexu Sun, Erxue Min, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Xu Chen
ECML-PKDD 2025 Counterfactual Multi-Player Bandits for Explainable Recommendation Diversification Yansen Zhang, Bowei He, Xiaokun Zhang, Haolun Wu, Zexu Sun, Chen Ma
ICML 2025 Invariant Deep Uplift Modeling for Incentive Assignment in Online Marketing via Probability of Necessity and Sufficiency Zexu Sun, Qiyu Han, Hao Yang, Anpeng Wu, Minqin Zhu, Dugang Liu, Chen Ma, Yunpeng Weng, Xing Tang, Xiuqiang He
NeurIPS 2025 Learning to Focus: Causal Attention Distillation via Gradient‐Guided Token Pruning Yiju Guo, Wenkai Yang, Zexu Sun, Ning Ding, Zhiyuan Liu, Yankai Lin
ICML 2025 Rethinking Causal Ranking: A Balanced Perspective on Uplift Model Evaluation Minqin Zhu, Zexu Sun, Ruoxuan Xiong, Anpeng Wu, Baohong Li, Caizhi Tang, Jun Zhou, Fei Wu, Kun Kuang
ICLR 2025 Uncertainty and Influence Aware Reward Model Refinement for Reinforcement Learning from Human Feedback Zexu Sun, Yiju Guo, Yankai Lin, Xu Chen, Qi Qi, Xing Tang, Xiuqiang He, Ji-Rong Wen
NeurIPS 2023 Offline Imitation Learning with Variational Counterfactual Reasoning Zexu Sun, Bowei He, Jinxin Liu, Xu Chen, Chen Ma, Shuai Zhang