Yu, Zishun

10 publications

ICML 2025 Think Smarter Not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu, Tengyu Xu, Di Jin, Karthik Abinav Sankararaman, Yun He, Wenxuan Zhou, Zhouhao Zeng, Eryk Helenowski, Chen Zhu, Sinong Wang, Hao Ma, Han Fang
ICLRW 2025 Think Smarter Not Harder: Adaptive Reasoning with Inference Aware Optimization Zishun Yu, Tengyu Xu, Di Jin, Karthik Abinav Sankararaman, Yun He, Wenxuan Zhou, Zhouhao Zeng, Eryk Helenowski, Chen Zhu, Sinong Wang, Hao Ma, Han Fang
AAAI 2025 Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning Wenzhe Fan, Zishun Yu, Chengdong Ma, Changye Li, Yaodong Yang, Xinhua Zhang
ICLR 2024 $\mathcal{B}$-Coder: Value-Based Deep Reinforcement Learning for Program Synthesis Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang
UAI 2024 Offline Reward Perturbation Boosts Distributional Shift in Online RL Zishun Yu, Siteng Kang, Xinhua Zhang
ALT 2024 Slowly Changing Adversarial Bandit Algorithms Are Efficient for Discounted MDPs Ian A. Kash, Lev Reyzin, Zishun Yu
NeurIPSW 2023 $\mathcal{B}$-Coder: On Value-Based Deep Reinforcement Learning for Program Synthesis Zishun Yu, Yunzhe Tao, Liyu Chen, Tao Sun, Hongxia Yang
ICML 2023 Actor-Critic Alignment for Offline-to-Online Reinforcement Learning Zishun Yu, Xinhua Zhang
NeurIPS 2022 Certifying Robust Graph Classification Under Orthogonal Gromov-Wasserstein Threats Hongwei Jin, Zishun Yu, Xinhua Zhang
UAI 2022 Orthogonal Gromov-Wasserstein Discrepancy with Efficient Lower Bound Hongwei Jin, Zishun Yu, Xinhua Zhang