Sun, Yanchao

33 publications

NeurIPS 2025 Checklists Are Better than Reward Models for Aligning Language Models Vijay Viswanathan, Yanchao Sun, Xiang Kong, Meng Cao, Graham Neubig, Tongshuang Wu
AISTATS 2025 Statistical Guarantees for Lifelong Reinforcement Learning Using PAC-Bayes Theory Zhi Zhang, Chris Chow, Yasi Zhang, Yanchao Sun, Haochen Zhang, Eric Hanchen Jiang, Han Liu, Furong Huang, Yuchen Cui, Oscar Hernan Madrid Padilla
ICLR 2025 TIS-DPO: Token-Level Importance Sampling for Direct Preference Optimization with Estimated Weights Aiwei Liu, Haoping Bai, Zhiyun Lu, Yanchao Sun, Xiang Kong, Xiaoming Simon Wang, Jiulong Shan, Albin Madappally Jose, Xiaojiang Liu, Lijie Wen, Philip S. Yu, Meng Cao
ICML 2024 Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies Towards Equal Long-Term Benefit Rate Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang
ICLR 2024 Beyond Worst-Case Attacks: Robust RL with Adaptive Defense via Non-Dominated Policies Xiangyu Liu, Chenghao Deng, Yanchao Sun, Yongyuan Liang, Furong Huang
ICLR 2024 COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL Xiyao Wang, Ruijie Zheng, Yanchao Sun, Ruonan Jia, Wichayaporn Wongkamjan, Huazhe Xu, Furong Huang
ICLR 2024 Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Yongyuan Liang, Yanchao Sun, Ruijie Zheng, Xiangyu Liu, Benjamin Eysenbach, Tuomas Sandholm, Furong Huang, Stephen Marcus McAleer
ICLR 2024 Rethinking Adversarial Policies: A Generalized Attack Formulation and Provable Defense in RL Xiangyu Liu, Souradip Chakraborty, Yanchao Sun, Furong Huang
NeurIPS 2024 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models Yuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, Furong Huang
ICLRW 2024 Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models Yuancheng Xu, Jiarui Yao, Manli Shu, Yanchao Sun, Zichu Wu, Ning Yu, Tom Goldstein, Furong Huang
NeurIPS 2023 $\texttt{TACO}$: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning Ruijie Zheng, Xiyao Wang, Yanchao Sun, Shuang Ma, Jieyu Zhao, Huazhe Xu, Hal Daumé Iii, Furong Huang
NeurIPSW 2023 Beyond Worst-Case Attacks: Robust RL with Adaptive Defense via Non-Dominated Policies Xiangyu Liu, Chenghao Deng, Yanchao Sun, Yongyuan Liang, Furong Huang
NeurIPSW 2023 COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL Xiyao Wang, Ruijie Zheng, Yanchao Sun, Ruonan Jia, Wichayaporn Wongkamjan, Huazhe Xu, Furong Huang
ICLR 2023 Certifiably Robust Policy Learning Against Adversarial Multi-Agent Communication Yanchao Sun, Ruijie Zheng, Parisa Hassanzadeh, Yongyuan Liang, Soheil Feizi, Sumitra Ganesh, Furong Huang
ICMLW 2023 Equal Long-Term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang
ICLR 2023 Exploring and Exploiting Decision Boundary Dynamics for Adversarial Robustness Yuancheng Xu, Yanchao Sun, Micah Goldblum, Tom Goldstein, Furong Huang
ICMLW 2023 Game-Theoretic Robust Reinforcement Learning Handles Temporally-Coupled Perturbations Yongyuan Liang, Yanchao Sun, Ruijie Zheng, Xiangyu Liu, Tuomas Sandholm, Furong Huang, Stephen Marcus McAleer
ICCV 2023 Is Imitation All You Need? Generalized Decision-Making with Dual-Phase Training Yao Wei, Yanchao Sun, Ruijie Zheng, Sai Vemprala, Rogerio Bonatti, Shuhang Chen, Ratnesh Madaan, Zhongjie Ba, Ashish Kapoor, Shuang Ma
NeurIPS 2023 Learning Generalizable Agents via Saliency-Guided Features Decorrelation Sili Huang, Yanchao Sun, Jifeng Hu, Siyuan Guo, Hechang Chen, Yi Chang, Lichao Sun, Bo Yang
NeurIPSW 2023 O3D: Offline Data-Driven Discovery and Distillation for Sequential Decision-Making with Large Language Models Yuchen Xiao, Yanchao Sun, Mengda Xu, Udari Madhushani, Jared Vann, Deepeka Garg, Sumitra Ganesh
NeurIPSW 2023 Robustness to Multi-Modal Environment Uncertainty in MARL Using Curriculum Learning Aakriti Agrawal, Rohith Aralikatti, Yanchao Sun, Furong Huang
ICLR 2023 SMART: Self-Supervised Multi-Task pretrAining with contRol Transformers Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor
NeurIPS 2022 Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach Kaiwen Yang, Yanchao Sun, Jiahao Su, Fengxiang He, Xinmei Tian, Furong Huang, Tianyi Zhou, Dacheng Tao
NeurIPS 2022 Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning Jifeng Hu, Yanchao Sun, Hechang Chen, Sili Huang, Haiyin Piao, Yi Chang, Lichao Sun
NeurIPS 2022 Efficient Adversarial Training Without Attacking: Worst-Case-Aware Robust Reinforcement Learning Yongyuan Liang, Yanchao Sun, Ruijie Zheng, Furong Huang
NeurIPSW 2022 SMART: Self-Supervised Multi-Task pretrAining with contRol Transformers Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor
ICLR 2022 Transfer RL Across Observation Feature Spaces via Model-Based Regularization Yanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew E Cohen, Furong Huang
ICLR 2022 Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL Yanchao Sun, Ruijie Zheng, Yongyuan Liang, Furong Huang
AAAI 2021 TempLe: Learning Template of Transitions for Sample Efficient Multi-Task RL Yanchao Sun, Xiangyu Yin, Furong Huang
NeurIPSW 2021 Transfer RL Across Observation Feature Spaces via Model-Based Regularization Yanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew E Cohen, Furong Huang
ICLR 2021 Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics Yanchao Sun, Da Huo, Furong Huang
NeurIPSW 2021 Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL Yanchao Sun, Ruijie Zheng, Yongyuan Liang, Furong Huang
AISTATS 2020 Understanding Generalization in Deep Learning via Tensor Methods Jingling Li, Yanchao Sun, Jiahao Su, Taiji Suzuki, Furong Huang