Pan, Yangchen

26 publications

JAIR 2025 An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip H. S. Torr
ICML 2025 PANDAS: Improving Many-Shot Jailbreaking via Positive Affirmation, Negative Demonstration, and Adaptive Sampling Avery Ma, Yangchen Pan, Amir-Massoud Farahmand
ICMLW 2024 An MRP Formulation for Supervised Learning: Generalized Temporal Difference Learning Models Yangchen Pan, Junfeng Wen, Chenjun Xiao, Philip Torr
ECCV 2024 Improving Adversarial Transferability via Model Alignment Avery Ma, Amir-massoud Farahmand, Yangchen Pan, Philip Torr, Jindong Gu
JMLR 2024 Label Alignment Regularization for Distribution Shift Ehsan Imani, Guojun Zhang, Runjia Li, Jun Luo, Pascal Poupart, Philip H.S. Torr, Yangchen Pan
ICML 2024 Position: Reinforcement Learning in Dynamic Treatment Regimes Needs Critical Reexamination Zhiyao Luo, Yangchen Pan, Peter Watkinson, Tingting Zhu
NeurIPS 2023 An Alternative to Variance: Gini Deviation for Risk-Averse Policy Gradient Yudong Luo, Guiliang Liu, Pascal Poupart, Yangchen Pan
UAI 2023 Conditionally Optimistic Exploration for Cooperative Deep Multi-Agent Reinforcement Learning Xutong Zhao, Yangchen Pan, Chenjun Xiao, Sarath Chandar, Janarthanan Rajendran
ICLR 2023 Greedy Actor-Critic: A New Conditional Cross-Entropy Method for Policy Improvement Samuel Neumann, Sungsu Lim, Ajin George Joseph, Yangchen Pan, Adam White, Martha White
TMLR 2023 Memory-Efficient Reinforcement Learning with Value-Based Knowledge Consolidation Qingfeng Lan, Yangchen Pan, Jun Luo, A. Rupam Mahmood
ICLR 2023 The In-Sample SoftMax for Offline Reinforcement Learning Chenjun Xiao, Han Wang, Yangchen Pan, Adam White, Martha White
TMLR 2023 Understanding the Robustness Difference Between Stochastic Gradient Descent and Adaptive Gradient Methods Avery Ma, Yangchen Pan, Amir-massoud Farahmand
AISTATS 2022 An Alternate Policy Gradient Estimator for SoftMax Policies Shivam Garg, Samuele Tosatto, Yangchen Pan, Martha White, Rupam Mahmood
UAI 2022 Understanding and Mitigating the Limitations of Prioritized Experience Replay Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand, Martha White, Hengshuai Yao, Mohsen Rohani, Jun Luo
ICLR 2021 Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online Yangchen Pan, Kirby Banman, Martha White
ICLR 2020 An Implicit Function Learning Approach for Parametric Modal Regression Yangchen Pan, Ehsan Imani, Martha White, Amir-massoud Farahmand
NeurIPS 2020 An Implicit Function Learning Approach for Parametric Modal Regression Yangchen Pan, Ehsan Imani, Amir-massoud Farahmand, Martha White
ICLR 2020 Frequency-Based Search-Control in Dyna Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand
ICLR 2020 Maxmin Q-Learning: Controlling the Estimation Bias of Q-Learning Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White
IJCAI 2019 Hill Climbing on Value Estimates for Search-Control in Dyna Yangchen Pan, Hengshuai Yao, Amir-massoud Farahmand, Martha White
IJCAI 2018 Organizing Experience: A Deeper Look at Replay Mechanisms for Sample-Based Planning in Continuous State Domains Yangchen Pan, Muhammad Zaheer, Adam White, Andrew Patterson, Martha White
ICML 2018 Reinforcement Learning with Function-Valued Action Spaces for Partial Differential Equation Control Yangchen Pan, Amir-massoud Farahmand, Martha White, Saleh Nabi, Piyush Grover, Daniel Nikovski
AAAI 2017 Accelerated Gradient Temporal Difference Learning Yangchen Pan, Adam White, Martha White
ICML 2017 Adapting Kernel Representations Online Using Submodular Maximization Matthew Schlegel, Yangchen Pan, Jiecao Chen, Martha White
UAI 2017 Effective Sketching Methods for Value Function Approximation Yangchen Pan, Erfan Sadeqi Azer, Martha White
IJCAI 2016 Incremental Truncated LSTD Clement Gehring, Yangchen Pan, Martha White