ML Anthology
Authors
Search
About
Yang, Long
15 publications
ICML
2025
Low-Dimension-to-High-Dimension Generalization and Its Implications for Length Generalization
Yang Chen
,
Long Yang
,
Yitao Liang
,
Zhouchen Lin
CoRL
2025
UniTac2Pose: A Unified Approach Learned in Simulation for Category-Level Visuotactile In-Hand Pose Estimation
Mingdong Wu
,
Long Yang
,
Jin Liu
,
Weiyao Huang
,
Lehong Wu
,
Zelin Chen
,
Daolin Ma
,
Hao Dong
IJCAI
2024
FlagVNE: A Flexible and Generalizable Reinforcement Learning Framework for Network Resource Allocation
Tianfu Wang
,
Qilin Fan
,
Chao Wang
,
Long Yang
,
Leilei Ding
,
Nicholas Jing Yuan
,
Hui Xiong
ICML
2024
Langevin Policy for Safe Reinforcement Learning
Fenghao Lei
,
Long Yang
,
Shiting Wen
,
Zhixiong Huang
,
Zhiwang Zhang
,
Chaoyi Pang
NeurIPS
2024
Optimizing over Multiple Distributions Under Generalized Quasar-Convexity Condition
Shihong Ding
,
Long Yang
,
Luo Luo
,
Cong Fang
AAAI
2023
Augmented Proximal Policy Optimization for Safe Reinforcement Learning
Juntao Dai
,
Jiaming Ji
,
Long Yang
,
Qian Zheng
,
Gang Pan
NeurIPS
2023
VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning
Jiayi Guan
,
Guang Chen
,
Jiaming Ji
,
Long Yang
,
Ao Zhou
,
Zhijun Li
,
Changjun Jiang
COLT
2023
Zeroth-Order Optimization with Weak Dimension Dependency
Pengyun Yue
,
Long Yang
,
Cong Fang
,
Zhouchen Lin
NeurIPS
2022
Constrained Update Projection Approach to Safe Policy Optimization
Long Yang
,
Jiaming Ji
,
Juntao Dai
,
Linrui Zhang
,
Binbin Zhou
,
Pengfei Li
,
Yaodong Yang
,
Gang Pan
IJCAI
2022
Penalized Proximal Policy Optimization for Safe Reinforcement Learning
Linrui Zhang
,
Li Shen
,
Long Yang
,
Shixiang Chen
,
Xueqian Wang
,
Bo Yuan
,
Dacheng Tao
AAAI
2022
Policy Optimization with Stochastic Mirror Descent
Long Yang
,
Yu Zhang
,
Gang Zheng
,
Qian Zheng
,
Pengfei Li
,
Jianhang Huang
,
Gang Pan
AAAI
2021
On Convergence of Gradient Expected Sarsa(λ)
Long Yang
,
Gang Zheng
,
Yu Zhang
,
Qian Zheng
,
Pengfei Li
,
Gang Pan
AAAI
2021
Sample Complexity of Policy Gradient Finding Second-Order Stationary Points
Long Yang
,
Qian Zheng
,
Gang Pan
IJCAI
2018
A Unified Approach for Multi-Step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning
Long Yang
,
Minhao Shi
,
Qian Zheng
,
Wenjia Meng
,
Gang Pan
CVPR
2017
Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context
Qingan Yan
,
Long Yang
,
Ling Zhang
,
Chunxia Xiao