Xu, Yaosheng

3 publications

ICML 2022 Reducing Variance in Temporal-Difference Value Estimation via Ensemble of Deep Networks Litian Liang, Yaosheng Xu, Stephen Mcaleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox
NeurIPSW 2021 Target Entropy Annealing for Discrete Soft Actor-Critic Yaosheng Xu, Dailin Hu, Litian Liang, Stephen Marcus McAleer, Pieter Abbeel, Roy Fox
NeurIPSW 2021 Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates Litian Liang, Yaosheng Xu, Stephen Marcus McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox