Herman, Daniel

1 publication

ICLR 2023. Multi-Objective Reinforcement Learning: Convexity, Stationarity and Pareto Optimality. Haoye Lu, Daniel Herman, Yaoliang Yu.