Wagner, Paul

2 publications

NeurIPS 2013. Optimistic Policy Iteration and Natural Actor-Critic: A Unifying View and a Non-Optimality Result. Paul Wagner.
NeurIPS 2011. A Reinterpretation of the Policy Oscillation Phenomenon in Approximate Policy Iteration. Paul Wagner.