Martyanov, Stepan

1 publications

NeurIPS 2022 Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis Anton Plaksin, Stepan Martyanov