Weltevrede, Max

1 publication

NeurIPS 2025. How Ensembles of Distilled Policies Improve Generalisation in Reinforcement Learning. Max Weltevrede, Moritz Akiya Zanger, Matthijs T. J. Spaan, Wendelin Boehmer