Multi-Policy Grounding and Ensemble Policy Learning for Transfer Learning with Dynamics Mismatch
Abstract
We propose a new algorithm for transfer learning between tasks with different dynamics. The algorithm solves an Imitation from Observation (IfO) problem to ground the source environment to the target task, then learns an optimal policy in the grounded environment. The learned policy is deployed in the target task without additional training. A distinguishing feature of the algorithm is its use of multiple rollout policies during training, with the goal of grounding the environment more globally; hence the name Multi-Policy Grounding (MPG). The quality of the final policy is further enhanced via ensemble policy learning. We demonstrate the superiority of the proposed algorithm analytically and numerically. Numerical studies show that the multi-policy approach achieves grounding comparable to the single-policy approach with only a fraction of the target samples, so the algorithm maintains the quality of the obtained policy even as the number of interactions with the target environment becomes extremely small.
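The grounding step can be illustrated with a toy sketch. This is not the authors' implementation: it replaces the IfO-based grounding with a plain least-squares action transformer, and every name in it (`source_step`, `target_step`, `collect_target_transitions`, `fit_action_transformer`), the 1-D dynamics, and the three rollout policies are assumptions made up for illustration. It only shows the data flow of rolling out several policies in the target and fitting a correction so the source reproduces the observed target transitions. Because the toy mismatch is linear, it does not demonstrate any advantage of multiple policies over a single one, and ensemble policy learning is not covered.

```python
import numpy as np

rng = np.random.default_rng(0)


def source_step(s, a):
    # Hypothetical source (simulator) dynamics.
    return s + a


def target_step(s, a):
    # Hypothetical target dynamics: actions are only half as effective.
    return s + 0.5 * a


def collect_target_transitions(policies, steps_per_policy):
    # Roll out each policy in the target and record (s, a, s_next) tuples.
    data = []
    for policy in policies:
        s = 0.0
        for _ in range(steps_per_policy):
            a = policy(s) + 0.1 * rng.standard_normal()  # small exploration noise
            s_next = target_step(s, a)
            data.append((s, a, s_next))
            s = s_next
    return np.array(data)


def fit_action_transformer(data):
    # Fit a_grounded = w * a + b so that source_step(s, a_grounded)
    # matches the next state observed in the target.
    s, a, s_next = data.T
    needed = s_next - s  # action the source would need to reproduce each transition
    features = np.column_stack([a, np.ones_like(a)])
    w, b = np.linalg.lstsq(features, needed, rcond=None)[0]
    return w, b


# Three hypothetical rollout policies with different behavior.
policies = [
    lambda s: -0.5 * s + 1.0,
    lambda s: -0.2 * s - 1.0,
    lambda s: 0.3 - 0.8 * s,
]

data = collect_target_transitions(policies, steps_per_policy=20)
w, b = fit_action_transformer(data)


def grounded_step(s, a):
    # Source environment with the learned action correction applied.
    return source_step(s, w * a + b)
```

In this toy setting, a policy trained against `grounded_step` would see the same transitions as in the target, which is the role the grounded environment plays in the abstract.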
Cite
Text
Lee et al. "Multi-Policy Grounding and Ensemble Policy Learning for Transfer Learning with Dynamics Mismatch." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/440
Markdown
[Lee et al. "Multi-Policy Grounding and Ensemble Policy Learning for Transfer Learning with Dynamics Mismatch." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/lee2022ijcai-multi/) doi:10.24963/IJCAI.2022/440
BibTeX
@inproceedings{lee2022ijcai-multi,
title = {{Multi-Policy Grounding and Ensemble Policy Learning for Transfer Learning with Dynamics Mismatch}},
author = {Lee, Hyun-Rok and Sreenivasan, Ram Ananth and Jeong, Yeonjeong and Jang, Jongseong and Shim, Dongsub and Lee, Chi-Guhn},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2022},
pages = {3171--3177},
doi = {10.24963/IJCAI.2022/440},
url = {https://mlanthology.org/ijcai/2022/lee2022ijcai-multi/}
}