Stochastic Actor-Executor-Critic for Image-to-Image Translation

Abstract

Training a model-free deep reinforcement learning model to solve image-to-image translation is difficult since it involves high-dimensional continuous state and action spaces. In this paper, we draw inspiration from the recent success of the maximum entropy reinforcement learning framework designed for challenging continuous control problems to develop stochastic policies over high dimensional continuous spaces including image representation, generation, and control simultaneously. Central to this method is the Stochastic Actor-Executor-Critic (SAEC) which is an off-policy actor-critic model with an additional executor to generate realistic images. Specifically, the actor focuses on the high-level representation and control policy by a stochastic latent action, as well as explicitly directs the executor to generate low-level actions to manipulate the state. Experiments on several image-to-image translation tasks have demonstrated the effectiveness and robustness of the proposed SAEC when facing high-dimensional continuous space problems.

Cite

Text

Luo et al. "Stochastic Actor-Executor-Critic for Image-to-Image Translation." International Joint Conference on Artificial Intelligence, 2021. doi:10.24963/IJCAI.2021/382

Markdown

[Luo et al. "Stochastic Actor-Executor-Critic for Image-to-Image Translation." International Joint Conference on Artificial Intelligence, 2021.](https://mlanthology.org/ijcai/2021/luo2021ijcai-stochastic/) doi:10.24963/IJCAI.2021/382

BibTeX

@inproceedings{luo2021ijcai-stochastic,
  title     = {{Stochastic Actor-Executor-Critic for Image-to-Image Translation}},
  author    = {Luo, Ziwei and Hu, Jing and Wang, Xin and Lyu, Siwei and Kong, Bin and Yin, Youbing and Song, Qi and Wu, Xi},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2021},
  pages     = {2775-2781},
  doi       = {10.24963/IJCAI.2021/382},
  url       = {https://mlanthology.org/ijcai/2021/luo2021ijcai-stochastic/}
}