RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation

Abstract

We present RLSS: a reinforcement learning algorithm for sequential scene generation. This is based on employing the proximal policy optimization (PPO) algorithm for generative problems. In particular, we consider how to effectively reduce the action space by including a greedy search algorithm in the learning process. Our experiments demonstrate that our method converges for a relatively large number of actions and learns to generate scenes with predefined design objectives. This approach is placing objects iteratively in the virtual scene. In each step, the network chooses which objects to place and selects positions which result in maximal reward. A high reward is assigned if the last action resulted in desired properties whereas the violation of constraints is penalized. We demonstrate the capability of our method to generate plausible and diverse scenes efficiently by solving indoor planning problems and generating Angry Birds levels.

Cite

Text

Ostonov et al. "RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation." Winter Conference on Applications of Computer Vision, 2022.

Markdown

[Ostonov et al. "RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation." Winter Conference on Applications of Computer Vision, 2022.](https://mlanthology.org/wacv/2022/ostonov2022wacv-rlss/)

BibTeX

@inproceedings{ostonov2022wacv-rlss,
  title     = {{RLSS: A Deep Reinforcement Learning Algorithm for Sequential Scene Generation}},
  author    = {Ostonov, Azimkhon and Wonka, Peter and Michels, Dominik L.},
  booktitle = {Winter Conference on Applications of Computer Vision},
  year      = {2022},
  pages     = {2219-2228},
  url       = {https://mlanthology.org/wacv/2022/ostonov2022wacv-rlss/}
}