FactorSim: Generative Simulation via Factorized Representation

Abstract

Generating simulations to train intelligent agents in game-playing and robotics from natural language input, user input, or task documentation remains an open-ended challenge. Existing approaches focus on parts of this challenge, such as generating reward functions or task hyperparameters. Unlike previous work, we introduce FACTORSIM that generates full simulations in code from language input that can be used to train agents. Exploiting the structural modularity specific to coded simulations, we propose to use a factored partially observable Markov decision process representation that allows us to reduce context dependence during each step of the generation. For evaluation, we introduce a generative simulation benchmark that assesses the generated simulation code’s accuracy and effectiveness in facilitating zero-shot transfers in reinforcement learning settings. We show that FACTORSIM outperforms existing methods in generating simulations regarding prompt alignment (i.e., accuracy), zero-shot transfer abilities, and human evaluation. We also demonstrate its effectiveness in generating robotic tasks.

Cite

Text

Sun et al. "FactorSim: Generative Simulation via Factorized Representation." Neural Information Processing Systems, 2024. doi:10.52202/079017-2775

Markdown

[Sun et al. "FactorSim: Generative Simulation via Factorized Representation." Neural Information Processing Systems, 2024.](https://mlanthology.org/neurips/2024/sun2024neurips-factorsim/) doi:10.52202/079017-2775

BibTeX

@inproceedings{sun2024neurips-factorsim,
  title     = {{FactorSim: Generative Simulation via Factorized Representation}},
  author    = {Sun, Fan-Yun and Harini, S. I. and Yi, Angela and Zhou, Yihan and Zook, Alex and Tremblay, Jonathan and Cross, Logan and Wu, Jiajun and Haber, Nick},
  booktitle = {Neural Information Processing Systems},
  year      = {2024},
  doi       = {10.52202/079017-2775},
  url       = {https://mlanthology.org/neurips/2024/sun2024neurips-factorsim/}
}