Photo-Realistic Neural Domain Randomization

Abstract

Synthetic data is a scalable alternative to manual supervision, but it requires overcoming the sim-to-real domain gap. This discrepancy between virtual and real worlds is addressed by two seemingly opposed approaches: improving the realism of simulation or foregoing realism entirely via domain randomization. In this paper, we show that the recent progress in neural rendering enables a new unified approach we call Photo-realistic Neural Domain Randomization (PNDR). We propose to learn a composition of neural networks that acts as a physics-based ray tracer generating high-quality renderings from scene geometry alone. Our approach is modular, composed of different neural networks for materials, lighting, and rendering, thus enabling randomization of different key image generation components in a differentiable pipeline. Once trained, our method can be combined with other methods and used to generate photo-realistic image augmentations online and significantly more efficiently than via traditional ray-tracing. We demonstrate the usefulness of PNDR through two downstream tasks: 6D object detection and monocular depth estimation. Our experiments show that training with PNDR enables generalization to novel scenes and significantly outperforms the state of the art in terms of real-world transfer.

Cite

Text

Zakharov et al. "Photo-Realistic Neural Domain Randomization." Proceedings of the European Conference on Computer Vision (ECCV), 2022. doi:10.1007/978-3-031-19806-9_18

Markdown

[Zakharov et al. "Photo-Realistic Neural Domain Randomization." Proceedings of the European Conference on Computer Vision (ECCV), 2022.](https://mlanthology.org/eccv/2022/zakharov2022eccv-photorealistic/) doi:10.1007/978-3-031-19806-9_18

BibTeX

@inproceedings{zakharov2022eccv-photorealistic,
  title     = {{Photo-Realistic Neural Domain Randomization}},
  author    = {Zakharov, Sergey and Ambruș, Rareș and Guizilini, Vitor and Kehl, Wadim and Gaidon, Adrien},
  booktitle = {Proceedings of the European Conference on Computer Vision (ECCV)},
  year      = {2022},
  doi       = {10.1007/978-3-031-19806-9_18},
  url       = {https://mlanthology.org/eccv/2022/zakharov2022eccv-photorealistic/}
}