Learning to Describe Scenes with Programs

Liu, Yunchao; Wu, Zheng; Ritchie, Daniel; Freeman, William T.; Tenenbaum, Joshua B.; Wu, Jiajun

Learning to Describe Scenes with Programs

Yunchao Liu, Zheng Wu, Daniel Ritchie, William T. Freeman, Joshua B. Tenenbaum, Jiajun Wu

ICLR 2019

/iclr/2019/liu2019iclr-learning-a/

Abstract

Human scene perception goes beyond recognizing a collection of objects and their pairwise relations. We understand higher-level, abstract regularities within the scene such as symmetry and repetition. Current vision recognition modules and scene representations fall short in this dimension. In this paper, we present scene programs, representing a scene via a symbolic program for its objects, attributes, and their relations. We also propose a model that infers such scene programs by exploiting a hierarchical, object-based scene representation. Experiments demonstrate that our model works well on synthetic data and transfers to real images with such compositional structure. The use of scene programs has enabled a number of applications, such as complex visual analogy-making and scene extrapolation.

PDF ICLR Semantic Scholar

Cite

Text

Liu et al. "Learning to Describe Scenes with Programs." International Conference on Learning Representations, 2019.

Markdown

[Liu et al. "Learning to Describe Scenes with Programs." International Conference on Learning Representations, 2019.](https://mlanthology.org/iclr/2019/liu2019iclr-learning-a/)

BibTeX

@inproceedings{liu2019iclr-learning-a,
  title     = {{Learning to Describe Scenes with Programs}},
  author    = {Liu, Yunchao and Wu, Zheng and Ritchie, Daniel and Freeman, William T. and Tenenbaum, Joshua B. and Wu, Jiajun},
  booktitle = {International Conference on Learning Representations},
  year      = {2019},
  url       = {https://mlanthology.org/iclr/2019/liu2019iclr-learning-a/}
}