Structured Identity Mapping Learning as a Model for Compositional Generalization in Generative Models

Yongyi Yang, Core Francisco Park, Ekdeep Singh Lubana, Maya Okawa, Wei Hu, Hidenori Tanaka

NeurIPSW 2024

/neuripsw/2024/yang2024neuripsw-structured/

Abstract

Multi-modal generative models demonstrate complex concept learning dynamics, initially learning individual concepts and later recombining them in novel ways not present in the training data. Despite the practical importance of understanding the causal mechanisms underlying these learning dynamics, our theoretical understanding remains limited. In this work, we aim to bridge this gap by systematically analyzing the learning dynamics of a simplified model: a one-hidden-layer network learning the identity map, with a training set composed of Gaussian point clouds non-uniformly distributed in space. We argue that a simple yet describe model of multi-modal generative model is the task of learning identity mapping.

PDF NeurIPSW OpenReview Semantic Scholar

Cite

Text

Yang et al. "Structured Identity Mapping Learning as a Model for Compositional Generalization in Generative Models." NeurIPS 2024 Workshops: SciForDL, 2024.

Markdown

[Yang et al. "Structured Identity Mapping Learning as a Model for Compositional Generalization in Generative Models." NeurIPS 2024 Workshops: SciForDL, 2024.](https://mlanthology.org/neuripsw/2024/yang2024neuripsw-structured/)

BibTeX

@inproceedings{yang2024neuripsw-structured,
  title     = {{Structured Identity Mapping Learning as a Model for Compositional Generalization in Generative Models}},
  author    = {Yang, Yongyi and Park, Core Francisco and Lubana, Ekdeep Singh and Okawa, Maya and Hu, Wei and Tanaka, Hidenori},
  booktitle = {NeurIPS 2024 Workshops: SciForDL},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/yang2024neuripsw-structured/}
}