Learning Dynamics in Linear VAE: Posterior Collapse Threshold, Superfluous Latent Space Pitfalls, and Speedup with KL Annealing

Ichikawa, Yuma; Hukushima, Koji

AISTATS 2024 pp. 1936-1944

Abstract

Variational autoencoders (VAEs) face a notorious problem in which the variational posterior often aligns closely with the prior, a phenomenon known as posterior collapse, which degrades the quality of representation learning. To mitigate this problem, an adjustable hyperparameter $\beta$ and a strategy for annealing this parameter, called KL annealing, have been proposed. This study presents a theoretical analysis of the learning dynamics in a minimal VAE. It is rigorously proved that the dynamics converge to a deterministic process in the limit of large input dimensions, thereby enabling a detailed dynamical analysis of the generalization error. Furthermore, the analysis shows that the VAE initially learns entangled representations and gradually acquires disentangled representations. A fixed-point analysis of the deterministic process reveals that when $\beta$ exceeds a certain threshold, posterior collapse becomes inevitable regardless of the learning period. Additionally, latent variables that are superfluous relative to the data-generative factors lead to overfitting of the background noise, which adversely affects both generalization and learning convergence. The analysis further reveals that appropriately tuned KL annealing can accelerate convergence.
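As a rough illustration of the two ingredients named in the abstract, the sketch below shows a $\beta$-weighted VAE objective together with a linear KL annealing schedule. It is a generic sketch and not the paper's setup: the function names, the linear ramp, and the `warmup_steps` parameter are assumptions made for illustration, and the schedule analyzed in the paper may differ.

```python
import math


def linear_kl_anneal(step, warmup_steps, beta_final=1.0, beta_start=0.0):
    """Hypothetical schedule: ramp beta linearly from beta_start to
    beta_final over warmup_steps, then hold it at beta_final."""
    if warmup_steps <= 0:
        return beta_final
    frac = min(max(step / warmup_steps, 0.0), 1.0)
    return beta_start + frac * (beta_final - beta_start)


def gaussian_kl_to_standard_normal(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ) for a single sample.

    mu and log_var are equal-length sequences of floats."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var)
    )


def beta_vae_loss(recon_error, mu, log_var, beta):
    """Reconstruction error plus the beta-weighted KL term."""
    return recon_error + beta * gaussian_kl_to_standard_normal(mu, log_var)


# Usage: beta rises during warm-up, so the KL penalty is weak early on.
for step in (0, 50, 100, 200):
    beta = linear_kl_anneal(step, warmup_steps=100)
    loss = beta_vae_loss(recon_error=2.0, mu=[1.0], log_var=[0.0], beta=beta)
    print(step, beta, loss)
```

The KL term is zero exactly when the posterior parameters match the standard normal prior (`mu` all zero, `log_var` all zero), which is the collapsed state the abstract describes. A larger `beta` penalizes any departure from that state more heavily.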

Cite

Text

Ichikawa and Hukushima. "Learning Dynamics in Linear VAE: Posterior Collapse Threshold, Superfluous Latent Space Pitfalls, and Speedup with KL Annealing." Artificial Intelligence and Statistics, 2024.

Markdown

[Ichikawa and Hukushima. "Learning Dynamics in Linear VAE: Posterior Collapse Threshold, Superfluous Latent Space Pitfalls, and Speedup with KL Annealing." Artificial Intelligence and Statistics, 2024.](https://mlanthology.org/aistats/2024/ichikawa2024aistats-learning/)

BibTeX

@inproceedings{ichikawa2024aistats-learning,
  title     = {{Learning Dynamics in Linear VAE: Posterior Collapse Threshold, Superfluous Latent Space Pitfalls, and Speedup with KL Annealing}},
  author    = {Ichikawa, Yuma and Hukushima, Koji},
  booktitle = {Artificial Intelligence and Statistics},
  year      = {2024},
  pages     = {1936-1944},
  volume    = {238},
  url       = {https://mlanthology.org/aistats/2024/ichikawa2024aistats-learning/}
}