Learning Disentangled Representations with the Wasserstein Autoencoder

Abstract

Disentangled representation learning has undoubtedly benefited from objective function surgery. However, a delicate balancing act of tuning is still required in order to trade off reconstruction fidelity versus disentanglement. Building on previous successes of penalizing the total correlation in the latent variables, we propose TCWAE (Total Correlation Wasserstein Autoencoder). Working in the WAE paradigm naturally enables the separation of the total-correlation term, thus providing disentanglement control over the learned representation, while offering more flexibility in the choice of reconstruction cost. We propose two variants using different KL estimators and perform extensive quantitative comparisons on data sets with known generative factors, showing competitive results relative to state-of-the-art techniques. We further study the trade off between disentanglement and reconstruction on more-difficult data sets with unknown generative factors, where the flexibility of the WAE paradigm in the reconstruction term improves reconstructions.

Cite

Text

Gaujac et al. "Learning Disentangled Representations with the Wasserstein Autoencoder." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021. doi:10.1007/978-3-030-86523-8_5

Markdown

[Gaujac et al. "Learning Disentangled Representations with the Wasserstein Autoencoder." European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2021.](https://mlanthology.org/ecmlpkdd/2021/gaujac2021ecmlpkdd-learning/) doi:10.1007/978-3-030-86523-8_5

BibTeX

@inproceedings{gaujac2021ecmlpkdd-learning,
  title     = {{Learning Disentangled Representations with the Wasserstein Autoencoder}},
  author    = {Gaujac, Benoit and Feige, Ilya and Barber, David},
  booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases},
  year      = {2021},
  pages     = {69-84},
  doi       = {10.1007/978-3-030-86523-8_5},
  url       = {https://mlanthology.org/ecmlpkdd/2021/gaujac2021ecmlpkdd-learning/}
}