Unscented Autoencoder

Abstract

The Variational Autoencoder (VAE) is a seminal approach in deep generative modeling with latent variables. Interpreting its reconstruction process as a nonlinear transformation of samples from the latent posterior distribution, we apply the Unscented Transform (UT) – a well-known distribution approximation used in the Unscented Kalman Filter (UKF) from the field of filtering. A finite set of statistics called sigma points, sampled deterministically, provides a more informative and lower-variance posterior representation than the ubiquitous noise-scaling of the reparameterization trick, while ensuring higher-quality reconstruction. We further boost the performance by replacing the Kullback-Leibler (KL) divergence with the Wasserstein distribution metric that allows for a sharper posterior. Inspired by the two components, we derive a novel, deterministic-sampling flavor of the VAE, the Unscented Autoencoder (UAE), trained purely with regularization-like terms on the per-sample posterior. We empirically show competitive performance in Fréchet Inception Distance scores over closely-related models, in addition to a lower training variance than the VAE.

Cite

Text

Janjos et al. "Unscented Autoencoder." International Conference on Machine Learning, 2023.

Markdown

[Janjos et al. "Unscented Autoencoder." International Conference on Machine Learning, 2023.](https://mlanthology.org/icml/2023/janjos2023icml-unscented/)

BibTeX

@inproceedings{janjos2023icml-unscented,
  title     = {{Unscented Autoencoder}},
  author    = {Janjos, Faris and Rosenbaum, Lars and Dolgov, Maxim and Zoellner, J. Marius},
  booktitle = {International Conference on Machine Learning},
  year      = {2023},
  pages     = {14758-14779},
  volume    = {202},
  url       = {https://mlanthology.org/icml/2023/janjos2023icml-unscented/}
}