Longitudinal Variational Autoencoder for Compositional Data Analysis

Abstract

The analysis of compositional longitudinal data, particularly in microbiome time-series, is a challenging task due to its high-dimensional, sparse, and compositional nature. In this paper, we introduce a novel Gaussian process (GP) prior variational autoencoder for longitudinal data analysis with a multinomial likelihood (MNLVAE) that is specifically designed for compositional time-series analysis. Our generative deep learning model captures complex interactions among microbial taxa while accounting for the compositional structure of the data. We utilize centered log-ratio (CLR) and isometric log-ratio (ILR) transformations to preprocess and transform compositional count data, and utilize a latent multi-output additive GP model to enable prediction of future observations. Our experiments demonstrate that MNLVAE outperforms competing method, offering improved prediction performance across different longitudinal microbiome datasets.

Cite

Text

Öğretir et al. "Longitudinal Variational Autoencoder for Compositional Data Analysis." ICML 2023 Workshops: IMLH, 2023.

Markdown

[Öğretir et al. "Longitudinal Variational Autoencoder for Compositional Data Analysis." ICML 2023 Workshops: IMLH, 2023.](https://mlanthology.org/icmlw/2023/ogretir2023icmlw-longitudinal/)

BibTeX

@inproceedings{ogretir2023icmlw-longitudinal,
  title     = {{Longitudinal Variational Autoencoder for Compositional Data Analysis}},
  author    = {Öğretir, Mine and Lähdesmäki, Harri and Norton, Jamie},
  booktitle = {ICML 2023 Workshops: IMLH},
  year      = {2023},
  url       = {https://mlanthology.org/icmlw/2023/ogretir2023icmlw-longitudinal/}
}