Machine Learning Derived Embeddings of Bulk Multi-Omics Data Enable Clinically Significant Representations in a Pan-Cancer Cohort
Abstract
Bulk multiomics data provides a comprehensive view of tissue biology, but datasets rarely contain matched transcriptomics and chromatin accessibility data for a given sample. Furthermore, it is difficult to identify relevant genetic signatures from the high-dimensional, sparse representations provided by omics modalities. Machine learning (ML) models have the ability to extract dense, information-rich, denoised representations from omics data, which facilitate finding novel genetic signatures. To this end, we develop and compare generative ML models through an evaluation framework that examines the biological and clinical relevance of the underlying latent embeddings produced. We focus our analysis on pan-cancer multiomics data from a set of 21 diverse cancer metacohorts across three datasets. We additionally investigate if our framework can generate robust representations from oncology imaging modalities (i.e. histopathology slides). Our best performing models learn clinical and biological signals and show improved performance over traditional baselines in our evaluations, including overall survival prediction.
Cite
Text
Nagaraj et al. "Machine Learning Derived Embeddings of Bulk Multi-Omics Data Enable Clinically Significant Representations in a Pan-Cancer Cohort." NeurIPS 2023 Workshops: GenBio, 2023.Markdown
[Nagaraj et al. "Machine Learning Derived Embeddings of Bulk Multi-Omics Data Enable Clinically Significant Representations in a Pan-Cancer Cohort." NeurIPS 2023 Workshops: GenBio, 2023.](https://mlanthology.org/neuripsw/2023/nagaraj2023neuripsw-machine/)BibTeX
@inproceedings{nagaraj2023neuripsw-machine,
title = {{Machine Learning Derived Embeddings of Bulk Multi-Omics Data Enable Clinically Significant Representations in a Pan-Cancer Cohort}},
author = {Nagaraj, Sanjay and McCaw, Zachary R and Karaletsos, Theofanis and Koller, Daphne and Shcherbina, Anna},
booktitle = {NeurIPS 2023 Workshops: GenBio},
year = {2023},
url = {https://mlanthology.org/neuripsw/2023/nagaraj2023neuripsw-machine/}
}