Multimodal Cell-Free DNA Embeddings Are Informative for Early Cancer Detection

Abstract

Cell-free DNA is a promising biomarker for early cancer detection, as it circulates in the blood and can be extracted non-invasively. However, methods of analysing the genetic and epigenetic patterns present in cell-free DNA are outdated, and fail to fully capture the wealth of biological information contained within these molecules. We present a Transformer based deep learning model that combines the three distinct modalities contained within cell-free DNA: epigenetic information in the form of DNA methylation patterns, genetic sequence, and cell-free DNA fragment length. After training on publicly available data, we demonstrate our model can accurately distinguish liver cancer patients using cell-free DNA samples alone. We demonstrate model generalisability by accurate classification of liver cancer patients from entirely distinct patient cohorts. Finally, we show that the vector embeddings of cell-free DNA learnt by this multimodal deep-learning model are biologically informative, and may help shed light on the origins and aetiology of this elusive bio-molecule.

Cite

Text

Jackson. "Multimodal Cell-Free DNA Embeddings Are Informative for Early Cancer Detection." NeurIPS 2022 Workshops: LMRL, 2022.

Markdown

[Jackson. "Multimodal Cell-Free DNA Embeddings Are Informative for Early Cancer Detection." NeurIPS 2022 Workshops: LMRL, 2022.](https://mlanthology.org/neuripsw/2022/jackson2022neuripsw-multimodal/)

BibTeX

@inproceedings{jackson2022neuripsw-multimodal,
  title     = {{Multimodal Cell-Free DNA Embeddings Are Informative for Early Cancer Detection}},
  author    = {Jackson, Felix},
  booktitle = {NeurIPS 2022 Workshops: LMRL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/jackson2022neuripsw-multimodal/}
}