Enhancing Generative Perturbation Models with LLM-Informed Gene Embeddings
Abstract
Genetic perturbations are key to understanding how genes regulate cell behavior, yet the ability to predict responses to these perturbations remains a significant challenge. While numerous generative models have been developed for perturbation data, they typically lack the capability to generalize to perturbations not encountered during training. To alleviate this limitation, we introduce a novel methodology that incorporates prior knowledge through embeddings derived from Large Language Models (LLMs), effectively informing our predictive models with a deeper biological context. By leveraging this source of pre-existing information, our models achieve state-of-the-art performance in predicting the outcomes of single-gene perturbations.
Cite
Text
Märtens et al. "Enhancing Generative Perturbation Models with LLM-Informed Gene Embeddings." ICLR 2024 Workshops: MLGenX, 2024.Markdown
[Märtens et al. "Enhancing Generative Perturbation Models with LLM-Informed Gene Embeddings." ICLR 2024 Workshops: MLGenX, 2024.](https://mlanthology.org/iclrw/2024/martens2024iclrw-enhancing/)BibTeX
@inproceedings{martens2024iclrw-enhancing,
title = {{Enhancing Generative Perturbation Models with LLM-Informed Gene Embeddings}},
author = {Märtens, Kaspar and Donovan-Maiye, Rory and Ferkinghoff-Borg, Jesper},
booktitle = {ICLR 2024 Workshops: MLGenX},
year = {2024},
url = {https://mlanthology.org/iclrw/2024/martens2024iclrw-enhancing/}
}