LatentDE: Latent-Based Directed Evolution Accelerated by Gradient Ascent for Protein Sequence Design

Abstract

Directed evolution has been the most effective method for protein engineering that optimizes biological functionalities through a resource-intensive process of screening or selecting among a vast range of mutations. To mitigate this extensive procedure, recent advancements in machine learning-guided methodologies center around the establishment of a surrogate sequence-function model. In this paper, we propose Latent-based Directed Evolution (LDE), an evolutionary algorithm designed to prioritize the exploration of high-fitness mutants in the latent space. At its core, LDE is a regularized variational autoencoder (VAE), harnessing the capabilities of the state-of-the-art Protein Language Model (pLM), ESM-2, to construct a meaningful latent space of sequences. From this encoded representation, we present a novel approach for efficient traversal on the fitness landscape, employing a combination of gradient-based methods and directed evolution. Experimental evaluations conducted on eight protein sequence design tasks demonstrate the superior performance of our proposed LDE over previous baseline algorithms. We public our code at https://github.com/HySonLab/LatentDE.

Cite

Text

Tran et al. "LatentDE: Latent-Based Directed Evolution Accelerated by Gradient Ascent for Protein Sequence Design." NeurIPS 2024 Workshops: AIDrugX, 2024.

Markdown

[Tran et al. "LatentDE: Latent-Based Directed Evolution Accelerated by Gradient Ascent for Protein Sequence Design." NeurIPS 2024 Workshops: AIDrugX, 2024.](https://mlanthology.org/neuripsw/2024/tran2024neuripsw-latentde-a/)

BibTeX

@inproceedings{tran2024neuripsw-latentde-a,
  title     = {{LatentDE: Latent-Based Directed Evolution Accelerated by Gradient Ascent for Protein Sequence Design}},
  author    = {Tran, Thanh V. T. and Ngo, Nhat Khang and Nguyen, Viet Thanh Duy and Hy, Truong Son},
  booktitle = {NeurIPS 2024 Workshops: AIDrugX},
  year      = {2024},
  url       = {https://mlanthology.org/neuripsw/2024/tran2024neuripsw-latentde-a/}
}