Deep Fitness Inference for Drug Discovery with Directed Evolution

Abstract

Directed evolution, with iterated mutation and human-designed selection, is a powerful approach for drug discovery. Here, we establish a fitness inference problem given time series DNA sequencing data. We describe maximum likelihood solutions for the nonlinear dynamical system induced by fitness-based competition. Our approach learns from multiple time series rounds in a principled manner, in contrast to prior work focused on two-round enrichment prediction. While fitness inference does not require deep learning in principle, we show that inferring fitness while jointly learning a sequence-to-fitness transformer (DeepFitness) improves performance over a non-deep baseline, and a two-round enrichment baseline. Finally, we highlight how DeepFitness can improve the diversity of the discovered hits in a directed evolution experiment. (Non-archival paper removed at authors' request)

NeurIPSW OpenReview Semantic Scholar

Cite

Text

Diamant et al. "Deep Fitness Inference for Drug Discovery with Directed Evolution." NeurIPS 2022 Workshops: LMRL, 2022.

Markdown

[Diamant et al. "Deep Fitness Inference for Drug Discovery with Directed Evolution." NeurIPS 2022 Workshops: LMRL, 2022.](https://mlanthology.org/neuripsw/2022/diamant2022neuripsw-deep/)

BibTeX

@inproceedings{diamant2022neuripsw-deep,
  title     = {{Deep Fitness Inference for Drug Discovery with Directed Evolution}},
  author    = {Diamant, Nathaniel Lee and Lu, Ziqing and Helmling, Christina and Chuang, Kangway V and Cunningham, Christian and Biancalani, Tommaso and Scalia, Gabriele and Shen, Max W},
  booktitle = {NeurIPS 2022 Workshops: LMRL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/diamant2022neuripsw-deep/}
}