Designing Active and Thermostable Enzymes with Sequence-Only Predictive Models

Abstract

Data-driven models of fitness can be useful in designing novel proteins with desired properties, but many questions remain regarding how and in what settings they should be used. Here, we ask: How can we use predictive models of protein fitness, whose predictions we might not always trust, to design protein sequences enhanced for multiple fitness functions? We propose a general approach for doing so, and apply it to design novel variants of eight different acylphosphatase and lysozyme wild types, intended to be more thermostable and at least as catalytically active as the wild types. Our method does not require a structure, experimental measurements of activity, curation of homologous sequences, or family-specific thermostability data. Experimental characterizations of our designed sequences, as well as sequences designed by PROSS, a competitive baseline method for improving protein thermostability, are currently underway and forthcoming.

Cite

Text

Fannjiang et al. "Designing Active and Thermostable Enzymes with Sequence-Only Predictive Models." NeurIPS 2022 Workshops: LMRL, 2022.

Markdown

[Fannjiang et al. "Designing Active and Thermostable Enzymes with Sequence-Only Predictive Models." NeurIPS 2022 Workshops: LMRL, 2022.](https://mlanthology.org/neuripsw/2022/fannjiang2022neuripsw-designing/)

BibTeX

@inproceedings{fannjiang2022neuripsw-designing,
  title     = {{Designing Active and Thermostable Enzymes with Sequence-Only Predictive Models}},
  author    = {Fannjiang, Clara and Olivas, Micah and Greene, Eric R. and Markin, Craig J. and Wallace, Bram and Krause, Ben and Pinney, Margaux M. and Fraser, James and Fordyce, Polly M and Madani, Ali and Naik, Nikhil},
  booktitle = {NeurIPS 2022 Workshops: LMRL},
  year      = {2022},
  url       = {https://mlanthology.org/neuripsw/2022/fannjiang2022neuripsw-designing/}
}