Camp: Combinatorial Engineering of Proteins
Abstract
Protein recombination has long been a key method in protein engineering to diver- sify and optimize sequences. We enhance and evolve this approach by using a pro- tein language model, where we found that when attention in the language model is represented as a spline, abrupt transitions in the spline identify optimal crossover sites for recombination. As we show, these sites also correlate with transitions between various secondary structure elements in the corresponding protein struc- ture. We use these sites to guide recombination of sequence blocks from diverse sources using MCMC sampling. Language models also enable generation of novel recombinant blocks beyond traditional MSAs, increasing diversity, while a direct preference optimization algorithm is used to fine-tune these blocks for reduced immunogenicity. This method integrates modern deep learning architectures with traditional protein engineering techniques to improve success rate of the libraries designed for wetlab verification.
Cite
Text
Ponnapati et al. "Camp: Combinatorial Engineering of Proteins." ICLR 2025 Workshops: GEM, 2025.Markdown
[Ponnapati et al. "Camp: Combinatorial Engineering of Proteins." ICLR 2025 Workshops: GEM, 2025.](https://mlanthology.org/iclrw/2025/ponnapati2025iclrw-camp/)BibTeX
@inproceedings{ponnapati2025iclrw-camp,
title = {{Camp: Combinatorial Engineering of Proteins}},
author = {Ponnapati, Manvitha and Sinha, Sapna and Lynch, Brian and Boyden, Edward and Jacobson, Joseph},
booktitle = {ICLR 2025 Workshops: GEM},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/ponnapati2025iclrw-camp/}
}