PPI-Llama2: De Novo Generation of Binding Proteins Conditioned on Target Sequence Alone

Abstract

The targeting of disease-driving proteins is a critical goal of biomedicine. However, many of these proteins do not possess accessible small molecule binding pockets and are oftentimes conformationally disordered, precluding binder design via structure-dependent methods. Here, we present PPI-Llama2, which tasks Meta's Llama2 autoregressive language model architecture to de novo generate protein binders conditioned directly on target sequences. Without relying on structural data and training only on protein-protein interaction (PPI) sequences, PPI-Llama2 effectively learns the evolutionary semantics of PPIs, enabling the generation of both novel and biologically-plausible binders. Comparative evaluations highlight PPI-Llama2's performance in generating binders for evolutionarily distant targets, performing strongly against structure-dependent methods like RFDiffusion. In total, our findings showcase PPI-Llama2's potential to aid therapeutic discovery for diseases driven by undruggable and disordered target proteins, and motivate further experimental screening efforts.

Cite

Text

Zhang et al. "PPI-Llama2: De Novo Generation of Binding Proteins Conditioned on Target Sequence Alone." ICLR 2024 Workshops: GEM, 2024.

Markdown

[Zhang et al. "PPI-Llama2: De Novo Generation of Binding Proteins Conditioned on Target Sequence Alone." ICLR 2024 Workshops: GEM, 2024.](https://mlanthology.org/iclrw/2024/zhang2024iclrw-ppillama2/)

BibTeX

@inproceedings{zhang2024iclrw-ppillama2,
  title     = {{PPI-Llama2: De Novo Generation of Binding Proteins Conditioned on Target Sequence Alone}},
  author    = {Zhang, Yinuo and He, Phil and Hsu, Ashley and Vincoff, Sophia and Chatterjee, Pranam},
  booktitle = {ICLR 2024 Workshops: GEM},
  year      = {2024},
  url       = {https://mlanthology.org/iclrw/2024/zhang2024iclrw-ppillama2/}
}