Understanding Protein-DNA Interactions by Paying Attention to Protein and Genomics Foundation Models
Abstract
Protein-nucleic acid (NA) interactions are key in controlling gene regulation. There lies a strong motivation in understanding these interactions, with a goal of engineering these interactions to solve biological problems. Current methods to quantify protein-nucleic acids are mainly experimental and require much time and money. To mitigate this, Deep learning methods have recently been applied to predict Protein-DNA contacts. Although promising, these methods are computationally expensive and face challenges in accuracy. To address these challenges, we propose Seq2Contact, a novel method to predict the protein-NA binding at a single nucleotide (DNA) and single amino acid (Protein) level. Seq2Contact is built on protein and DNA foundation models to obtain nucleotide and amino acid-specific embeddings and then introduces a cross-attention module to obtain the binding contact maps. We employ a sequence-similarity-based clustering method to split the train-test data and empirically illustrate that Seq2Contact can achieve state-of-the-art performance, beating existing baselines by almost 20% (F1-Score) for Protein-NA binding prediction. Our method is computationally more efficient, with up to 80% less memory cost and more than 90% less inference time. Code is available at https://github.com/DhruvaRajwade/Seq2Contact
Cite
Text
Rajwade et al. "Understanding Protein-DNA Interactions by Paying Attention to Protein and Genomics Foundation Models." NeurIPS 2024 Workshops: AIDrugX, 2024.Markdown
[Rajwade et al. "Understanding Protein-DNA Interactions by Paying Attention to Protein and Genomics Foundation Models." NeurIPS 2024 Workshops: AIDrugX, 2024.](https://mlanthology.org/neuripsw/2024/rajwade2024neuripsw-understanding/)BibTeX
@inproceedings{rajwade2024neuripsw-understanding,
title = {{Understanding Protein-DNA Interactions by Paying Attention to Protein and Genomics Foundation Models}},
author = {Rajwade, Dhruva and Wang, Erica and Satpathy, Aryan and Brace, Alexander and Guo, Hongyu and Ramanathan, Arvind and Liu, Shengchao and Anandkumar, Anima},
booktitle = {NeurIPS 2024 Workshops: AIDrugX},
year = {2024},
url = {https://mlanthology.org/neuripsw/2024/rajwade2024neuripsw-understanding/}
}