RNA-Protein Interaction Classification via Sequence Embeddings

Abstract

RNA-protein interactions (RPI) are ubiquitous in cellular organisms and essential for gene regulation. In particular, protein interactions with non-coding RNAs (ncRNAs) play a critical role in these processes. Experimental analysis of RPIs is time-consuming and expensive, and existing computational methods rely on small and limited datasets. This work introduces $\textit{RNAInterAct}$, a comprehensive RPI dataset, alongside $\textit{RPIembeddor}$, a novel transformer-based model designed for classifying ncRNA-protein interactions. By leveraging two foundation models for sequence embedding, we incorporate essential structural and functional insights into our task. We demonstrate RPIembeddor's strong performance and generalization capability compared to state-of-the-art methods across different datasets and analyze the impact of the proposed embedding strategy on the performance in an ablation study.

Cite

Text

Matus et al. "RNA-Protein Interaction Classification via Sequence Embeddings." ICLR 2024 Workshops: GEM, 2024.

Markdown

[Matus et al. "RNA-Protein Interaction Classification via Sequence Embeddings." ICLR 2024 Workshops: GEM, 2024.](https://mlanthology.org/iclrw/2024/matus2024iclrw-rnaprotein/)

BibTeX

@inproceedings{matus2024iclrw-rnaprotein,
  title     = {{RNA-Protein Interaction Classification via Sequence Embeddings}},
  author    = {Matus, Dominika and Runge, Frederic and Franke, Jörg K.H. and Gerne, Lars and Uhl, Michael and Hutter, Frank and Backofen, Rolf},
  booktitle = {ICLR 2024 Workshops: GEM},
  year      = {2024},
  url       = {https://mlanthology.org/iclrw/2024/matus2024iclrw-rnaprotein/}
}