Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction
Abstract
Accurate prediction of RNA branch points is critical for understanding splicing mechanisms and identifying variants that may lead to genetic diseases. Despite their biological importance, few computational methods have been developed for reliably identifying branch points. In this work, we fine-tune several RNA language models for branch point prediction. The top-performing model, ERNIE-RNA, achieved an $F_1$ score of 0.811, a sequence accuracy of 0.790, and an average precision score of 0.868, outperforming previous leading models. These results showcase the potential of RNA-specific language models in capturing the subtle sequence features relevant to splicing. Our findings suggest that extended training and hyperparameter tuning could yield additional performance gains, positioning this study as a strong baseline for future research in RNA splicing.
Cite
Text
Ruiz et al. "Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction." ICLR 2025 Workshops: AI4NA, 2025.Markdown
[Ruiz et al. "Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction." ICLR 2025 Workshops: AI4NA, 2025.](https://mlanthology.org/iclrw/2025/ruiz2025iclrw-benchmarking/)BibTeX
@inproceedings{ruiz2025iclrw-benchmarking,
title = {{Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction}},
author = {Ruiz, Pablo Rodenas and Saadat, Ali and Tran, Timothy T. and Smedt, Oliver Müller and Zhang, Peng and Fellay, Jacques},
booktitle = {ICLR 2025 Workshops: AI4NA},
year = {2025},
url = {https://mlanthology.org/iclrw/2025/ruiz2025iclrw-benchmarking/}
}