Interpretable Enzyme Function Prediction via Residue-Level Detection

Abstract

Predicting multiple functions labeled with Enzyme Commission (EC) numbers from the enzyme sequence is of great significance but remains a challenge due to its sparse multi-label classification nature, i.e., each enzyme is typically associated with only a few labels out of more than 6000 possible EC numbers. However, existing machine learning algorithms generally learn a fixed global representation for each enzyme to classify all functions, thereby they lack interpretability and the fine-grained information of some function-specific local residue fragments may be overwhelmed. Here we present an attention-based framework, namely ProtDETR (Protein Detection Transformer), by casting enzyme function prediction as a detection problem. It uses a set of learnable functional queries to adaptatively extract different local representations from the sequence of residue-level features for predicting different EC numbers. ProtDETR not only significantly outperforms existing deep learning-based enzyme function prediction methods, but also provides a new interpretable perspective on automatically detecting different local regions for identifying different functions through cross-attentions between queries and residue-level features.

Cite

Text

Yang et al. "Interpretable Enzyme Function Prediction via Residue-Level Detection." ICLR 2025 Workshops: LMRL, 2025.

Markdown

[Yang et al. "Interpretable Enzyme Function Prediction via Residue-Level Detection." ICLR 2025 Workshops: LMRL, 2025.](https://mlanthology.org/iclrw/2025/yang2025iclrw-interpretable/)

BibTeX

@inproceedings{yang2025iclrw-interpretable,
  title     = {{Interpretable Enzyme Function Prediction via Residue-Level Detection}},
  author    = {Yang, Zhao and Su, Bing and Chen, Jiahao and Wen, Ji-Rong},
  booktitle = {ICLR 2025 Workshops: LMRL},
  year      = {2025},
  url       = {https://mlanthology.org/iclrw/2025/yang2025iclrw-interpretable/}
}