Hierarchical Contrastive Learning for Enzyme Function Prediction
Abstract
Enzymes are biological catalysts with numerous industrial applications, and they are categorized by the Enzyme Commission (EC) number system based on their catalytic activities. With over 200 million protein sequences identified, experimental characterization of enzymes is impractical, necessitating computational methods. Current approaches face challenges with class imbalance and intrinsic hierarchy of the EC number system. This study employs hierarchical contrastive learning for EC number prediction, effectively integrating the EC number hierarchy into the model. Our approach addresses severe class imbalance and improves prediction performance, particularly for higher hierarchical levels and previously unseen EC numbers, demonstrating enhanced robustness and outperforming existing methods.
Cite
Text
Yim et al. "Hierarchical Contrastive Learning for Enzyme Function Prediction." ICML 2024 Workshops: ML4LMS, 2024.Markdown
[Yim et al. "Hierarchical Contrastive Learning for Enzyme Function Prediction." ICML 2024 Workshops: ML4LMS, 2024.](https://mlanthology.org/icmlw/2024/yim2024icmlw-hierarchical/)BibTeX
@inproceedings{yim2024icmlw-hierarchical,
title = {{Hierarchical Contrastive Learning for Enzyme Function Prediction}},
author = {Yim, Soorin and Hwang, Doyeong and Kim, Kiyoung and Han, Sehui},
booktitle = {ICML 2024 Workshops: ML4LMS},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/yim2024icmlw-hierarchical/}
}