Cramming Protein Language Model Training in 24 GPU Hours
Abstract
Protein language models (pLMs) are ubiquitous across biological machine learning research, but state-of-the-art models like ESM2 take hundreds of thousands of GPU hours to pre-train on the vast protein universe. Resource requirements for scaling up pLMs prevent fundamental investigations into how optimal modeling choices might differ from those used in natural language. Here, we define a "cramming" challenge for pLMs and train performant models in 24 hours on a single GPU. By re-examining many aspects of pLM training, we are able to train a 67 million parameter model in a single day that achieves performance comparable to ESM-3B, a model trained for over 15,000× more GPU hours than ours, on downstream protein fitness landscape inference tasks. We open-source our library for training and inference, LBSTER: Language models for Biological Sequence Transformation and Evolutionary Representation.
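The sketch below illustrates the kind of masked-language-model pre-training step the cramming setup is built around: a small Transformer encoder over amino-acid tokens, trained to recover randomly masked residues. It is a minimal, self-contained example in plain PyTorch, not the LBSTER API; the vocabulary, masking rate, model sizes, and all names here are illustrative assumptions.

```python
# Illustrative sketch (NOT the LBSTER API): one masked-language-model training
# step for protein sequences with a small Transformer encoder. All sizes and
# names are assumptions chosen for clarity, not values from the paper.
import torch
import torch.nn as nn

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAD, MASK = 0, 1                                   # special token ids
VOCAB = {aa: i + 2 for i, aa in enumerate(AMINO_ACIDS)}
VOCAB_SIZE = len(AMINO_ACIDS) + 2


class TinyProteinMLM(nn.Module):
    def __init__(self, d_model=256, n_layers=4, n_heads=8, max_len=512):
        super().__init__()
        self.embed = nn.Embedding(VOCAB_SIZE, d_model, padding_idx=PAD)
        self.pos = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(
            d_model, n_heads, dim_feedforward=4 * d_model, batch_first=True
        )
        self.encoder = nn.TransformerEncoder(layer, n_layers)
        self.lm_head = nn.Linear(d_model, VOCAB_SIZE)

    def forward(self, tokens):
        positions = torch.arange(tokens.size(1), device=tokens.device)
        x = self.embed(tokens) + self.pos(positions)
        x = self.encoder(x, src_key_padding_mask=tokens.eq(PAD))
        return self.lm_head(x)


def mask_tokens(tokens, mask_rate=0.15):
    """Randomly mask residues; the model predicts the originals at those
    positions (simplified from the full 80/10/10 BERT corruption scheme)."""
    labels = tokens.clone()
    maskable = tokens.ne(PAD)
    chosen = torch.rand_like(tokens, dtype=torch.float).lt(mask_rate) & maskable
    labels[~chosen] = -100                         # ignored by cross-entropy
    corrupted = tokens.clone()
    corrupted[chosen] = MASK
    return corrupted, labels


def encode(seq, max_len=512):
    ids = [VOCAB[aa] for aa in seq[:max_len]]
    return ids + [PAD] * (max_len - len(ids))


# One training step on a toy batch of two sequences.
model = TinyProteinMLM()
opt = torch.optim.AdamW(model.parameters(), lr=4e-4)
batch = torch.tensor([encode("MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"),
                      encode("MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERS")])
inputs, labels = mask_tokens(batch)
logits = model(inputs)
loss = nn.functional.cross_entropy(logits.view(-1, VOCAB_SIZE), labels.view(-1))
loss.backward()
opt.step()
print(f"MLM loss: {loss.item():.3f}")
```

In the cramming setting, the interesting questions are around how far this basic recipe can be pushed within a fixed 24-hour, single-GPU budget (model size, data throughput, and optimization choices), rather than the step itself, which is standard masked-language-model training.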
Cite
Text
Frey et al. "Cramming Protein Language Model Training in 24 GPU Hours." ICML 2024 Workshops: AccMLBio, 2024.
Markdown
[Frey et al. "Cramming Protein Language Model Training in 24 GPU Hours." ICML 2024 Workshops: AccMLBio, 2024.](https://mlanthology.org/icmlw/2024/frey2024icmlw-cramming/)
BibTeX
@inproceedings{frey2024icmlw-cramming,
title = {{Cramming Protein Language Model Training in 24 GPU Hours}},
author = {Frey, Nathan C. and Joren, Taylor and Ismail, Aya Abdelsalam and Goodman, Allen and Bonneau, Richard and Cho, Kyunghyun and Gligorijevic, Vladimir},
booktitle = {ICML 2024 Workshops: AccMLBio},
year = {2024},
url = {https://mlanthology.org/icmlw/2024/frey2024icmlw-cramming/}
}