Large Batch Optimization for Deep Learning: Training BERT in 76 Minutes

Cite

Text

You et al. "Large Batch Optimization for Deep Learning: Training BERT in 76 Minutes." International Conference on Learning Representations, 2020.

Markdown

[You et al. "Large Batch Optimization for Deep Learning: Training BERT in 76 Minutes." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/you2020iclr-large/)

BibTeX

@inproceedings{you2020iclr-large,
  title     = {{Large Batch Optimization for Deep Learning: Training BERT in 76 Minutes}},
  author    = {You, Yang and Li, Jing and Reddi, Sashank and Hseu, Jonathan and Kumar, Sanjiv and Bhojanapalli, Srinadh and Song, Xiaodan and Demmel, James and Keutzer, Kurt and Hsieh, Cho-Jui},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/you2020iclr-large/}
}