Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well

Cite

Text

Gupta et al. "Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well." International Conference on Learning Representations, 2020.

Markdown

[Gupta et al. "Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well." International Conference on Learning Representations, 2020.](https://mlanthology.org/iclr/2020/gupta2020iclr-stochastic/)

BibTeX

@inproceedings{gupta2020iclr-stochastic,
  title     = {{Stochastic Weight Averaging in Parallel: Large-Batch Training That Generalizes Well}},
  author    = {Gupta, Vipul and Serrano, Santiago Akle and DeCoste, Dennis},
  booktitle = {International Conference on Learning Representations},
  year      = {2020},
  url       = {https://mlanthology.org/iclr/2020/gupta2020iclr-stochastic/}
}