ML Anthology
Authors
Search
About
Ghorbani, Behrooz
14 publications
NeurIPSW
2023
Adaptive Gradient Methods at the Edge of Stability
Jeremy Cohen
,
Behrooz Ghorbani
,
Shankar Krishnan
,
Naman Agarwal
,
Sourabh Medapati
,
Michal Badura
,
Daniel Suo
,
Zachary Nado
,
George E. Dahl
,
Justin Gilmer
NeurIPS
2023
Binarized Neural Machine Translation
Yichi Zhang
,
Ankush Garg
,
Yuan Cao
,
Lukasz Lew
,
Behrooz Ghorbani
,
Zhiru Zhang
,
Orhan Firat
NeurIPS
2023
Order Matters in the Presence of Dataset Imbalance for Multilingual Learning
Dami Choi
,
Derrick Xin
,
Hamid Dadkhahi
,
Justin Gilmer
,
Ankush Garg
,
Orhan Firat
,
Chih-Kuan Yeh
,
Andrew M Dai
,
Behrooz Ghorbani
ICML
2023
Scaling Laws for Multilingual Neural Machine Translation
Patrick Fernandes
,
Behrooz Ghorbani
,
Xavier Garcia
,
Markus Freitag
,
Orhan Firat
ICLR
2022
A Loss Curvature Perspective on Training Instabilities of Deep Learning Models
Justin Gilmer
,
Behrooz Ghorbani
,
Ankush Garg
,
Sneha Kudugunta
,
Behnam Neyshabur
,
David Cardoze
,
George Edward Dahl
,
Zachary Nado
,
Orhan Firat
ICML
2022
Data Scaling Laws in NMT: The Effect of Noise and Architecture
Yamini Bansal
,
Behrooz Ghorbani
,
Ankush Garg
,
Biao Zhang
,
Colin Cherry
,
Behnam Neyshabur
,
Orhan Firat
NeurIPS
2022
Do Current Multi-Task Optimization Methods in Deep Learning Even Help?
Derrick Xin
,
Behrooz Ghorbani
,
Justin Gilmer
,
Ankush Garg
,
Orhan Firat
ICML
2022
Examining Scaling and Transfer of Language Model Architectures for Machine Translation
Biao Zhang
,
Behrooz Ghorbani
,
Ankur Bapna
,
Yong Cheng
,
Xavier Garcia
,
Jonathan Shen
,
Orhan Firat
ICLR
2022
Scaling Laws for Neural Machine Translation
Behrooz Ghorbani
,
Orhan Firat
,
Markus Freitag
,
Ankur Bapna
,
Maxim Krikun
,
Xavier Garcia
,
Ciprian Chelba
,
Colin Cherry
NeurIPS
2020
When Do Neural Networks Outperform Kernel Methods?
Behrooz Ghorbani
,
Song Mei
,
Theodor Misiakiewicz
,
Andrea Montanari
ICML
2019
An Instability in Variational Inference for Topic Models
Behrooz Ghorbani
,
Hamid Javadi
,
Andrea Montanari
ICML
2019
An Investigation into Neural Net Optimization via Hessian Eigenvalue Density
Behrooz Ghorbani
,
Shankar Krishnan
,
Ying Xiao
NeurIPS
2019
Limitations of Lazy Training of Two-Layers Neural Network
Behrooz Ghorbani
,
Song Mei
,
Theodor Misiakiewicz
,
Andrea Montanari
ICMLW
2019
The Effect of Network Depth on the Optimization Landscape
Behrooz Ghorbani
,
Ying Xiao
,
Shankar Krishnan