Ghorbani, Behrooz

14 publications

NeurIPSW 2023 Adaptive Gradient Methods at the Edge of Stability Jeremy Cohen, Behrooz Ghorbani, Shankar Krishnan, Naman Agarwal, Sourabh Medapati, Michal Badura, Daniel Suo, Zachary Nado, George E. Dahl, Justin Gilmer
NeurIPS 2023 Binarized Neural Machine Translation Yichi Zhang, Ankush Garg, Yuan Cao, Lukasz Lew, Behrooz Ghorbani, Zhiru Zhang, Orhan Firat
NeurIPS 2023 Order Matters in the Presence of Dataset Imbalance for Multilingual Learning Dami Choi, Derrick Xin, Hamid Dadkhahi, Justin Gilmer, Ankush Garg, Orhan Firat, Chih-Kuan Yeh, Andrew M Dai, Behrooz Ghorbani
ICML 2023 Scaling Laws for Multilingual Neural Machine Translation Patrick Fernandes, Behrooz Ghorbani, Xavier Garcia, Markus Freitag, Orhan Firat
ICLR 2022 A Loss Curvature Perspective on Training Instabilities of Deep Learning Models Justin Gilmer, Behrooz Ghorbani, Ankush Garg, Sneha Kudugunta, Behnam Neyshabur, David Cardoze, George Edward Dahl, Zachary Nado, Orhan Firat
ICML 2022 Data Scaling Laws in NMT: The Effect of Noise and Architecture Yamini Bansal, Behrooz Ghorbani, Ankush Garg, Biao Zhang, Colin Cherry, Behnam Neyshabur, Orhan Firat
NeurIPS 2022 Do Current Multi-Task Optimization Methods in Deep Learning Even Help? Derrick Xin, Behrooz Ghorbani, Justin Gilmer, Ankush Garg, Orhan Firat
ICML 2022 Examining Scaling and Transfer of Language Model Architectures for Machine Translation Biao Zhang, Behrooz Ghorbani, Ankur Bapna, Yong Cheng, Xavier Garcia, Jonathan Shen, Orhan Firat
ICLR 2022 Scaling Laws for Neural Machine Translation Behrooz Ghorbani, Orhan Firat, Markus Freitag, Ankur Bapna, Maxim Krikun, Xavier Garcia, Ciprian Chelba, Colin Cherry
NeurIPS 2020 When Do Neural Networks Outperform Kernel Methods? Behrooz Ghorbani, Song Mei, Theodor Misiakiewicz, Andrea Montanari
ICML 2019 An Instability in Variational Inference for Topic Models Behrooz Ghorbani, Hamid Javadi, Andrea Montanari
ICML 2019 An Investigation into Neural Net Optimization via Hessian Eigenvalue Density Behrooz Ghorbani, Shankar Krishnan, Ying Xiao
NeurIPS 2019 Limitations of Lazy Training of Two-Layers Neural Network Behrooz Ghorbani, Song Mei, Theodor Misiakiewicz, Andrea Montanari
ICMLW 2019 The Effect of Network Depth on the Optimization Landscape Behrooz Ghorbani, Ying Xiao, Shankar Krishnan