Hassan, Hany

2 publications

ICML 2022. Gating Dropout: Communication-Efficient Regularization for Sparsely Activated Transformers. Rui Liu, Young Jin Kim, Alexandre Muzio, Hany Hassan.
ICLR 2022. Taming Sparsely Activated Transformer with Stochastic Experts. Simiao Zuo, Xiaodong Liu, Jian Jiao, Young Jin Kim, Hany Hassan, Ruofei Zhang, Jianfeng Gao, Tuo Zhao.