Chen, Wuyang
29 publications
ICLR
2024
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models
AutoML
2023
“No Free Lunch” in Neural Architectures? a Joint Analysis of Expressivity, Convergence, and Generalization
ICLR
2022
Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining
WACV
2022
Sandwich Batch Normalization: A Drop-in Replacement for Feature Distribution Heterogeneity
ICLR
2021
Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective