Awadallah, Ahmed Hassan
17 publications
ICLR
2025
Exploratory Preference Optimization: Harnessing Implicit Q*-Approximation for Sample-Efficient RLHF
NeurIPS
2025
Learning to Specialize: Joint Gating-Expert Training for Adaptive MoEs in Decentralized Settings
AutoML
2023
Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference