Üstün, Ahmet

9 publications

NeurIPS 2025
The Leaderboard Illusion
Shivalika Singh, Yiyang Nan, Alex Wang, Daniel D'souza, Sayash Kapoor, Ahmet Üstün, Sanmi Koyejo, Yuntian Deng, Shayne Longpre, Noah A. Smith, Beyza Ermis, Marzieh Fadaee, Sara Hooker

ICLR 2025
To Code or Not to Code? Exploring Impact of Code in Pre-Training
Viraat Aryabumi, Yixuan Su, Raymond Ma, Adrien Morisot, Ivan Zhang, Acyr Locatelli, Marzieh Fadaee, Ahmet Üstün, Sara Hooker

NeurIPS 2025
Treasure Hunt: Real-Time Targeting of the Long Tail Using Training-Time Markers
Daniel D'souza, Julia Kreutzer, Adrien Morisot, Ahmet Üstün, Sara Hooker

NeurIPS 2024
BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli

ICMLW 2024
BAM! Just like That: Simple and Efficient Parameter Upcycling for Mixture of Experts
Qizhen Zhang, Nikolas Gritsch, Dwaraknath Gnaneshwar, Simon Guo, David Cairuz, Bharat Venkitesh, Jakob Nicolaus Foerster, Phil Blunsom, Sebastian Ruder, Ahmet Üstün, Acyr Locatelli

NeurIPSW 2024
Nexus: Specialization Meets Adaptability for Efficiently Training Mixture of Experts
Nikolas Gritsch, Qizhen Zhang, Acyr Locatelli, Sara Hooker, Ahmet Üstün

ICLR 2024
Pushing Mixture of Experts to the Limit: Extremely Parameter Efficient MoE for Instruction Tuning
Ted Zadouri, Ahmet Üstün, Arash Ahmadian, Beyza Ermis, Acyr Locatelli, Sara Hooker

ICMLW 2024
Seeded LoRA: Collaborative Fine-Tuning Through Seed Initialization of Adapters
Alejandro R. Salamanca, Ahmet Üstün, Nicki Skafte Detlefsen, Tim Dettmers

NeurIPS 2023
Intriguing Properties of Quantization at Scale
Arash Ahmadian, Saurabh Dash, Hongyu Chen, Bharat Venkitesh, Zhen Stephen Gou, Phil Blunsom, Ahmet Üstün, Sara Hooker