Koneputugodage, Chamin P Hewa

3 publications

ICLR 2026 Taming Curvature: Architecture Warm-up for Stable Transformer Training Sameera Ramasinghe, Thalaiyasingam Ajanthan, Hadi Mohaghegh Dolatabadi, Chamin P Hewa Koneputugodage, Gil Avraham, Violetta Shevchenko, Yan Zuo, Karol Pajak, Alexander Long
NeurIPS 2025 Mixtures of Subspaces for Bandwidth Efficient Context Parallel Training Sameera Ramasinghe, Thalaiyasingam Ajanthan, Hadi Mohaghegh Dolatabadi, Gil Avraham, Violetta Shevchenko, Yan Zuo, Chamin P Hewa Koneputugodage, Alexander Long
NeurIPS 2025 Unextractable Protocol Models: Collaborative Training and Inference Without Weight Materialization Alexander Long, Chamin P Hewa Koneputugodage, Thalaiyasingam Ajanthan, Yan Zuo, Gil Avraham, Violetta Shevchenko, Hadi Mohaghegh Dolatabadi, Sameera Ramasinghe