ML Anthology
Sahu, Sambit
7 publications
TMLR 2025
Continual Pre-Training of MoEs: How Robust Is Your Router?
Benjamin Thérien, Charles-Étienne Joseph, Zain Sarwar, Ashwinee Panda, Anirban Das, Shi-Xiong Zhang, Stephen Rawls, Sambit Sahu, Eugene Belilovsky, Irina Rish
NeurIPS 2025
Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
Ashwinee Panda, Vatsal Baherwani, Zain Sarwar, Benjamin Thérien, Sambit Sahu, Tom Goldstein, Supriyo Chakraborty
JAIR 2025
Preference Tuning with Human Feedback on Language, Speech, and Vision Tasks: A Survey
Genta Indra Winata, Hanyang Zhao, Anirban Das, Wenpin Tang, David D. Yao, Shi-Xiong Zhang, Sambit Sahu
ICLR 2025
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
Hanyang Zhao, Genta Indra Winata, Anirban Das, Shi-Xiong Zhang, David Yao, Wenpin Tang, Sambit Sahu
NeurIPS 2025
T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning
Amartya Chakraborty, Paresh Dashore, Nadia Bathaee, Anmol Jain, Anirban Das, Shi-Xiong Zhang, Sambit Sahu, Milind Naphade, Genta Indra Winata
NeurIPSW 2024
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts
Ashwinee Panda, Vatsal Baherwani, Zain Sarwar, Benjamin Thérien, Stephen Rawls, Sambit Sahu, Supriyo Chakraborty, Tom Goldstein
NeurIPSW 2024
Dense Backpropagation Improves Routing for Sparsely-Gated Mixture-of-Experts
Ashwinee Panda, Vatsal Baherwani, Zain Sarwar, Benjamin Thérien, Stephen Rawls, Sambit Sahu, Supriyo Chakraborty, Tom Goldstein