Oda, Yusuke

1 publications

ICLR 2025 Drop-Upcycling: Training Sparse Mixture of Experts with Partial Re-Initialization Taishi Nakamura, Takuya Akiba, Kazuki Fujii, Yusuke Oda, Rio Yokota, Jun Suzuki