Takashiro, Shota

1 publications

ICLR 2026 RL Squeezes, SFT Expands: A Comparative Study of Reasoning LLMs Kohsei Matsutani, Shota Takashiro, Gouki Minegishi, Takeshi Kojima, Yusuke Iwasawa, Yutaka Matsuo