Sakamoto, Mitsuki

6 publications

ICLR 2025 Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games Kenshi Abe, Mitsuki Sakamoto, Kaito Ariu, Atsushi Iwasaki
TMLR 2025 Evaluation of Best-of-N Sampling Strategies for Language Model Alignment Yuki Ichihara, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Eiji Uchibe
ICML 2024 Adaptively Perturbed Mirror Descent for Learning in Games Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Atsushi Iwasaki
ICMLW 2024 Filtered Direct Preference Optimization Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu
AISTATS 2023 Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Kentaro Toyoshima, Atsushi Iwasaki
UAI 2022 Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games Kenshi Abe, Mitsuki Sakamoto, Atsushi Iwasaki