Ariu, Kaito

21 publications

ICLR 2025 Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games Kenshi Abe, Mitsuki Sakamoto, Kaito Ariu, Atsushi Iwasaki
TMLR 2025 Evaluation of Best-of-N Sampling Strategies for Language Model Alignment Yuki Ichihara, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Eiji Uchibe
NeurIPS 2025 Last Iterate Convergence in Monotone Mean Field Games Noboru Isobe, Kenshi Abe, Kaito Ariu
NeurIPS 2025 Learning from Delayed Feedback in Games via Extra Prediction Yuma Fujimoto, Kenshi Abe, Kaito Ariu
TMLR 2025 Return-Aligned Decision Transformer Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra
ICML 2025 Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model Kaito Ariu, Alexandre Proutiere, Se-Young Yun
AAAI 2025 Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium Yuma Fujimoto, Kaito Ariu, Kenshi Abe
ICML 2024 Adaptively Perturbed Mirror Descent for Learning in Games Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Atsushi Iwasaki
ICMLW 2024 Filtered Direct Preference Optimization Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu
ICML 2024 Matroid Semi-Bandits in Sublinear Time Ruo-Chun Tzeng, Naoto Ohsaka, Kaito Ariu
AAAI 2024 Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games Yuma Fujimoto, Kaito Ariu, Kenshi Abe
ICML 2024 Model-Based Minimum Bayes Risk Decoding for Text Generation Yuu Jinnai, Tetsuro Morimura, Ukyo Honda, Kaito Ariu, Kenshi Abe
ICML 2024 On Universally Optimal Algorithms for A/B Testing Po-An Wang, Kaito Ariu, Alexandre Proutiere
MLJ 2024 Optimal Clustering from Noisy Binary Feedback Kaito Ariu, Jungseul Ok, Alexandre Proutière, Se-Young Yun
ICMLW 2024 Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe
ICMLW 2023 An Optimal Clustering Algorithm for the Labeled Stochastic Block Model Kaito Ariu, Se-Young Yun, Alexandre Proutiere
AISTATS 2023 Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Kentaro Toyoshima, Atsushi Iwasaki
IJCAI 2023 Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium Yuma Fujimoto, Kaito Ariu, Kenshi Abe
ICML 2022 Thresholded Lasso Bandit Kaito Ariu, Kenshi Abe, Alexandre Proutiere
AISTATS 2020 Optimal Algorithms for Multiplayer Multi-Armed Bandits Po-An Wang, Alexandre Proutiere, Kaito Ariu, Yassir Jedra, Alessio Russo
NeurIPS 2020 Regret in Online Recommendation Systems Kaito Ariu, Narae Ryu, Se-Young Yun, Alexandre Proutiere