ML Anthology
Authors
Search
About
Ariu, Kaito
21 publications
ICLR
2025
Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games
Kenshi Abe
,
Mitsuki Sakamoto
,
Kaito Ariu
,
Atsushi Iwasaki
TMLR
2025
Evaluation of Best-of-N Sampling Strategies for Language Model Alignment
Yuki Ichihara
,
Yuu Jinnai
,
Tetsuro Morimura
,
Kenshi Abe
,
Kaito Ariu
,
Mitsuki Sakamoto
,
Eiji Uchibe
NeurIPS
2025
Last Iterate Convergence in Monotone Mean Field Games
Noboru Isobe
,
Kenshi Abe
,
Kaito Ariu
NeurIPS
2025
Learning from Delayed Feedback in Games via Extra Prediction
Yuma Fujimoto
,
Kenshi Abe
,
Kaito Ariu
TMLR
2025
Return-Aligned Decision Transformer
Tsunehiko Tanaka
,
Kenshi Abe
,
Kaito Ariu
,
Tetsuro Morimura
,
Edgar Simo-Serra
ICML
2025
Revisiting Instance-Optimal Cluster Recovery in the Labeled Stochastic Block Model
Kaito Ariu
,
Alexandre Proutiere
,
Se-Young Yun
AAAI
2025
Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium
Yuma Fujimoto
,
Kaito Ariu
,
Kenshi Abe
ICML
2024
Adaptively Perturbed Mirror Descent for Learning in Games
Kenshi Abe
,
Kaito Ariu
,
Mitsuki Sakamoto
,
Atsushi Iwasaki
ICMLW
2024
Filtered Direct Preference Optimization
Tetsuro Morimura
,
Mitsuki Sakamoto
,
Yuu Jinnai
,
Kenshi Abe
,
Kaito Ariu
ICML
2024
Matroid Semi-Bandits in Sublinear Time
Ruo-Chun Tzeng
,
Naoto Ohsaka
,
Kaito Ariu
AAAI
2024
Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games
Yuma Fujimoto
,
Kaito Ariu
,
Kenshi Abe
ICML
2024
Model-Based Minimum Bayes Risk Decoding for Text Generation
Yuu Jinnai
,
Tetsuro Morimura
,
Ukyo Honda
,
Kaito Ariu
,
Kenshi Abe
ICML
2024
On Universally Optimal Algorithms for A/B Testing
Po-An Wang
,
Kaito Ariu
,
Alexandre Proutiere
MLJ
2024
Optimal Clustering from Noisy Binary Feedback
Kaito Ariu
,
Jungseul Ok
,
Alexandre Proutière
,
Seyoung Yun
ICMLW
2024
Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment
Yuu Jinnai
,
Tetsuro Morimura
,
Kaito Ariu
,
Kenshi Abe
ICMLW
2023
An Optimal Clustering Algorithm for the Labeled Stochastic Block Model
Kaito Ariu
,
Se-Young Yun
,
Alexandre Proutiere
AISTATS
2023
Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games
Kenshi Abe
,
Kaito Ariu
,
Mitsuki Sakamoto
,
Kentaro Toyoshima
,
Atsushi Iwasaki
IJCAI
2023
Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium
Yuma Fujimoto
,
Kaito Ariu
,
Kenshi Abe
ICML
2022
Thresholded Lasso Bandit
Kaito Ariu
,
Kenshi Abe
,
Alexandre Proutiere
AISTATS
2020
Optimal Algorithms for Multiplayer Multi-Armed Bandits
Po-An Wang
,
Alexandre Proutiere
,
Kaito Ariu
,
Yassir Jedra
,
Alessio Russo
NeurIPS
2020
Regret in Online Recommendation Systems
Kaito Ariu
,
Narae Ryu
,
Se-Young Yun
,
Alexandre Proutiere