Abe, Kenshi

19 publications

AAAI 2025. Approximate State Abstraction for Markov Games. Hiroki Ishibashi, Kenshi Abe, Atsushi Iwasaki
ICLR 2025. Boosting Perturbed Gradient Ascent for Last-Iterate Convergence in Games. Kenshi Abe, Mitsuki Sakamoto, Kaito Ariu, Atsushi Iwasaki
TMLR 2025. Evaluation of Best-of-N Sampling Strategies for Language Model Alignment. Yuki Ichihara, Yuu Jinnai, Tetsuro Morimura, Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Eiji Uchibe
NeurIPS 2025. Last Iterate Convergence in Monotone Mean Field Games. Noboru Isobe, Kenshi Abe, Kaito Ariu
NeurIPS 2025. Learning from Delayed Feedback in Games via Extra Prediction. Yuma Fujimoto, Kenshi Abe, Kaito Ariu
TMLR 2025. Return-Aligned Decision Transformer. Tsunehiko Tanaka, Kenshi Abe, Kaito Ariu, Tetsuro Morimura, Edgar Simo-Serra
AAAI 2025. Synchronization in Learning in Periodic Zero-Sum Games Triggers Divergence from Nash Equilibrium. Yuma Fujimoto, Kaito Ariu, Kenshi Abe
ICML 2024. Adaptively Perturbed Mirror Descent for Learning in Games. Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Atsushi Iwasaki
ICMLW 2024. Filtered Direct Preference Optimization. Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu
AISTATS 2024. Learning Fair Division from Bandit Feedback. Hakuei Yamada, Junpei Komiyama, Kenshi Abe, Atsushi Iwasaki
AAAI 2024. Memory Asymmetry Creates Heteroclinic Orbits to Nash Equilibrium in Learning in Zero-Sum Games. Yuma Fujimoto, Kaito Ariu, Kenshi Abe
ICML 2024. Model-Based Minimum Bayes Risk Decoding for Text Generation. Yuu Jinnai, Tetsuro Morimura, Ukyo Honda, Kaito Ariu, Kenshi Abe
ICMLW 2024. Regularized Best-of-N Sampling to Mitigate Reward Hacking for Language Model Alignment. Yuu Jinnai, Tetsuro Morimura, Kaito Ariu, Kenshi Abe
AISTATS 2023. Last-Iterate Convergence with Full and Noisy Feedback in Two-Player Zero-Sum Games. Kenshi Abe, Kaito Ariu, Mitsuki Sakamoto, Kentaro Toyoshima, Atsushi Iwasaki
IJCAI 2023. Learning in Multi-Memory Games Triggers Complex Dynamics Diverging from Nash Equilibrium. Yuma Fujimoto, Kaito Ariu, Kenshi Abe
IJCAI 2022. Anytime Capacity Expansion in Medical Residency Match by Monte Carlo Tree Search. Kenshi Abe, Junpei Komiyama, Atsushi Iwasaki
UAI 2022. Mutation-Driven Follow the Regularized Leader for Last-Iterate Convergence in Zero-Sum Games. Kenshi Abe, Mitsuki Sakamoto, Atsushi Iwasaki
ICML 2022. Thresholded Lasso Bandit. Kaito Ariu, Kenshi Abe, Alexandre Proutiere
NeurIPSW 2021. Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization. Masahiro Kato, Kei Nakagawa, Kenshi Abe, Tetsuro Morimura