Ito, Shinji
57 publications
NeurIPS
2025
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
COLT
2025
Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
NeurIPS
2024
Fast Rates in Stochastic Online Convex Optimization by Exploiting the Curvature of Feasible Sets
AAAI
2024
New Classes of the Greedy-Applicable Arm Feature Distributions in the Sparse Linear Bandit Problem
NeurIPS
2024
On the Minimax Regret for Contextual Linear Bandits and Multi-Armed Bandits with Expert Advice
NeurIPS
2023
Stability-Penalty-Adaptive Follow-the-Regularized-Leader: Sparsity, Game-Dependency, and Best-of-Both-Worlds
AAAI
2021
Near-Optimal Regret Bounds for Contextual Combinatorial Semi-Bandits with Linear Payoff Functions
AISTATS
2020
An Optimal Algorithm for Bandit Convex Optimization with Strongly-Convex and Smooth Loss