Tsuchiya, Taira
21 publications
NeurIPS
2025
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
COLT
2025
Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
NeurIPS
2024
Fast Rates in Stochastic Online Convex Optimization by Exploiting the Curvature of Feasible Sets