Mansour, Yishay
187 publications
ICML
2025
Near-Optimal Regret Using Policy Optimization in Online MDPs with Aggregate Bandit Feedback
NeurIPS
2025
Regret Bounds for Adversarial Contextual Bandits with General Function Approximation and Delayed Feedback
ICML
2023
Efficient Rate Optimal Regret for Adversarial Contextual MDPs Using Online Function Approximation
NeurIPS
2023
Eliciting User Preferences for Personalized Multi-Objective Decision Making Through Comparative Feedback
ICML
2023
Improved Regret for Efficient Online Reinforcement Learning with Linear Function Approximation