Jamieson, Kevin
34 publications
NeurIPS
2025
Adapting to Stochastic and Adversarial Losses in Episodic MDPs with Aggregate Bandit Feedback
ICML
2025
Learning to Incentivize in Repeated Principal-Agent Problems with Adversarial Agent Arrivals
AISTATS
2024
A/B Testing and Best-Arm Identification for Linear Bandits with Robustness to Non-Stationarity
NeurIPS
2024
Active Learning of Neural Population Dynamics Using Two-Photon Holographic Optogenetics
NeurIPS
2024
Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning
AISTATS
2024
Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits
NeurIPS
2024
Overcoming the Sim-to-Real Gap: Leveraging Simulation to Learn to Explore for Real-World RL
NeurIPS
2024
Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning