ML Anthology
Authors
Search
About
Abbasi-Yadkori, Yasin
26 publications
JMLR
2023
A New Look at Dynamic Regret for Non-Stationary Stochastic Bandits
Yasin Abbasi-Yadkori
,
András György
,
Nevena Lazić
AISTATS
2022
Confident Least Square Value Iteration with Local Access to a Simulator
Botao Hao
,
Nevena Lazic
,
Dong Yin
,
Yasin Abbasi-Yadkori
,
Csaba Szepesvari
ALT
2022
Efficient Local Planning with Linear Function Approximation
Dong Yin
,
Botao Hao
,
Yasin Abbasi-Yadkori
,
Nevena Lazić
,
Csaba Szepesvári
ICML
2022
Feature and Parameter Selection in Stochastic Linear Bandits
Ahmadreza Moradipari
,
Berkay Turan
,
Yasin Abbasi-Yadkori
,
Mahnoosh Alizadeh
,
Mohammad Ghavamzadeh
AISTATS
2021
Adaptive Approximate Policy Iteration
Botao Hao
,
Nevena Lazic
,
Yasin Abbasi-Yadkori
,
Pooria Joulani
,
Csaba Szepesvari
ICML
2021
Improved Regret Bound and Experience Replay in Regularized Policy Iteration
Nevena Lazic
,
Dong Yin
,
Yasin Abbasi-Yadkori
,
Csaba Szepesvari
COLT
2021
On Query-Efficient Planning in MDPs Under Linear Realizability of the Optimal State-Value Function
Gellert Weisz
,
Philip Amortila
,
Barnabás Janzer
,
Yasin Abbasi-Yadkori
,
Nan Jiang
,
Csaba Szepesvari
AISTATS
2019
Model-Free Linear Quadratic Control via Reduction to Expert Prediction
Yasin Abbasi-Yadkori
,
Nevena Lazic
,
Csaba Szepesvari
UAI
2019
On Densification for Minwise Hashing
Tung Mai
,
Anup Rao
,
Matt Kapilevich
,
Ryan Rossi
,
Yasin Abbasi-Yadkori
,
Ritwik Sinha
AISTATS
2019
Optimizing over a Restricted Policy Class in MDPs
Ershad Banijamali
,
Yasin Abbasi-Yadkori
,
Mohammad Ghavamzadeh
,
Nikos Vlassis
ICML
2019
POLITEX: Regret Bounds for Policy Iteration Using Expert Prediction
Yasin Abbasi-Yadkori
,
Peter Bartlett
,
Kush Bhatia
,
Nevena Lazic
,
Csaba Szepesvari
,
Gellert Weisz
AISTATS
2019
Sample Efficient Graph-Based Optimization with Noisy Observations
Thanh Tan Nguyen
,
Ali Shameli
,
Yasin Abbasi-Yadkori
,
Anup Rao
,
Branislav Kveton
COLT
2018
Best of Both Worlds: Stochastic & Adversarial Best-Arm Identification
Yasin Abbasi-Yadkori
,
Peter L. Bartlett
,
Victor Gabillon
,
Alan Malek
,
Michal Valko
AISTATS
2017
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Yasin Abbasi-Yadkori
,
Peter L. Bartlett
,
Victor Gabillon
,
Alan Malek
AISTATS
2016
A Fast and Reliable Policy Improvement Algorithm
Yasin Abbasi-Yadkori
,
Peter L. Bartlett
,
Stephen J. Wright
UAI
2015
Bayesian Optimal Control of Smoothly Parameterized Systems
Yasin Abbasi-Yadkori
,
Csaba Szepesvári
ICML
2015
Large-Scale Markov Decision Problems with KL Control Cost and Its Application to Crowdsourcing
Yasin Abbasi-Yadkori
,
Peter Bartlett
,
Xi Chen
,
Alan Malek
ICML
2014
Linear Programming for Large-Scale Markov Decision Problems
Alan Malek
,
Yasin Abbasi-Yadkori
,
Peter Bartlett
ICML
2014
Prediction with Limited Advice and Multiarmed Bandits with Paid Observations
Yevgeny Seldin
,
Peter Bartlett
,
Koby Crammer
,
Yasin Abbasi-Yadkori
ICML
2014
Tracking Adversarial Targets
Yasin Abbasi-Yadkori
,
Peter Bartlett
,
Varun Kanade
AISTATS
2012
Online-to-Confidence-Set Conversions and Application to Sparse Stochastic Bandits
Yasin Abbasi-Yadkori
,
David Pal
,
Csaba Szepesvari
IJCAI
2011
Fast Approximate Nearest-Neighbor Search with K-Nearest Neighbor Graph
Kiana Hajebi
,
Yasin Abbasi-Yadkori
,
Hossein Shahbazi
,
Hong Zhang
NeurIPS
2011
Improved Algorithms for Linear Stochastic Bandits
Yasin Abbasi-yadkori
,
Dávid Pál
,
Csaba Szepesvári
COLT
2011
Regret Bounds for the Adaptive Control of Linear Quadratic Systems
Yasin Abbasi-Yadkori
,
Csaba Szepesvári
UAI
2009
Improved Mean and Variance Approximations for Belief Net Responses via Network Doubling
Peter Hooper
,
Yasin Abbasi-Yadkori
,
Russell Greiner
,
Bret Hoehn
ICML
2009
Learning When to Stop Thinking and Do Something!
Barnabás Póczos
,
Yasin Abbasi-Yadkori
,
Csaba Szepesvári
,
Russell Greiner
,
Nathan R. Sturtevant