ML Anthology
Authors
Search
About
Gabillon, Victor
17 publications
MLJ
2024
Manas: Multi-Agent Neural Architecture Search
Vasco Lopes
,
Fabio Maria Carlucci
,
Pedro M. Esperança
,
Marco Singh
,
Antoine Yang
,
Victor Gabillon
,
Hang Xu
,
Zewei Chen
,
Jun Wang
AISTATS
2020
Adaptive Multi-Fidelity Optimization with Fast Learning Rates
Côme Fiegel
,
Victor Gabillon
,
Michal Valko
AISTATS
2020
Derivative-Free & Order-Robust Optimisation
Haitham Ammar
,
Victor Gabillon
,
Rasul Tutunov
,
Michal Valko
ALT
2019
A Simple Parameter-Free and Adaptive Approach to Optimization Under a Minimal Local Smoothness Assumption
Peter L. Bartlett
,
Victor Gabillon
,
Michal Valko
ICML
2019
Scale-Free Adaptive Planning for Deterministic Dynamics & Discounted Rewards
Peter Bartlett
,
Victor Gabillon
,
Jennifer Healey
,
Michal Valko
COLT
2018
Best of Both Worlds: Stochastic & Adversarial Best-Arm Identification
Yasin Abbasi-Yadkori
,
Peter L. Bartlett
,
Victor Gabillon
,
Alan Malek
,
Michal Valko
AISTATS
2017
Hit-and-Run for Sampling and Planning in Non-Convex Spaces
Yasin Abbasi-Yadkori
,
Peter L. Bartlett
,
Victor Gabillon
,
Alan Malek
NeurIPS
2017
Near Minimax Optimal Players for the Finite-Time 3-Expert Prediction Problem
Yasin Abbasi Yadkori
,
Peter L Bartlett
,
Victor Gabillon
AISTATS
2016
Improved Learning Complexity in Combinatorial Pure Exploration Bandits
Victor Gabillon
,
Alessandro Lazaric
,
Mohammad Ghavamzadeh
,
Ronald Ortner
,
Peter L. Bartlett
JMLR
2015
Approximate Modified Policy Iteration and Its Application to the Game of Tetris
Bruno Scherrer
,
Mohammad Ghavamzadeh
,
Victor Gabillon
,
Boris Lesner
,
Matthieu Geist
AAAI
2014
Large-Scale Optimistic Adaptive Submodularity
Victor Gabillon
,
Branislav Kveton
,
Zheng Wen
,
Brian Eriksson
,
S. Muthukrishnan
NeurIPS
2013
Adaptive Submodular Maximization in Bandit Setting
Victor Gabillon
,
Branislav Kveton
,
Zheng Wen
,
Brian Eriksson
,
S. Muthukrishnan
NeurIPS
2013
Approximate Dynamic Programming Finally Performs Well in the Game of Tetris
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Bruno Scherrer
ICML
2012
Approximate Modified Policy Iteration
Bruno Scherrer
,
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Matthieu Geist
NeurIPS
2012
Best Arm Identification: A Unified Approach to Fixed Budget and Fixed Confidence
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Alessandro Lazaric
ICML
2011
Classification-Based Policy Iteration with a Critic
Victor Gabillon
,
Alessandro Lazaric
,
Mohammad Ghavamzadeh
,
Bruno Scherrer
NeurIPS
2011
Multi-Bandit Best Arm Identification
Victor Gabillon
,
Mohammad Ghavamzadeh
,
Alessandro Lazaric
,
Sébastien Bubeck