ML Anthology
Authors
Search
About
Preux, Philippe
24 publications
TMLR
2024
AdaStop: Adaptive Statistical Testing for Sound Comparisons of Deep RL Agents
Timothée Mathieu
,
Matheus Medeiros Centa
,
Riccardo Della Vecchia
,
Hector Kohler
,
Alena Shilova
,
Odalric-Ambrym Maillard
,
Philippe Preux
TMLR
2024
Augmenting Ad-Hoc IR Dataset for Interactive Conversational Search
Pierre Erbacher
,
Jian-Yun Nie
,
Philippe Preux
,
Laure Soulier
ICMLW
2024
Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning
Alena Shilova
,
Thomas Delliaux
,
Philippe Preux
,
Bruno Raffin
AAAI
2023
Soft Action Priors: Towards Robust Policy Transfer
Matheus Centa
,
Philippe Preux
NeurIPSW
2022
Better State Exploration Using Action Sequence Equivalence
Nathan Grinsztajn
,
Toby Johnstone
,
Johan Ferret
,
Philippe Preux
ICLR
2021
Adversarially Guided Actor-Critic
Yannis Flet-Berliac
,
Johan Ferret
,
Olivier Pietquin
,
Philippe Preux
,
Matthieu Geist
IJCAI
2021
Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Mathieu Seurin
,
Florian Strub
,
Philippe Preux
,
Olivier Pietquin
ICLR
2021
Learning Value Functions in Deep Policy Gradients Using Residual Variance
Yannis Flet-Berliac
,
Reda Ouhamma
,
Odalric-Ambrym Maillard
,
Philippe Preux
ICLRW
2021
Low-Rank Projections of GCNs Laplacian
Nathan Grinsztajn
,
Philippe Preux
,
Edouard Oyallon
NeurIPS
2021
There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning
Nathan Grinsztajn
,
Johan Ferret
,
Olivier Pietquin
,
Philippe Preux
,
Matthieu Geist
IJCAI
2020
Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL
Yannis Flet-Berliac
,
Philippe Preux
ECCV
2018
Visual Reasoning with Multi-Hop Feature Modulation
Florian Strub
,
Mathieu Seurin
,
Ethan Perez
,
Harm de Vries
,
Jeremie Mary
,
Philippe Preux
,
Aaron CourvilleOlivier Pietquin
JMLR
2016
Consistent Algorithms for Clustering Time Series
Azadeh Khaleghi
,
Daniil Ryabko
,
Jérémie Mary
,
Philippe Preux
JMLR
2016
Operator-Valued Kernels for Learning from Functional Response Data
Hachem Kadri
,
Emmanuel Duflos
,
Philippe Preux
,
Stéphane Canu
,
Alain Rakotomamonjy
,
Julien Audiffren
ICML
2014
Improving Offline Evaluation of Contextual Bandit Algorithms via Bootstrapping Techniques
Jérémie Mary
,
Philippe Preux
,
Olivier Nicol
ICML
2013
A Generalized Kernel Approach to Structured Output Learning
Hachem Kadri
,
Mohammad Ghavamzadeh
,
Philippe Preux
ECML-PKDD
2012
Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization
Gabriel Dulac-Arnold
,
Ludovic Denoyer
,
Philippe Preux
,
Patrick Gallinari
NeurIPS
2012
Multiple Operator-Valued Kernel Learning
Hachem Kadri
,
Alain Rakotomamonjy
,
Philippe Preux
,
Francis R. Bach
AISTATS
2012
Online Clustering of Processes
Azadeh Khaleghi
,
Daniil Ryabko
,
Jeremie Mary
,
Philippe Preux
MLJ
2012
Sequential Approaches for Learning Datum-Wise Sparse Representations
Gabriel Dulac-Arnold
,
Ludovic Denoyer
,
Philippe Preux
,
Patrick Gallinari
ECML-PKDD
2011
Datum-Wise Classification: A Sequential Approach to Sparsity
Gabriel Dulac-Arnold
,
Ludovic Denoyer
,
Philippe Preux
,
Patrick Gallinari
ICML
2011
Functional Regularized Least Squares Classication with Operator-Valued Kernels
Hachem Kadri
,
Asma Rabaoui
,
Philippe Preux
,
Emmanuel Duflos
,
Alain Rakotomamonjy
AISTATS
2010
Nonlinear Functional Regression: A Functional RKHS Approach
Hachem Kadri
,
Emmanuel Duflos
,
Philippe Preux
,
Stéphane Canu
,
Manuel Davy
ECML-PKDD
2002
Propagation of Q-Values in Tabular TD(lambda)
Philippe Preux