Preux, Philippe

24 publications

TMLR 2024 AdaStop: Adaptive Statistical Testing for Sound Comparisons of Deep RL Agents Timothée Mathieu, Matheus Medeiros Centa, Riccardo Della Vecchia, Hector Kohler, Alena Shilova, Odalric-Ambrym Maillard, Philippe Preux

TMLR 2024 Augmenting Ad-Hoc IR Dataset for Interactive Conversational Search Pierre Erbacher, Jian-Yun Nie, Philippe Preux, Laure Soulier

ICMLW 2024 Learning HJB Viscosity Solutions with PINNs for Continuous-Time Reinforcement Learning Alena Shilova, Thomas Delliaux, Philippe Preux, Bruno Raffin

AAAI 2023 Soft Action Priors: Towards Robust Policy Transfer Matheus Centa, Philippe Preux

NeurIPSW 2022 Better State Exploration Using Action Sequence Equivalence Nathan Grinsztajn, Toby Johnstone, Johan Ferret, Philippe Preux

ICLR 2021 Adversarially Guided Actor-Critic Yannis Flet-Berliac, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

IJCAI 2021 Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

ICLR 2021 Learning Value Functions in Deep Policy Gradients Using Residual Variance Yannis Flet-Berliac, Reda Ouhamma, Odalric-Ambrym Maillard, Philippe Preux

ICLRW 2021 Low-Rank Projections of GCNs Laplacian Nathan Grinsztajn, Philippe Preux, Edouard Oyallon

NeurIPS 2021 There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

IJCAI 2020 Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL Yannis Flet-Berliac, Philippe Preux

ECCV 2018 Visual Reasoning with Multi-Hop Feature Modulation Florian Strub, Mathieu Seurin, Ethan Perez, Harm de Vries, Jeremie Mary, Philippe Preux, Aaron CourvilleOlivier Pietquin

JMLR 2016 Consistent Algorithms for Clustering Time Series Azadeh Khaleghi, Daniil Ryabko, Jérémie Mary, Philippe Preux

JMLR 2016 Operator-Valued Kernels for Learning from Functional Response Data Hachem Kadri, Emmanuel Duflos, Philippe Preux, Stéphane Canu, Alain Rakotomamonjy, Julien Audiffren

ICML 2014 Improving Offline Evaluation of Contextual Bandit Algorithms via Bootstrapping Techniques Jérémie Mary, Philippe Preux, Olivier Nicol

ICML 2013 A Generalized Kernel Approach to Structured Output Learning Hachem Kadri, Mohammad Ghavamzadeh, Philippe Preux

ECML-PKDD 2012 Fast Reinforcement Learning with Large Action Sets Using Error-Correcting Output Codes for MDP Factorization Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari

NeurIPS 2012 Multiple Operator-Valued Kernel Learning Hachem Kadri, Alain Rakotomamonjy, Philippe Preux, Francis R. Bach

AISTATS 2012 Online Clustering of Processes Azadeh Khaleghi, Daniil Ryabko, Jeremie Mary, Philippe Preux

MLJ 2012 Sequential Approaches for Learning Datum-Wise Sparse Representations Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari

ECML-PKDD 2011 Datum-Wise Classification: A Sequential Approach to Sparsity Gabriel Dulac-Arnold, Ludovic Denoyer, Philippe Preux, Patrick Gallinari

ICML 2011 Functional Regularized Least Squares Classication with Operator-Valued Kernels Hachem Kadri, Asma Rabaoui, Philippe Preux, Emmanuel Duflos, Alain Rakotomamonjy

AISTATS 2010 Nonlinear Functional Regression: A Functional RKHS Approach Hachem Kadri, Emmanuel Duflos, Philippe Preux, Stéphane Canu, Manuel Davy

ECML-PKDD 2002 Propagation of Q-Values in Tabular TD(lambda) Philippe Preux