Restelli, Marcello
132 publications
AISTATS
2025
Achieving $\widetilde\mathcal{O}(\sqrt{T})$ Regret in Average-Reward POMDPs with Known Observation Models
AISTATS
2025
Efficient Exploitation of Hierarchical Structure in Sparse Reward Reinforcement Learning
ECML-PKDD
2024
Interpetable Target-Feature Aggregation for Multi-Task Learning Based on Bias-Variance Analysis
NeurIPS
2023
Distributional Policy Evaluation: A Maximum Entropy Approach to Representation Learning
NeurIPSW
2023
Exploiting Causal Representations in Reinforcement Learning: A Posterior Sampling Approach
UAI
2023
On the Relation Between Policy Improvement and Off-Policy Minimum-Variance Policy Evaluation
AAAI
2023
Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization
ECML-PKDD
2023
Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes
AAAI
2023
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control
AISTATS
2022
Finite Sample Analysis of Mean-Volatility Actor-Critic for Risk-Averse Reinforcement Learning
IJCAI
2022
Multi-Armed Bandit Problem with Temporally-Partitioned Rewards: When Partial Feedback Counts
NeurIPSW
2022
Provably Efficient Causal Model-Based Reinforcement Learning for Environment-Agnostic Generalization
NeurIPS
2021
Subgaussian and Differentiable Importance Sampling for Off-Policy Evaluation and Learning
NeurIPS
2020
An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits
AAAI
2020
An Intrinsically-Motivated Approach for Learning Highly Exploring and Fast Mixing Policies
UAI
2017
Regret Minimization Algorithms for the Followers Behaviour Identification in Leadership Games
AAAI
2016
Sequence-Form and Evolutionary Dynamics: Realization Equivalence to Agent Form and Logit Dynamics