Talebi, Mohammad Sadegh

10 publications

ICLR 2025 Offline RL in Regular Decision Processes: Sample Efficiency via Language Metrics Ahana Deb, Roberto Cipollone, Anders Jonsson, Alessandro Ronca, Mohammad Sadegh Talebi
AISTATS 2023 Exploration in Reward Machines with Low Regret Hippolyte Bourel, Anders Jonsson, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi
ACML 2023 Logarithmic Regret in Communicating MDPs: Leveraging Known Dynamics with Bandits Hassan Saber, Fabien Pesquerel, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi
NeurIPS 2023 Provably Efficient Offline Reinforcement Learning in Regular Decision Processes Roberto Cipollone, Anders Jonsson, Alessandro Ronca, Mohammad Sadegh Talebi
ICMLW 2022 Exploration in Reward Machines with Low Regret Hippolyte Bourel, Anders Jonsson, Odalric-Ambrym Maillard, Mohammad Sadegh Talebi
NeurIPS 2020 Adversarial Bandits with Corruptions: Regret Lower Bound and No-Regret Algorithm Lin Yang, Mohammad Hajiesmaili, Mohammad Sadegh Talebi, John C. S. Lui, Wing Shing Wong
ICML 2020 Tightening Exploration in Upper Confidence Reinforcement Learning Hippolyte Bourel, Odalric Maillard, Mohammad Sadegh Talebi
NeurIPS 2019 Learning Multiple Markov Chains via Adaptive Allocation Mohammad Sadegh Talebi, Odalric-Ambrym Maillard
ACML 2019 Model-Based Reinforcement Learning Exploiting State-Action Equivalence Mahsa Asadi, Mohammad Sadegh Talebi, Hippolyte Bourel, Odalric-Ambrym Maillard
ALT 2018 Variance-Aware Regret Bounds for Undiscounted Reinforcement Learning in MDPs Mohammad Sadegh Talebi, Odalric-Ambrym Maillard