Merlis, Nadav

20 publications

ICLR 2025 On Bits and Bandits: Quantifying the Regret-Information Trade-Off Itai Shufaro, Nadav Merlis, Nir Weinberger, Shie Mannor
NeurIPS 2025 Stable Matching with Ties: Approximation Ratios and Learning Shiyun Lin, Simon Mauras, Nadav Merlis, Vianney Perchet
NeurIPS 2024 Improved Algorithms for Contextual Dynamic Pricing Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet
ICMLW 2024 Improved Algorithms for Contextual Dynamic Pricing Matilde Tullii, Solenne Gaucher, Nadav Merlis, Vianney Perchet
AISTATS 2024 Multi-Armed Bandits with Guaranteed Revenue per Arm Dorian Baudry, Nadav Merlis, Mathieu Benjamin Molina, Hugo Richard, Vianney Perchet
NeurIPS 2024 Reinforcement Learning with Lookahead Information Nadav Merlis
ICMLW 2024 Reinforcement Learning with Lookahead Information Nadav Merlis
NeurIPS 2024 The Value of Reward Lookahead in Reinforcement Learning Nadav Merlis, Dorian Baudry, Vianney Perchet
ICMLW 2024 The Value of Reward Lookahead in Reinforcement Learning Nadav Merlis, Dorian Baudry, Vianney Perchet
ICML 2023 On Preemption and Learning in Stochastic Scheduling Nadav Merlis, Hugo Richard, Flore Sentenac, Corentin Odic, Mathieu Molina, Vianney Perchet
ICML 2023 Reinforcement Learning with History Dependent Dynamic Contexts Guy Tennenholtz, Nadav Merlis, Lior Shani, Martin Mladenov, Craig Boutilier
NeurIPS 2022 Reinforcement Learning with a Terminator Guy Tennenholtz, Nadav Merlis, Lior Shani, Shie Mannor, Uri Shalit, Gal Chechik, Assaf Hallak, Gal Dalal
ICML 2021 Confidence-Budget Matching for Sequential Budgeted Learning Yonathan Efroni, Nadav Merlis, Aadirupa Saha, Shie Mannor
ICML 2021 Ensemble Bootstrapping for Q-Learning Oren Peer, Chen Tessler, Nadav Merlis, Ron Meir
AAAI 2021 Lenient Regret for Multi-Armed Bandits Nadav Merlis, Shie Mannor
AAAI 2021 Reinforcement Learning with Trajectory Feedback Yonathan Efroni, Nadav Merlis, Shie Mannor
COLT 2020 Tight Lower Bounds for Combinatorial Multi-Armed Bandits Nadav Merlis, Shie Mannor
COLT 2019 Batch-Size Independent Regret Bounds for the Combinatorial Multi-Armed Bandit Problem Nadav Merlis, Shie Mannor
NeurIPS 2019 Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies Yonathan Efroni, Nadav Merlis, Mohammad Ghavamzadeh, Shie Mannor
NeurIPS 2018 Learn What Not to Learn: Action Elimination with Deep Reinforcement Learning Tom Zahavy, Matan Haroush, Nadav Merlis, Daniel J Mankowitz, Shie Mannor