Müller, Martin

34 publications

TMLR 2025 ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control Ehsan Futuhi, Shayan Karimi, Chao Gao, Martin Müller
TMLR 2024 A Distance-Based Anomaly Detection Framework for Deep Reinforcement Learning Hongming Zhang, Ke Sun, Bo Xu, Linglong Kong, Martin Müller
IJCAI 2024 Expected Work Search: Combining Win Rate and Proof Size Estimation Owen Randall, Martin Müller, Ting-Han Wei, Ryan Hayward
NeurIPS 2024 Exploiting the Replay Memory Before Exploring the Environment: Enhancing Reinforcement Learning Through Empirical MDP Iteration Hongming Zhang, Chenjun Xiao, Chao Gao, Han Wang, Bo Xu, Martin Müller
AAAI 2024 Monte Carlo Tree Search in the Presence of Transition Uncertainty Farnaz Kohankhaki, Kiarash Aghakasiri, Hongming Zhang, Ting-Han Wei, Chao Gao, Martin Müller
ICLR 2023 Replay Memory as an Empirical MDP: Combining Conservative Estimation with Experience Replay Hongming Zhang, Chenjun Xiao, Han Wang, Jun Jin, Bo Xu, Martin Müller
AAAI 2020 Guiding CDCL SAT Search via Random Exploration amid Conflict Depression Md. Solimul Chowdhury, Martin Müller, Jia-Huai You
NeurIPS 2019 Maximum Entropy Monte-Carlo Planning Chenjun Xiao, Ruitong Huang, Jincheng Mei, Dale Schuurmans, Martin Müller
IJCAI 2019 On Principled Entropy Exploration in Policy Optimization Jincheng Mei, Chenjun Xiao, Ruitong Huang, Dale Schuurmans, Martin Müller
IJCAI 2018 Analyzing the Impact of Knowledge and Search in Monte Carlo Tree Search in Go Farhad Haqiqat, Martin Müller
AAAI 2018 Memory-Augmented Monte Carlo Tree Search Chenjun Xiao, Jincheng Mei, Martin Müller
AAAI 2018 Preliminary Results on Exploration-Driven Satisfiability Solving Md. Solimul Chowdhury, Martin Müller, Jia-Huai You
IJCAI 2018 Three-Head Neural Network Architecture for Monte Carlo Tree Search Chao Gao, Martin Müller, Ryan Hayward
IJCAI 2017 Additive Merge-and-Shrink Heuristics for Diverse Action Costs Gaojian Fan, Martin Müller, Robert Holte
IJCAI 2017 Focused Depth-First Proof Number Search Using Convolutional Neural Networks for the Game of Hex Chao Gao, Martin Müller, Ryan Hayward
ALT 2017 Structured Best Arm Identification with Fixed Confidence Ruitong Huang, Mohammad M. Ajallooeian, Csaba Szepesvári, Martin Müller
AAAI 2016 Factorization Ranking Model for Move Prediction in the Game of Go Chenjun Xiao, Martin Müller
AAAI 2015 TDS+: Improving Temperature Discovery Search Yeqin Zhang, Martin Müller
AAAI 2014 Adding Local Exploration to Greedy Best-First Search in Satisficing Planning Fan Xie, Martin Müller, Robert Holte
AAAI 2014 Type-Based Exploration with Multiple Search Queues for Satisficing Planning Fan Xie, Martin Müller, Robert Holte, Tatsuya Imai
IJCAI 2013 Towards a Second Generation Random Walk Planner: An Experimental Exploration Hootan Nakhost, Martin Müller
MLJ 2012 Temporal-Difference Search in Computer Go David Silver, Richard S. Sutton, Martin Müller
AAAI 2011 A Local Monte Carlo Tree Search Approach in Deterministic Planning Fan Xie, Hootan Nakhost, Martin Müller
IJCAI 2009 Monte-Carlo Exploration for Deterministic Planning Hootan Nakhost, Martin Müller
ICML 2008 Sample-Based Learning and Search with Permanent and Transient Memories David Silver, Richard S. Sutton, Martin Müller
IJCAI 2007 Fast Planning with Iterative Macros Adi Botea, Martin Müller, Jonathan Schaeffer
IJCAI 2007 Lambda Depth-First Proof Number Search and Its Application to Go Kazuki Yoshizoe, Akihiro Kishimoto, Martin Müller
IJCAI 2007 Reinforcement Learning of Local Shape in the Game of Go David Silver, Richard S. Sutton, Martin Müller
JAIR 2005 Macro-FF: Improving AI Planning with Automatically Learned Macro-Operators Adi Botea, Markus Enzenberger, Martin Müller, Jonathan Schaeffer
AAAI 2005 Search Versus Knowledge for Solving Life and Death Problems in Go Akihiro Kishimoto, Martin Müller
IJCAI 2005 Solving Checkers Jonathan Schaeffer, Yngvi Björnsson, Neil Burch, Akihiro Kishimoto, Martin Müller, Robert Lake, Paul Lu, Steve Sutphen
AAAI 2004 A General Solution to the Graph History Interaction Problem Akihiro Kishimoto, Martin Müller
AAAI 2004 Temperature Discovery Search Martin Müller, Markus Enzenberger, Jonathan Schaeffer
IJCAI 1999 Decomposition Search: A Combinatorial Games Approach to Game Tree Search, with Applications to Solving Go Endgames Martin Müller