Bin Moon, Sang

1 publication

UAI 2024. Optimistic Regret Bounds for Online Learning in Adversarial Markov Decision Processes. Sang Bin Moon, Abolfazl Hashemi.