Kermanshah, Mehdi

1 publication

L4DC 2025. Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards. Ahmad Ahmad, Mehdi Kermanshah, Kevin Leahy, Zachary Serlin, Ho Chit Siu, Makai Mann, Cristian-Ioan Vasile, Roberto Tron, Calin Belta