Azimi, Alireza

1 publication

NeurIPS 2024. Deep Policy Gradient Methods Without Batch Updates, Target Networks, or Replay Buffers. Gautham Vasan, Mohamed Elsayed, Alireza Azimi, Jiamin He, Fahim Shariar, Colin Bellinger, Martha White, A. Rupam Mahmood