Geramifard, Alborz

14 publications

ICLR 2024 Score Models for Offline Goal-Conditioned Reinforcement Learning Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum
ICLR 2024 When Should We Prefer Decision Transformers for Offline Reinforcement Learning? Prajjwal Bhargava, Rohan Chitnis, Alborz Geramifard, Shagun Sodhani, Amy Zhang
TMLR 2023 Robustness Through Data Augmentation Loss Consistency Tianjian Huang, Shaunak Ashish Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami
ICMLW 2023 Robustness Through Data Augmentation Loss Consistency Tianjian Huang, Shaunak Halbe, Chinnadhurai Sankar, Pooyan Amini, Satwik Kottur, Alborz Geramifard, Meisam Razaviyayn, Ahmad Beirami
NeurIPSW 2023 Score-Models for Offline Goal-Conditioned Reinforcement Learning Harshit Sikchi, Rohan Chitnis, Ahmed Touati, Alborz Geramifard, Amy Zhang, Scott Niekum
MLJ 2021 Guest Editorial: Special Issue on Reinforcement Learning for Real Life Yuxi Li, Alborz Geramifard, Lihong Li, Csaba Szepesvári, Tao Wang
MLOSS 2015 RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research Alborz Geramifard, Christoph Dann, Robert H. Klein, William Dabney, Jonathan P. How
FnTML 2013 A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning Alborz Geramifard, Thomas J. Walsh, Stefanie Tellex, Girish Chowdhary, Nicholas Roy, Jonathan P. How
UAI 2013 Batch-iFDD for Representation Expansion in Large MDPs Alborz Geramifard, Thomas J. Walsh, Nicholas Roy, Jonathan P. How
ECML-PKDD 2012 Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery N. Kemal Ure, Alborz Geramifard, Girish Chowdhary, Jonathan P. How
ICML 2011 Online Discovery of Feature Dependencies Alborz Geramifard, Finale Doshi, Josh Redding, Nicholas Roy, Jonathan P. How
UAI 2008 Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping Richard S. Sutton, Csaba Szepesvári, Alborz Geramifard, Michael H. Bowling
AAAI 2006 Incremental Least-Squares Temporal Difference Learning Alborz Geramifard, Michael H. Bowling, Richard S. Sutton
NeurIPS 2006 iLSTD: Eligibility Traces and Convergence Analysis Alborz Geramifard, Michael Bowling, Martin Zinkevich, Richard S. Sutton