ML Anthology
Authors
Search
About
Geramifard, Alborz
14 publications
ICLR
2024
Score Models for Offline Goal-Conditioned Reinforcement Learning
Harshit Sikchi
,
Rohan Chitnis
,
Ahmed Touati
,
Alborz Geramifard
,
Amy Zhang
,
Scott Niekum
ICLR
2024
When Should We Prefer Decision Transformers for Offline Reinforcement Learning?
Prajjwal Bhargava
,
Rohan Chitnis
,
Alborz Geramifard
,
Shagun Sodhani
,
Amy Zhang
TMLR
2023
Robustness Through Data Augmentation Loss Consistency
Tianjian Huang
,
Shaunak Ashish Halbe
,
Chinnadhurai Sankar
,
Pooyan Amini
,
Satwik Kottur
,
Alborz Geramifard
,
Meisam Razaviyayn
,
Ahmad Beirami
ICMLW
2023
Robustness Through Data Augmentation Loss Consistency
Tianjian Huang
,
Shaunak Halbe
,
Chinnadhurai Sankar
,
Pooyan Amini
,
Satwik Kottur
,
Alborz Geramifard
,
Meisam Razaviyayn
,
Ahmad Beirami
NeurIPSW
2023
Score-Models for Offline Goal-Conditioned Reinforcement Learning
Harshit Sikchi
,
Rohan Chitnis
,
Ahmed Touati
,
Alborz Geramifard
,
Amy Zhang
,
Scott Niekum
MLJ
2021
Guest Editorial: Special Issue on Reinforcement Learning for Real Life
Yuxi Li
,
Alborz Geramifard
,
Lihong Li
,
Csaba Szepesvári
,
Tao Wang
MLOSS
2015
RLPy: A Value-Function-Based Reinforcement Learning Framework for Education and Research
Alborz Geramifard
,
Christoph Dann
,
Robert H. Klein
,
William Dabney
,
Jonathan P. How
FnTML
2013
A Tutorial on Linear Function Approximators for Dynamic Programming and Reinforcement Learning
Alborz Geramifard
,
Thomas J. Walsh
,
Stefanie Tellex
,
Girish Chowdhary
,
Nicholas Roy
,
Jonathan P. How
UAI
2013
Batch-iFDD for Representation Expansion in Large MDPs
Alborz Geramifard
,
Thomas J. Walsh
,
Nicholas Roy
,
Jonathan P. How
ECML-PKDD
2012
Adaptive Planning for Markov Decision Processes with Uncertain Transition Models via Incremental Feature Dependency Discovery
N. Kemal Ure
,
Alborz Geramifard
,
Girish Chowdhary
,
Jonathan P. How
ICML
2011
Online Discovery of Feature Dependencies
Alborz Geramifard
,
Finale Doshi
,
Josh Redding
,
Nicholas Roy
,
Jonathan P. How
UAI
2008
Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping
Richard S. Sutton
,
Csaba Szepesvári
,
Alborz Geramifard
,
Michael H. Bowling
AAAI
2006
Incremental Least-Squares Temporal Difference Learning
Alborz Geramifard
,
Michael H. Bowling
,
Richard S. Sutton
NeurIPS
2006
iLSTD: Eligibility Traces and Convergence Analysis
Alborz Geramifard
,
Michael Bowling
,
Martin Zinkevich
,
Richard S. Sutton