Parr, Ronald
52 publications
NeurIPS
2025
A Unifying View of Linear Function Approximation in Off-Policy RL Through Matrix Splitting and Preconditioning
NeurIPS
2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
ICMLW
2024
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
AAAI
2016
Efficient PAC-Optimal Exploration in Concurrent, Continuous State MDPs with Delayed Updates
AAAI
2013
Sample Complexity and Performance Bounds for Non-Parametric Approximate Linear Programming
AAAI
2010
Complexity of Computing Optimal Stackelberg Strategies in Security Resource Allocation Games
ICML
2010
Feature Selection Using Regularization in Approximate Linear Programs for Markov Decision Processes