Gosavi, Abhijit

1 publications

MLJ 2004 A Reinforcement Learning Algorithm Based on Policy Iteration for Average Reward: Empirical Results with Yield Management and Convergence Analysis Abhijit Gosavi