Estimating Agent Skill in Continuous Action Domains

Abstract

Actions in most real-world continuous domains cannot be executed exactly. An agent’s performance in these domains is influenced by two critical factors: the ability to select effective actions (decision-making skill), and how precisely it can execute those selected actions (execution skill). This article addresses the problem of estimating the execution and decision-making skill of an agent, given observations. Several execution skill estimation methods are presented, each of which utilize different information from the observations and make assumptions about the agent’s decision-making ability. A final novel method forgoes these assumptions about decision-making and instead estimates the execution and decision-making skills simultaneously under a single Bayesian framework. Experimental results in several domains evaluate the estimation accuracy of the estimators, especially focusing on how robust they are as agents and their decision-making methods are varied. These results demonstrate that reasoning about both types of skill together significantly improves the robustness and accuracy of execution skill estimation. A case study is presented using the proposed methods to estimate the skill of Major League Baseball pitchers, demonstrating how these methods can be applied to real-world data sources.

Cite

Text

Archibald and Nieves-Rivera. "Estimating Agent Skill in Continuous Action Domains." Journal of Artificial Intelligence Research, 2024. doi:10.1613/JAIR.1.15326

Markdown

[Archibald and Nieves-Rivera. "Estimating Agent Skill in Continuous Action Domains." Journal of Artificial Intelligence Research, 2024.](https://mlanthology.org/jair/2024/archibald2024jair-estimating/) doi:10.1613/JAIR.1.15326

BibTeX

@article{archibald2024jair-estimating,
  title     = {{Estimating Agent Skill in Continuous Action Domains}},
  author    = {Archibald, Christopher and Nieves-Rivera, Delma},
  journal   = {Journal of Artificial Intelligence Research},
  year      = {2024},
  pages     = {27-86},
  doi       = {10.1613/JAIR.1.15326},
  volume    = {80},
  url       = {https://mlanthology.org/jair/2024/archibald2024jair-estimating/}
}