Combining Learned Discrete and Continuous Action Models

Abstract

Action modeling is an important skill for agents that must perform tasks in novel domains. Previous work on action modeling has focused on learning STRIPS operators in discrete, relational domains. There has also been a separate vein of work in continuous function approximation for use in optimal control in robotics. Most real-world domains are grounded in continuous dynamics but also exhibit emergent regularities at an abstract relational level of description. These two levels of regularity are often difficult to capture using a single action representation and learning method. In this paper we describe a system that combines discrete and continuous action modeling techniques in the Soar cognitive architecture. Our system accepts a continuous state representation from the environment and derives a relational state on top of it using spatial relations. The dynamics over each representation are learned separately using two simple instance-based algorithms. The predictions from the individual models are then combined in a way that takes advantage of the information captured by each representation. We empirically show that this combined model is more accurate and generalizable than either of the individual models in a spatial navigation domain.
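
The abstract does not specify the spatial relations, the instance-based learners, or the rule used to combine predictions. The Python sketch below is only an illustration of the general idea under stated assumptions, not the authors' system and not Soar code: the predicates `near` and `left-of`, the class `CombinedModel`, the `touch_dist` threshold, and the gating rule (consult only stored instances whose relational state matches the query, then take the nearest one in continuous space) are all hypothetical choices made for this example.

```python
import math


def relational_state(pos, obstacles, touch_dist=1.0):
    """Derive relational predicates from a continuous (x, y) position.

    The predicates and the distance threshold are assumptions for this sketch.
    """
    rels = set()
    for name, (ox, oy) in obstacles.items():
        if math.dist(pos, (ox, oy)) <= touch_dist:
            rels.add(("near", name))
        if pos[0] < ox:
            rels.add(("left-of", name))
    return frozenset(rels)


class CombinedModel:
    """Instance-based predictor that uses relational state to gate lookups."""

    def __init__(self, obstacles):
        self.obstacles = obstacles
        # Each instance: (relational state, position, action, displacement).
        self.instances = []

    def observe(self, pos, action, next_pos):
        """Store one observed transition."""
        delta = (next_pos[0] - pos[0], next_pos[1] - pos[1])
        rels = relational_state(pos, self.obstacles)
        self.instances.append((rels, pos, action, delta))

    def predict(self, pos, action):
        """Predict the next position for taking `action` at `pos`."""
        same_action = [i for i in self.instances if i[2] == action]
        if not same_action:
            return pos  # no experience yet: assume nothing changes
        rels = relational_state(pos, self.obstacles)
        # Assumed combination rule: prefer instances with the same relational
        # state; fall back to all instances of this action if none match.
        matching = [i for i in same_action if i[0] == rels] or same_action
        nearest = min(matching, key=lambda i: math.dist(i[1], pos))
        delta = nearest[3]
        return (pos[0] + delta[0], pos[1] + delta[1])


# Toy navigation example: moving east is blocked when near the wall.
model = CombinedModel({"wall": (5.0, 0.0)})
model.observe((0.0, 0.0), "east", (1.0, 0.0))  # free movement
model.observe((4.5, 0.0), "east", (4.5, 0.0))  # blocked by the wall
free_prediction = model.predict((1.0, 0.0), "east")
blocked_prediction = model.predict((4.2, 0.0), "east")
```

In this toy setup the relational layer separates "near the wall" experiences from "far from the wall" ones, so a query is matched against instances from the same qualitative situation before continuous distance is considered. Other combination rules are possible; the abstract alone does not say which one the paper uses.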

Cite

Text

Xu and Laird. "Combining Learned Discrete and Continuous Action Models." AAAI Conference on Artificial Intelligence, 2011. doi:10.1609/AAAI.V25I1.7833

Markdown

[Xu and Laird. "Combining Learned Discrete and Continuous Action Models." AAAI Conference on Artificial Intelligence, 2011.](https://mlanthology.org/aaai/2011/xu2011aaai-combining/) doi:10.1609/AAAI.V25I1.7833

BibTeX

@inproceedings{xu2011aaai-combining,
  title     = {{Combining Learned Discrete and Continuous Action Models}},
  author    = {Xu, Joseph Z. and Laird, John E.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2011},
  pages     = {1449-1454},
  doi       = {10.1609/AAAI.V25I1.7833},
  url       = {https://mlanthology.org/aaai/2011/xu2011aaai-combining/}
}