Lifelong Learning with a Changing Action Set

Chandak, Yash; Theocharous, Georgios; Nota, Chris; Thomas, Philip S.

doi:10.1609/AAAI.V34I04.5739

Lifelong Learning with a Changing Action Set

Yash Chandak, Georgios Theocharous, Chris Nota, Philip S. Thomas

AAAI 2020 pp. 3373-3380

doi:10.1609/AAAI.V34I04.5739 /aaai/2020/chandak2020aaai-lifelong/

Abstract

In many real-world sequential decision making problems, the number of available actions (decisions) can vary over time. While problems like catastrophic forgetting, changing transition dynamics, changing rewards functions, etc. have been well-studied in the lifelong learning literature, the setting where the size of the action set changes remains unaddressed. In this paper, we present first steps towards developing an algorithm that autonomously adapts to an action set whose size changes over time. To tackle this open problem, we break it into two problems that can be solved iteratively: inferring the underlying, unknown, structure in the space of actions and optimizing a policy that leverages this structure. We demonstrate the efficiency of this approach on large-scale real-world lifelong learning problems.

PDF AAAI Semantic Scholar

Cite

Text

Chandak et al. "Lifelong Learning with a Changing Action Set." AAAI Conference on Artificial Intelligence, 2020. doi:10.1609/AAAI.V34I04.5739

Markdown

[Chandak et al. "Lifelong Learning with a Changing Action Set." AAAI Conference on Artificial Intelligence, 2020.](https://mlanthology.org/aaai/2020/chandak2020aaai-lifelong/) doi:10.1609/AAAI.V34I04.5739

BibTeX

@inproceedings{chandak2020aaai-lifelong,
  title     = {{Lifelong Learning with a Changing Action Set}},
  author    = {Chandak, Yash and Theocharous, Georgios and Nota, Chris and Thomas, Philip S.},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2020},
  pages     = {3373-3380},
  doi       = {10.1609/AAAI.V34I04.5739},
  url       = {https://mlanthology.org/aaai/2020/chandak2020aaai-lifelong/}
}