Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search

Liu, Jiashun; Hao, Xiaotian; Hao, Jianye; Zheng, Yan; Hu, Yujing; Fan, Changjie; Lv, Tangjie; Hu, Zhipeng

Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search

Jiashun Liu, Xiaotian Hao, Jianye Hao, Yan Zheng, Yujing Hu, Changjie Fan, Tangjie Lv, Zhipeng Hu

UAI 2024 pp. 2294-2308

/uai/2024/liu2024uai-hybrid/

Abstract

Hybrid action control tasks are common in the real world, which require controlling some discrete and continuous actions simultaneously. To solve these tasks, existing Deep Reinforcement learning (DRL) methods either directly build a separate policy for each type of action or simplify the hybrid action space into a discrete or continuous action control problem. However, these methods neglect the challenge of exploration resulting from the complexity of the hybrid action space. Thus, it is necessary to design more sample efficient algorithms. To this end, we propose a novel Hybrid Control Transformer (Hybrid CtrlFormer), to achieve better exploration and exploitation for the hybrid action control problems. The core idea is: 1) we construct a hybrid action space tree with the discrete actions at the higher level and the continuous parameter space at the lower level. Each parameter space is split into multiple subregions. 2) To simplify the exploration space, a Transformer-based Monte-Carlo tree search method is designed to efficiently evaluate and partition the hybrid action space into good and bad subregions along the tree. Our method achieves state-of-the-art performance and sample efficiency in a variety of environments with discrete-continuous action space.

PDF UAI OpenReview Semantic Scholar

Cite

Text

Liu et al. "Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search." Uncertainty in Artificial Intelligence, 2024.

Markdown

[Liu et al. "Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search." Uncertainty in Artificial Intelligence, 2024.](https://mlanthology.org/uai/2024/liu2024uai-hybrid/)

BibTeX

@inproceedings{liu2024uai-hybrid,
  title     = {{Hybrid CtrlFormer: Learning Adaptive Search Space Partition for Hybrid Action Control via Transformer-Based Monte Carlo Tree Search}},
  author    = {Liu, Jiashun and Hao, Xiaotian and Hao, Jianye and Zheng, Yan and Hu, Yujing and Fan, Changjie and Lv, Tangjie and Hu, Zhipeng},
  booktitle = {Uncertainty in Artificial Intelligence},
  year      = {2024},
  pages     = {2294-2308},
  volume    = {244},
  url       = {https://mlanthology.org/uai/2024/liu2024uai-hybrid/}
}