Fast and Fine-Grained Autoscaler for Streaming Jobs with Reinforcement Learning

Abstract

On computing clusters, the autoscaler is responsible for allocating resources for jobs or fine-grained tasks to ensure their Quality of Service. Due to a more precise resource management, fine-grained autoscaling can generally achieve better performance. However, the fine-grained autoscaling for streaming jobs needs intensive computation to model the complicated running states of tasks, and has not been adequately studied previously. In this paper, we propose a novel fine-grained autoscaler for streaming jobs based on reinforcement learning. We first organize the running states of streaming jobs as spatio-temporal graphs. To efficiently make autoscaling decisions, we propose a Neural Variational Subgraph Sampler to sample spatio-temporal subgraphs. Furthermore, we propose a mutual-information-based objective function to explicitly guide the sampler to extract more representative subgraphs. After that, the autoscaler makes decisions based on the learned subgraph representations. Experiments conducted on real-world datasets demonstrate the superiority of our method over six competitive baselines.

Cite

Text

Xing et al. "Fast and Fine-Grained Autoscaler for Streaming Jobs with Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2022. doi:10.24963/IJCAI.2022/80

Markdown

[Xing et al. "Fast and Fine-Grained Autoscaler for Streaming Jobs with Reinforcement Learning." International Joint Conference on Artificial Intelligence, 2022.](https://mlanthology.org/ijcai/2022/xing2022ijcai-fast/) doi:10.24963/IJCAI.2022/80

BibTeX

@inproceedings{xing2022ijcai-fast,
  title     = {{Fast and Fine-Grained Autoscaler for Streaming Jobs with Reinforcement Learning}},
  author    = {Xing, Mingzhe and Mao, Hangyu and Xiao, Zhen},
  booktitle = {International Joint Conference on Artificial Intelligence},
  year      = {2022},
  pages     = {564-570},
  doi       = {10.24963/IJCAI.2022/80},
  url       = {https://mlanthology.org/ijcai/2022/xing2022ijcai-fast/}
}