FastPERT: Towards Fast Microservice Application Latency Prediction via Structural Inductive Bias over PERT Networks

Abstract

The recent surge in popularity of cloud-native applications using microservice architectures has led to a focus on accurate end-to-end latency prediction for proactive resource allocation. Existing models leverage Graph Transformers to Microservice Call Graphs or the Program Evaluation and Review Technique (PERT) graphs to capture complex temporal dependencies between microservices. However, these models incur a high computational cost during both training and inference phases. This paper introduces FastPERT, an efficient model for predicting end-to-end latency in microservice applications. FastPERT dissects an execution trace into several microservices tasks, using observations from prior execution traces of the application, akin to the PERT approach. Subsequently, a prediction model is constructed to estimate the completion time for each individual task. This information, coupled with the computational and structural inductive bias of the PERT graph, facilitates the efficient computation of the end-to-end latency of an execution trace. As a result, FastPERT can efficiently capture the complex temporal causality of different microservice tasks without relying on Graph Neural Networks, leading to more accurate and robust latency predictions across a variety of applications. An evaluation based on datasets generated from large-scale Alibaba microservice traces reveals that FastPERT significantly improves training and inference efficiency without compromising performance, demonstrating its potential as a superior solution for real-time end-to-end latency prediction in cloud-native microservice applications.

Cite

Text

Tam et al. "FastPERT: Towards Fast Microservice Application Latency Prediction via Structural Inductive Bias over PERT Networks." AAAI Conference on Artificial Intelligence, 2025. doi:10.1609/AAAI.V39I19.34291

Markdown

[Tam et al. "FastPERT: Towards Fast Microservice Application Latency Prediction via Structural Inductive Bias over PERT Networks." AAAI Conference on Artificial Intelligence, 2025.](https://mlanthology.org/aaai/2025/tam2025aaai-fastpert/) doi:10.1609/AAAI.V39I19.34291

BibTeX

@inproceedings{tam2025aaai-fastpert,
  title     = {{FastPERT: Towards Fast Microservice Application Latency Prediction via Structural Inductive Bias over PERT Networks}},
  author    = {Tam, Da Sun Handason and Xu, Huanle and Liu, Yang and Xie, Siyue and Lau, Wing Cheong},
  booktitle = {AAAI Conference on Artificial Intelligence},
  year      = {2025},
  pages     = {20787-20795},
  doi       = {10.1609/AAAI.V39I19.34291},
  url       = {https://mlanthology.org/aaai/2025/tam2025aaai-fastpert/}
}