Automated Superscalar Processor Design by Learning Data Dependencies
Abstract
Automated processor design, which can significantly reduce human efforts and accelerate design cycles, has received considerable attention. While recent advancements have automatically designed single-cycle processors that execute one instruction per cycle, their performance cannot compete with modern superscalar processors that execute multiple instructions per cycle. Previous methods fail on superscalar processor design because they cannot address inter-instruction data dependencies, leading to inefficient sequential instruction execution. This paper proposes a novel approach to automatically designing superscalar processors using a hardware-friendly model called the Stateful Binary Speculation Diagram (State-BSD). We observe that processor parallelism can be enhanced through on-the-fly inter-instruction dependent data predictors, reusing the processor's internal states to learn the data dependency. To meet the challenge of both hardware-resource limitation and design functional correctness, State-BSD consists of two components: 1) a lightweight state-selector trained by simulated annealing method to detect the most reusable processor states and store them in a small buffer; and 2) a highly precise state-speculator trained by BSD expansion method to predict the inter-instruction dependent data using the selected states. It is the first work to achieve the automated superscalar processor design, i.e. QiMeng-CPU-v2, which improves the performance by about 380x than the state-of-the-art automated design and is comparable to human-designed superscalar processors such as ARM Cortex A53.
Cite
Text
Cheng et al. "Automated Superscalar Processor Design by Learning Data Dependencies." International Joint Conference on Artificial Intelligence, 2025. doi:10.24963/IJCAI.2025/549Markdown
[Cheng et al. "Automated Superscalar Processor Design by Learning Data Dependencies." International Joint Conference on Artificial Intelligence, 2025.](https://mlanthology.org/ijcai/2025/cheng2025ijcai-automated/) doi:10.24963/IJCAI.2025/549BibTeX
@inproceedings{cheng2025ijcai-automated,
title = {{Automated Superscalar Processor Design by Learning Data Dependencies}},
author = {Cheng, Shuyao and Zhang, Rui and He, Wenkai and Jin, Pengwei and Li, Chongxiao and Du, Zidong and Hu, Xing and Hao, Yifan and Xu, Guanglin and Wen, Yuanbo and Li, Ling and Guo, Qi and Chen, Yunji},
booktitle = {International Joint Conference on Artificial Intelligence},
year = {2025},
pages = {4932-4939},
doi = {10.24963/IJCAI.2025/549},
url = {https://mlanthology.org/ijcai/2025/cheng2025ijcai-automated/}
}