ArchCAD-400k: A Large-Scale CAD Drawings Dataset and New Baseline for Panoptic Symbol Spotting

Abstract

Recognizing symbols in architectural CAD drawings is critical for various advanced engineering applications. In this paper, we propose a novel CAD data annotation engine that leverages intrinsic attributes from systematically archived CAD drawings to automatically generate high-quality annotations, thus significantly reducing manual labeling efforts. Utilizing this engine, we construct ArchCAD-400K, a large-scale CAD dataset consisting of 413,062 chunks from 5538 highly standardized drawings, making it over 26 times larger than the largest existing CAD dataset. ArchCAD-400K boasts an extended drawing diversity and broader categories, offering line-grained annotations. Furthermore, we present a new baseline model for panoptic symbol spotting, termed Dual-Pathway Symbol Spotter (DPSS). It incorporates an adaptive fusion module to enhance primitive features with complementary image features, achieving state-of-the-art performance and enhanced robustness. Extensive experiments validate the effectiveness of DPSS, demonstrating the value of ArchCAD-400K and its potential to drive innovation in architectural design and construction.

Cite

Text

Luo et al. "ArchCAD-400k: A Large-Scale CAD Drawings Dataset and New Baseline for Panoptic Symbol Spotting." Advances in Neural Information Processing Systems, 2025.

Markdown

[Luo et al. "ArchCAD-400k: A Large-Scale CAD Drawings Dataset and New Baseline for Panoptic Symbol Spotting." Advances in Neural Information Processing Systems, 2025.](https://mlanthology.org/neurips/2025/luo2025neurips-archcad400k/)

BibTeX

@inproceedings{luo2025neurips-archcad400k,
  title     = {{ArchCAD-400k: A Large-Scale CAD Drawings Dataset and New Baseline for Panoptic Symbol Spotting}},
  author    = {Luo, Ruifeng and Liu, Zhengjie and Cheng, Tianxiao and Wang, Jie and Wang, Tongjie and Cheng, Fei and Chai, Fu and Li, Yanpeng and Wei, Xingguang and Wang, Haomin and Ye, Shenglong and Wang, Wenhai and Zhang, Yanting and Qiao, Yu and Zhang, Hongjie and Zhao, Xianzhong},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2025},
  url       = {https://mlanthology.org/neurips/2025/luo2025neurips-archcad400k/}
}