ArchCAD-400K: An Open Large-Scale Architectural CAD Dataset and New Baseline for Panoptic Symbol Spotting

📅 2025-03-28
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Symbol recognition in architectural CAD drawings suffers from a lack of large-scale, high-quality datasets and domain-specific methods. Method: This paper introduces ArchCAD-400K—the first ultra-large-scale, open-source CAD symbol dataset (413K samples), featuring the most comprehensive category coverage and line-level instance annotations, 26× larger than the previous largest CAD dataset. We propose a fully automated annotation engine grounded in native CAD semantic parsing and formally define the novel panoptic symbol spotting task. To address it, we design DPSS, a dual-path CNN–Transformer model that jointly models line-granularity segmentation and symbol classification. Contribution/Results: On the symbol spotting benchmark, DPSS achieves a 12.7% mAP improvement over prior work, establishing new state-of-the-art performance with significantly enhanced robustness.

Technology Category

Application Category

📝 Abstract
Recognizing symbols in architectural CAD drawings is critical for various advanced engineering applications. In this paper, we propose a novel CAD data annotation engine that leverages intrinsic attributes from systematically archived CAD drawings to automatically generate high-quality annotations, thus significantly reducing manual labeling efforts. Utilizing this engine, we construct ArchCAD-400K, a large-scale CAD dataset consisting of 413,062 chunks from 5538 highly standardized drawings, making it over 26 times larger than the largest existing CAD dataset. ArchCAD-400K boasts an extended drawing diversity and broader categories, offering line-grained annotations. Furthermore, we present a new baseline model for panoptic symbol spotting, termed Dual-Pathway Symbol Spotter (DPSS). It incorporates an adaptive fusion module to enhance primitive features with complementary image features, achieving state-of-the-art performance and enhanced robustness. Extensive experiments validate the effectiveness of DPSS, demonstrating the value of ArchCAD-400K and its potential to drive innovation in architectural design and construction.
Problem

Research questions and friction points this paper is trying to address.

Automating symbol recognition in CAD drawings to reduce manual labeling.
Creating a large-scale CAD dataset with diverse architectural drawings.
Developing a robust baseline model for panoptic symbol spotting.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automated CAD annotation engine reduces manual labeling
Large-scale ArchCAD-400K dataset with line-grained annotations
Dual-Pathway Symbol Spotter model enhances feature fusion
🔎 Similar Papers
No similar papers found.
R
Ruifeng Luo
Tongji University, Arcplus East China Architectural Design & Research Institute Co., Ltd.
Z
Zhengjie Liu
Tongji University, Shanghai Innovation Institute
T
Tianxiao Cheng
Arcplus East China Architectural Design & Research Institute Co., Ltd.
J
Jie Wang
Arcplus East China Architectural Design & Research Institute Co., Ltd.
T
Tongjie Wang
Arcplus East China Architectural Design & Research Institute Co., Ltd.
X
Xingguang Wei
Shanghai AI Laboratory, University of Science and Technology of China
Haomin Wang
Haomin Wang
Shanghai AI Laboratory | Shanghai Jiao Tong University
Computer VisionMultimodal Large Language Models
Y
YanPeng Li
Arcplus East China Architectural Design & Research Institute Co., Ltd.
F
Fu Chai
Tongji University, Shanghai Innovation Institute
F
Fei Cheng
Tongji University
S
Shenglong Ye
Shanghai AI Laboratory
W
Wenhai Wang
Shanghai AI Laboratory
Yanting Zhang
Yanting Zhang
Donghua University
Y
Yu Qiao
Shanghai AI Laboratory
Hongjie Zhang
Hongjie Zhang
Nanjing University; Shanghai Artificial Intelligence Laboratory
Computer Vision
X
Xianzhong Zhao
Tongji University