🤖 AI Summary
The natural language complexity of building codes and the scarcity of annotated resources severely hinder the deployment of automated code compliance checking (ACC) in the architecture, engineering, and construction (AEC) domain. To address this, we propose the first structured corpus construction framework specifically designed for building regulations, enabling fine-grained semantic annotation and formal logical modeling of major codes from China, the U.S., and the EU—thereby bridging regulatory texts and computational rule engines. Our methodology integrates domain-specific ontology modeling, rule engineering, dependency parsing, and multi-source structured data alignment. The publicly released corpus covers 12 core codes across three jurisdictions. Leveraging this resource, generated executable compliance rules achieve 89.2% accuracy, significantly enhancing the cross-jurisdictional generalizability and practical engineering applicability of ACC systems.