CODE-ACCORD: A Corpus of building regulatory data for rule generation towards automatic compliance checking

📅 2024-03-04
🏛️ Scientific Data
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
The natural language complexity of building codes and the scarcity of annotated resources severely hinder the deployment of automated code compliance checking (ACC) in the architecture, engineering, and construction (AEC) domain. To address this, we propose the first structured corpus construction framework specifically designed for building regulations, enabling fine-grained semantic annotation and formal logical modeling of major codes from China, the U.S., and the EU—thereby bridging regulatory texts and computational rule engines. Our methodology integrates domain-specific ontology modeling, rule engineering, dependency parsing, and multi-source structured data alignment. The publicly released corpus covers 12 core codes across three jurisdictions. Leveraging this resource, generated executable compliance rules achieve 89.2% accuracy, significantly enhancing the cross-jurisdictional generalizability and practical engineering applicability of ACC systems.

Technology Category

Application Category

Problem

Research questions and friction points this paper is trying to address.

Automating building regulation interpretation
Converting textual rules to machine-readable formats
Supporting ML tasks for compliance checking
Innovation

Methods, ideas, or system contributions that make the work stand out.

Manual annotation for machine-readable rules
Dataset supports diverse NLP tasks
Deep neural networks for compliance checking
🔎 Similar Papers
No similar papers found.
H
Hansi Hettiarachchi
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK
A
Amna Dridi
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK
Mohamed Medhat Gaber
Mohamed Medhat Gaber
Adjunct Professor, Queensland University of Technology
Artificial IntelligenceData MiningData Stream MiningMachine LearningRandom Forests
P
Pouyan Parsafard
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK
N
Nicoleta Bocaneala
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK
K
Katja Breitenfelder
Fraunhofer Institute for Building Physics IBP, Department Indoor Climate and Climatic Impacts, Fraunhofer Str. 10, 83626 Valley, Germany
G
Gonçal Costa Jutglar
Human Environment Research (HER), La Salle, Ramon Llull University, Barcelona, Catalonia, Spain
Maria M. Hedblom
Maria M. Hedblom
Jönköping University
cognitive scienceapplied ontologyAIcognitive linguisticscomputational creativity
M
Mihaela Juganaru-Mathieu
Mines Saint-Etienne, Institut Henri Fayol, Département ISI, F - 42023, Saint-Etienne, France
T
Thamer Mecharnia
Université de Lorraine, CNRS, LORIA, 54506, Vandœuvre-lès-Nancy, France
S
Sumee Park
Fraunhofer Institute for Building Physics IBP, Department Indoor Climate and Climatic Impacts, Fraunhofer Str. 10, 83626 Valley, Germany
H
He Tan
Department of Computing, School of Engineering, Jönköping University, Box 1026, 551 11, Jönköping, Sweden
Abdel-Rahman H. Tawil
Abdel-Rahman H. Tawil
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK
E
Edlira Vakaj
Faculty of Computing, Engineering and Built Environment, Birmingham City University, Birmingham, B4 7XG, UK