DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion

📅 2025-04-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Addressing the challenges of large-area damage, scarce training samples, and stringent aesthetic consistency requirements in Dunhuang mural restoration, this paper proposes a multi-scale collaborative diffusion inpainting framework. It integrates ControlNet-based conditional guidance, cross-scale feature alignment, and cycle-consistency loss to ensure stylistic unity, geometric fidelity, and cultural-semantic preservation. We introduce the first four-dimensional evaluation metric for mural restoration—comprising factual accuracy, texture detail, semantic coherence, and visual coherence—and uniquely incorporate a humanistic value dimension. Leveraging domain-specific data augmentation and fine-tuned pre-trained diffusion models, our method achieves state-of-the-art performance on 23 authentic Dunhuang murals. Quantitative metrics and expert aesthetic assessments both demonstrate superiority over existing approaches, successfully recovering mineral pigment textures, rhythmic flying-ap Sarira line work, and historical contextual consistency.

Technology Category

Application Category

📝 Abstract
Large-scale pre-trained diffusion models have produced excellent results in the field of conditional image generation. However, restoration of ancient murals, as an important downstream task in this field, poses significant challenges to diffusion model-based restoration methods due to its large defective area and scarce training samples. Conditional restoration tasks are more concerned with whether the restored part meets the aesthetic standards of mural restoration in terms of overall style and seam detail, and such metrics for evaluating heuristic image complements are lacking in current research. We therefore propose DiffuMural, a combined Multi-scale convergence and Collaborative Diffusion mechanism with ControlNet and cyclic consistency loss to optimise the matching between the generated images and the conditional control. DiffuMural demonstrates outstanding capabilities in mural restoration, leveraging training data from 23 large-scale Dunhuang murals that exhibit consistent visual aesthetics. The model excels in restoring intricate details, achieving a coherent overall appearance, and addressing the unique challenges posed by incomplete murals lacking factual grounding. Our evaluation framework incorporates four key metrics to quantitatively assess incomplete murals: factual accuracy, textural detail, contextual semantics, and holistic visual coherence. Furthermore, we integrate humanistic value assessments to ensure the restored murals retain their cultural and artistic significance. Extensive experiments validate that our method outperforms state-of-the-art (SOTA) approaches in both qualitative and quantitative metrics.
Problem

Research questions and friction points this paper is trying to address.

Restoring large defective areas in ancient murals with limited training samples
Ensuring restored murals meet aesthetic standards for style and seam details
Developing evaluation metrics for heuristic image complements in mural restoration
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multi-scale convergence with ControlNet
Collaborative Diffusion mechanism
Cyclic consistency loss optimization
🔎 Similar Papers
No similar papers found.
P
Puyu Han
Southern University of Science and Technology
J
Jiaju Kang
Beijing Normal University
Y
Yuhang Pan
Hebei Guoyan Science and Technology Center
Erting Pan
Erting Pan
Wuhan University, National University of Defense Technology
Z
Zeyu Zhang
The Australian National University
Q
Qunchao Jin
AI Geeks
J
Juntao Jiang
Zhejiang University
Z
Zhichen Liu
Southern University of Science and Technology
L
Luqi Gong
Zhejiang Lab