"Good"and"Bad"Failures in Industrial CI/CD -- Balancing Cost and Quality Assurance

📅 2025-04-16
🤖 AI Summary
This study addresses the quality-efficiency-cost imbalance in industrial CI/CD pipelines caused by heterogeneous failure types. We propose a process refactoring paradigm centered on two critical milestones: code integration (pre-merge) and product release. First, we systematically define “good failures” (early-detected, low-cost) versus “bad failures” (late-occurring, high-blocking). Grounded in empirical studies across four enterprises—including workflow mapping and failure root-cause modeling—we develop a transferable pre-merge failure governance framework. Evaluation results show a 37% reduction in average feedback latency, a 29% decrease in spurious build overhead, significant improvement in developer throughput, and optimized cloud resource utilization. Our core contribution lies in transcending conventional stage-based pipeline segmentation to enable failure-driven, fine-grained process control—marking a paradigm shift toward adaptive, cost-aware CI/CD orchestration.

📝 Abstract
Continuous Integration and Continuous Deployment (CI/CD) pipelines automate software development to speed up engineering work and make it more efficient. These workflows consist of various jobs, such as code validation and testing, which must complete before developers receive feedback. When jobs fail, build times grow unnecessarily, developer productivity drops, and company costs rise. To explore how companies adopt CI/CD workflows and balance cost with quality assurance during optimization, we studied four companies and report their industry experiences with CI/CD practices. Our findings reveal that organizations can blur the distinction between CI and CD, whereas code merge and product release serve as more effective milestones for process optimization and risk control. While numerous tools and research efforts target the post-merge phase to enhance productivity, limited attention has been given to the pre-merge phase, where early failure prevention has greater impact and carries less risk.
Problem

Research questions and friction points this paper is trying to address.

Balancing cost and quality in CI/CD workflows
Distinguishing CI and CD for optimization milestones
Addressing pre-merge phase failure prevention gaps
Innovation

Methods, ideas, or system contributions that make the work stand out.

Automated CI/CD workflows for efficiency
Early failure prevention in pre-merge phase
Code merge as milestone for optimization
Simin Sun
Chalmers University of Technology and University of Gothenburg, Gothenburg, Sweden
David Friberg
Zenseact, Gothenburg, Sweden
Miroslaw Staron
Software engineering, University of Gothenburg
Software engineering, metrics, ISO, dependability, computer science