AI-Augmented CI/CD Pipelines: From Code Commit to Production with Autonomous Decisions

📅 2025-08-15
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In modern CI/CD pipelines, manual intervention in unstable test diagnosis, rollback decisions, feature flag tuning, and canary promotion introduces release delays and operational overhead. To address this, we propose an AI-augmented autonomous software delivery framework that integrates large language models (LLMs) with policy-constrained autonomous agents, yielding a reference architecture for agent-based decision-making bounded by formal policies. Our approach introduces: (1) a taxonomy of deployment decisions; (2) policy-as-code guardrails enforcing safety and compliance; (3) a tiered trust framework governing agent autonomy; and (4) a DORA-metrics-driven, verifiable evaluation methodology. Evaluated in a React 19 microservices environment, the framework significantly reduces deployment latency and manual intervention frequency, improves release velocity and system reliability, and ensures auditable, formally verifiable autonomous decision paths.

Technology Category

Application Category

📝 Abstract
Modern software delivery has accelerated from quarterly releases to multiple deployments per day. While CI/CD tooling has matured, human decision points interpreting flaky tests, choosing rollback strategies, tuning feature flags, and deciding when to promote a canary remain major sources of latency and operational toil. We propose AI-Augmented CI/CD Pipelines, where large language models (LLMs) and autonomous agents act as policy-bounded co-pilots and progressively as decision makers. We contribute: (1) a reference architecture for embedding agentic decision points into CI/CD, (2) a decision taxonomy and policy-as-code guardrail pattern, (3) a trust-tier framework for staged autonomy, (4) an evaluation methodology using DevOps Research and Assessment ( DORA) metrics and AI-specific indicators, and (5) a detailed industrial-style case study migrating a React 19 microservice to an AI-augmented pipeline. We discuss ethics, verification, auditability, and threats to validity, and chart a roadmap for verifiable autonomy in production delivery systems.
Problem

Research questions and friction points this paper is trying to address.

Automating human decisions in CI/CD to reduce latency
Replacing manual rollback and testing choices with AI
Minimizing operational toil through autonomous policy-bounded agents
Innovation

Methods, ideas, or system contributions that make the work stand out.

AI agents replace human decisions in CI/CD
Policy-as-code guardrails ensure safe autonomy
Staged trust framework enables progressive AI control
🔎 Similar Papers
No similar papers found.
Mohammad Baqar
Mohammad Baqar
Software Engineer at Cisco Systems Inc
S
Saba Naqvi
MUFG Bank
R
Rajat Khanda
University of Houston