๐ค AI Summary
Traditional manual code review struggles to meet the high-throughput, low-cost demands of AI-assisted software development and faces scalability limitations in human-in-the-loop workflows. This work proposes a novel paradigm that fully replaces human reviewers with a large language modelโdriven autonomous coding agent, integrating capabilities in code comprehension, generation, testing, and repair within a unified framework. Experimental results demonstrate that the proposed agent not only fulfills all conventional code review objectives but also significantly outperforms existing approaches in terms of cost, efficiency, and scalability. These findings suggest the potential to fundamentally reshape software quality assurance practices that have remained largely unchanged for over five decades.
๐ Abstract
Code review has been the primary quality gate in software development since Fagan formalised code inspection in 1976. For five decades, having a human examine and comment on a colleague's changes before merge has been a cornerstone practice at organisations of every size. Coding agents are large language model (LLM)-based autonomous systems capable of reading, writing, testing, and repairing software. We argue that coding agents have crossed a threshold of capability at which traditional human code review is no longer a necessary component of a software quality pipeline. Our argument rests on two claims: every stated goal of code review can be served by agents at lower cost and higher throughput; the naive integration in which agents write code and humans remain the mandatory reviewers is a dead end because it neither provides meaningful assurance nor scales with AI-assisted throughput.