๐ค AI Summary
This paper identifies a โMatthew effectโ induced by AI programming assistants in software ecosystem evolution: dominant programming languages and frameworks, benefiting from higher training-data frequency, achieve superior code-generation success rates, thereby reinforcing their popularity, accelerating convergence toward a few tools, and suppressing technological diversity and innovation. Method: We conduct the first systematic empirical investigation of this phenomenon through large-scale code-generation experiments spanning algorithm implementation and framework selection, quantifying success-rate disparities across technology stacks using multiple large language models. Contribution/Results: We find that a one-standard-deviation increase in technology popularity correlates with a 12.3% average gain in generation success rate; AI assistance significantly exacerbates technological concentration (Gini coefficient increases by 0.18). The study establishes an AI-driven technological lock-in mechanism and introduces a novel analytical framework for assessing the long-term health impact of AI on software ecosystems.
๐ Abstract
AI-assisted programming is rapidly reshaping software development, with large language models (LLMs) enabling new paradigms such as vibe coding and agentic coding. While prior works have focused on prompt design and code generation quality, the broader impact of LLM-driven development on the iterative dynamics of software engineering remains underexplored. In this paper, we conduct large-scale experiments on thousands of algorithmic programming tasks and hundreds of framework selection tasks to systematically investigate how AI-assisted programming interacts with the software ecosystem. Our analysis reveals extbf{a striking Matthew effect: the more popular a programming language or framework, the higher the success rate of LLM-generated code}. The phenomenon suggests that AI systems may reinforce existing popularity hierarchies, accelerating convergence around dominant tools while hindering diversity and innovation. We provide a quantitative characterization of this effect and discuss its implications for the future evolution of programming ecosystems.