Governance Matters: Lessons From Restructuring the Data.Table OSS Project

📅 2025-09-07
🏛️ IEEE International Conference on Software Maintenance and Evolution
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study addresses sustainability and scalability challenges in the data.table project, which stemmed from a highly centralized governance model that led to issue backlogs, ambiguous contribution pathways, and maintenance bottlenecks. To overcome these limitations, the project implemented a community-driven governance reform, systematically restructuring its collaboration mechanisms. This work presents the first empirical demonstration of a successful transition from a single-maintainer model to community governance in an open-source software context. Employing a mixed-methods approach—combining survey responses from 17 contributors with repository data mining—the study evaluates the reform’s impact. Findings reveal a 200% increase in new contributors, a reduction in average pull request resolution time from over 700 days to under one week, a threefold improvement in contributor retention, and significantly enhanced community sentiment. The research offers a replicable framework for open-source governance transformation.

Technology Category

Application Category

📝 Abstract
Open source software (OSS) forms the backbone of industrial data workflows and enterprise systems. However, many OSS projects face operational risks due to informal or centralized governance. This paper presents a practical case study of data.table, a high-performance R package widely adopted in production analytics pipelines, which underwent a community-led governance reform to address scalability and sustainability concerns. Before the reform, data.table faced a growing backlog of unresolved issues and open pull requests, unclear contributor pathways, and bottlenecks caused by reliance on a single core maintainer. In response, the community initiated a redesign of its governance structure. In this paper, we evaluated the impact of this transition through a mixed-methods approach, combining a contributor survey ($\mathbf{n} \boldsymbol{=} \mathbf{1 7}$) with mining project repository data. Our results show that following the reform, the project experienced a 200 % increase in new contributor recruitment, a drop in pull request resolution time from over 700 days to under a week, and a 3x increase in contributor retention. Community sentiment improved around transparency, onboarding, and project momentum, though concerns around fairness and conflict resolution remain. This case study provides practical guidance for maintainers, companies, and foundations seeking to enhance OSS governance.
Problem

Research questions and friction points this paper is trying to address.

open source software
governance
sustainability
scalability
maintainer bottleneck
Innovation

Methods, ideas, or system contributions that make the work stand out.

open source governance
community-driven reform
contributor retention
pull request latency
sustainable OSS
🔎 Similar Papers
No similar papers found.
P
Pedro Oliveira
Northern Arizona University, Flagstaff, AZ, USA
D
Doris Amoakohene
data.table Project
T
Toby Hocking
data.table Project
M
M. Gerosa
Northern Arizona University, Flagstaff, AZ, USA
Igor Steinmacher
Igor Steinmacher
Northern Arizona University
Software EngineeringCSCWMining Software RepositoriesOpen Source Software