đ¤ AI Summary
This study systematically examines the evolution of African natural language processing (AfricaNLP) from 2005 to 2025, addressing three core questions: shifts in NLP research paradigms, seminal scholarly contributions, and the distribution of key authors, institutions, and funders. Methodologically, we construct AfricaNLPContributionsâthe first annotated dataset of AfricaNLP scholarly contributionsâcomprising 1,900 paper abstracts, 4,900 authors, and 7,800 manually labeled contribution sentences. Leveraging text mining and quantitative analysis, we identify longitudinal research trends, collaboration networks, and domain-specific knowledge graphs. Our principal contribution is a novel âcontribution-drivenâ framework for evaluating regional AI research, complemented by an open dataset and a dynamic tracking platform that enables research landscape visualization and automated literature review generationâthereby filling a critical gap in empirical studies on AI development in Africa.
đ Abstract
Natural Language Processing (NLP) is undergoing constant transformation, as Large Language Models (LLMs) are driving daily breakthroughs in research and practice. In this regard, tracking the progress of NLP research and automatically analyzing the contributions of research papers provides key insights into the nature of the field and the researchers. This study explores the progress of African NLP (AfricaNLP) by asking (and answering) basic research questions such as: i) How has the nature of NLP evolved over the last two decades?, ii) What are the contributions of AfricaNLP papers?, and iii) Which individuals and organizations (authors, affiliated institutions, and funding bodies) have been involved in the development of AfricaNLP? We quantitatively examine the contributions of AfricaNLP research using 1.9K NLP paper abstracts, 4.9K author contributors, and 7.8K human-annotated contribution sentences (AfricaNLPContributions) along with benchmark results. Our dataset and continuously existing NLP progress tracking website provide a powerful lens for tracing AfricaNLP research trends and hold potential for generating data-driven literature surveys.