Structural Under-Representation of Women in News: Nonparametric Bayesian Mixtures Capture Time-Dependent Dynamics

📅 2026-06-09

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

This study addresses the persistent underrepresentation of women as sources in news media, revealing complex dynamics of gender bias across thematic, geographic, and temporal dimensions. Introducing a time-varying nonparametric Bayesian mixture model—adapted with Beta mixture kernels to accommodate citation proportions bounded in [0,1]—the authors conduct dynamic density estimation and time-dependent clustering on Canadian news data from 2019 to 2024. This approach effectively uncovers latent clusters reflecting structural inequities and their temporal stability, overcoming limitations of conventional static or parametric models. The analysis demonstrates that women remain consistently underrepresented across all identified clusters, with over 85% of topic–region sequences showing no discernible improvement; overall, the distribution of female source citation proportions has remained largely unchanged throughout the study period.

📝 Abstract

The under-representation of women as sources cited in news media is one prominent representation of gender bias. Understanding where gender bias concentrates and how it evolves is essential for targeted mitigation. Because gender representation varies across topics, time, and reported-on regions, creating complex dependencies that are difficult to capture parametrically, we employ a nonparametric model to uncover latent cluster structures and temporal dynamics. We combine time-dependent Bayesian mixture modeling techniques with a Beta mixture kernel tailored to female quote shares, bounded between 0 and 1. Fitted on Canadian news articles from 2019 to 2024, the model reveals structural under-representation of women across all clusters, with news topic driving differences in female quote shares more strongly than the reported-on region. More than 85% of topic-region time series show no improvement toward gender parity over the observation period. Dynamic density estimation confirms that the aggregate distribution of female quote shares remains stable between 2019 and 2024. Our application demonstrates that advanced probabilistic models not only reproduce findings in gender bias research but also reveal latent dependencies and structural patterns that simpler approaches miss, encouraging future adoption of model-based frameworks for studying media bias.

Problem

Research questions and friction points this paper is trying to address.

gender bias

under-representation

news media

female sources

structural inequality

Innovation

Methods, ideas, or system contributions that make the work stand out.

nonparametric Bayesian mixtures

time-dependent dynamics

Beta mixture kernel