Beats of Bias: Analyzing Lyrics with Topic Modeling and Gender Bias Measurements

๐Ÿ“… 2024-09-24
๐Ÿ›๏ธ arXiv.org
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
This study investigates the diachronic evolution and thematic heterogeneity of gender bias in English song lyrics. Methodologically, it analyzes 537,000 English song lyrics using a novel integration of BERTopic for thematic modeling and SC-WEAT (Single-Category Word Embedding Association Test) for quantifying implicit gendered semantic associations, leveraging BERT-based contextualized embeddings and temporal analysis. Results reveal a longitudinal thematic shift from romance toward sexualization of women; words denoting intelligence and strength exhibit significant male association, while appearance- and vulnerability-related terms show strong female associationโ€”both biases varying markedly across themes and genres. Additionally, high-density clusters of misogynistic and vulgar content are identified. The work contributes a reproducible methodological framework and a large-scale empirical benchmark for studying cultural bias in music, advancing computational social science approaches to linguistic stereotyping.

Technology Category

Application Category

๐Ÿ“ Abstract
This paper uses topic modeling and bias measurement techniques to analyze and determine gender bias in English song lyrics. We utilize BERTopic to cluster 537,553 English songs into distinct topics and chart their development over time. Our analysis shows the thematic shift in song lyrics over the years, from themes of romance to the increasing sexualization of women in songs. We observe large amounts of profanity and misogynistic lyrics on various topics, especially in the overall biggest cluster. Furthermore, to analyze gender bias across topics and genres, we employ the Single Category Word Embedding Association Test (SC-WEAT) to compute bias scores for the word embeddings trained on the most popular topics as well as for each genre. We find that words related to intelligence and strength tend to show a male bias across genres, as opposed to appearance and weakness words, which are more female-biased; however, a closer look also reveals differences in biases across topics.
Problem

Research questions and friction points this paper is trying to address.

Analyzes gender bias in English song lyrics using topic modeling.
Measures thematic shifts and misogynistic content over time.
Quantifies gender bias across topics and genres using SC-WEAT.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Used BERTopic for clustering song lyrics
Applied SC-WEAT for gender bias analysis
Analyzed thematic evolution and bias scores
๐Ÿ”Ž Similar Papers
No similar papers found.
Danqing Chen
Danqing Chen
Technical University of Munich
NLP
A
Adithi Satish
Technical University of Munich, Germany
R
Rasul Khanbayov
Technical University of Munich, Germany
C
Carolin M. Schuster
Technical University of Munich, Germany
Georg Groh
Georg Groh
Adjunct Professor
Social ComputingNatural Language Processing