🤖 AI Summary
Social media advertising targeting mechanisms lack transparency, raising concerns about algorithmic bias and unintended inference of sensitive attributes. This paper introduces the first multi-stage socio-technical auditing framework for automated advertising systems, integrating large-scale ad exposure log analysis (435,000 ads across 891 Australian users), multimodal large language model (LLM) reasoning, and statistical modeling. We empirically demonstrate that ad feed composition constitutes a “digital footprint” exploitable by publicly available AI: LMs reconstruct user demographic attributes from ad sequences alone with accuracy surpassing census baselines and approaching human judgment. Furthermore, we identify disproportionate exposure of marginalized groups to gambling and political advertisements. These findings reveal dual systemic risks—algorithmic discrimination and privacy leakage—and establish a reproducible, interdisciplinary evaluation paradigm for auditing advertising algorithms.
📝 Abstract
Automated ad targeting on social media is opaque, creating risks of exploitation and invisibility to external scrutiny. Users may be steered toward harmful content while independent auditing of these processes remains blocked. Large Language Models (LLMs) raise a new concern: the potential to reverse-engineer sensitive user attributes from exposure alone. We introduce a multi-stage auditing framework to investigate these risks. First, a large-scale audit of over 435,000 ad impressions delivered to 891 Australian Facebook users reveals algorithmic biases, including disproportionate Gambling and Politics ads shown to socioeconomically vulnerable and politically aligned groups. Second, a multimodal LLM can reconstruct users' demographic profiles from ad streams, outperforming census-based baselines and matching or exceeding human performance. Our results provide the first empirical evidence that ad streams constitute rich digital footprints for public AI inference, highlighting urgent privacy risks and the need for content-level auditing and governance.