Is Crowdsourcing a Puppet Show? Detecting a New Type of Fraud in Online Platforms

📅 2025-10-31

📈 Citations: 0

✨ Influential: 0

🤖 AI Summary

A novel fraud pattern—“puppeteers” operating multiple fake accounts (sockpuppets) to evade attention checks—has emerged on online crowdsourcing platforms (e.g., MTurk), severely compromising data integrity and research validity. Method: This paper presents the first systematic identification and formal definition of this human-coordinated multi-account fraud. We propose an integrated detection framework combining behavioral analysis, statistical modeling, and AI techniques to distinguish low-complexity sockpuppets from high-fidelity bots. Our approach leverages three key signals: temporal behavioral heterogeneity across account clusters, response consistency, and task participation topology. Contribution/Results: Evaluated on two real-world MTurk studies, our method identifies 33%–56.4% of accounts as suspicious sockpuppets, confirming the prevalence of this threat. The work advances crowdsourcing data quality assessment paradigms and delivers a deployable, platform-agnostic technical solution for fraud mitigation and governance.

Technology Category

Application Category

📝 Abstract

Crowdsourcing platforms such as Amazon Mechanical Turk (MTurk) are important tools for researchers seeking to conduct studies with a broad, global participant base. Despite their popularity and demonstrated utility, we present evidence that suggests the integrity of data collected through Amazon MTurk is being threatened by the presence of puppeteers, apparently human workers controlling multiple puppet accounts that are capable of bypassing standard attention checks. If left undetected, puppeteers and their puppets can undermine the integrity of data collected on these platforms. This paper investigates data from two Amazon MTurk studies, finding that a substantial proportion of accounts (33% to 56.4%) are likely puppets. Our findings highlight the importance of adopting multifaceted strategies to ensure data integrity on crowdsourcing platforms. With the goal of detecting this type of fraud, we discuss a set of potential countermeasures for both puppets and bots with varying degrees of sophistication (e.g., employing AI). The problem of single entities (or puppeteers) manually controlling multiple accounts could exist on other crowdsourcing platforms; as such, their detection may be of broader application. While our findings suggest the need to re-evaluate the quality of crowdsourced data, many previous studies likely remain valid, particularly those with robust experimental designs. However, the presence of puppets may have contributed to false null results in some studies, suggesting that unpublished work may be worth revisiting with effective puppet detection strategies.

Problem

Research questions and friction points this paper is trying to address.

Detecting puppeteer-controlled fraudulent accounts on crowdsourcing platforms

Investigating threats to data integrity from multi-account manipulation

Developing countermeasures against sophisticated fraud in online research

Innovation

Methods, ideas, or system contributions that make the work stand out.

Detects puppeteers controlling multiple fake accounts

Uses multifaceted strategies to ensure data integrity

Proposes AI-based countermeasures against sophisticated fraud

🔎 Similar Papers

No similar papers found.

Authors to Follow