Breaking Anonymity at Scale: Re-identifying the Trajectories of 100K Real Users in Japan

📅 2025-06-05
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study identifies critical privacy vulnerabilities in the Japanese Yjmob100k anonymized trajectory dataset (100,000 users), demonstrating that conventional anonymization techniques fail against a synergistic re-identification attack leveraging population density patterns, spatiotemporal structural correlations, and individual activity profiles. The authors present the first systematic evidence that large-scale, end-to-end identity and spatiotemporal trajectory re-identification is feasible using only publicly available geographic information and intrinsic trajectory statistics—without auxiliary external data. Their method integrates geospatial analysis, temporal behavioral modeling, and multi-source structural inference, challenging foundational assumptions of trajectory anonymization security. Experiments successfully recover numerous users’ true identities and fine-grained spatiotemporal trajectories, empirically revealing the dataset’s insufficient anonymization strength. These findings provide crucial theoretical insights and empirical validation for designing privacy-enhancing trajectory publishing mechanisms.

Technology Category

Application Category

📝 Abstract
Mobility traces represent a critical class of personal data, often subjected to privacy-preserving transformations before public release. In this study, we analyze the anonymized Yjmob100k dataset, which captures the trajectories of 100,000 users in Japan, and demonstrate how existing anonymization techniques fail to protect their sensitive attributes. We leverage population density patterns, structural correlations, and temporal activity profiles to re-identify the dataset's real-world location and timing. Our results reveal that the anonymization process carried out for Yjmob100k is inefficient and preserves enough spatial and temporal structure to enable re-identification. This work underscores the limitations of current trajectory anonymization methods and calls for more robust privacy mechanisms in the publication of mobility data.
Problem

Research questions and friction points this paper is trying to address.

Re-identifying anonymized user trajectories in mobility data
Exposing flaws in current trajectory anonymization techniques
Highlighting privacy risks in large-scale mobility datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Leveraging population density patterns for re-identification
Using structural correlations to analyze trajectories
Applying temporal activity profiles to reveal identities
🔎 Similar Papers
No similar papers found.