🤖 AI Summary
This study identifies critical privacy vulnerabilities in the Japanese Yjmob100k anonymized trajectory dataset (100,000 users), demonstrating that conventional anonymization techniques fail against a synergistic re-identification attack leveraging population density patterns, spatiotemporal structural correlations, and individual activity profiles. The authors present the first systematic evidence that large-scale, end-to-end identity and spatiotemporal trajectory re-identification is feasible using only publicly available geographic information and intrinsic trajectory statistics—without auxiliary external data. Their method integrates geospatial analysis, temporal behavioral modeling, and multi-source structural inference, challenging foundational assumptions of trajectory anonymization security. Experiments successfully recover numerous users’ true identities and fine-grained spatiotemporal trajectories, empirically revealing the dataset’s insufficient anonymization strength. These findings provide crucial theoretical insights and empirical validation for designing privacy-enhancing trajectory publishing mechanisms.
📝 Abstract
Mobility traces represent a critical class of personal data, often subjected to privacy-preserving transformations before public release. In this study, we analyze the anonymized Yjmob100k dataset, which captures the trajectories of 100,000 users in Japan, and demonstrate how existing anonymization techniques fail to protect their sensitive attributes. We leverage population density patterns, structural correlations, and temporal activity profiles to re-identify the dataset's real-world location and timing. Our results reveal that the anonymization process carried out for Yjmob100k is inefficient and preserves enough spatial and temporal structure to enable re-identification. This work underscores the limitations of current trajectory anonymization methods and calls for more robust privacy mechanisms in the publication of mobility data.