Federated Learning for Traffic Flow Prediction with Synthetic Data Augmentation

📅 2024-12-11
🏛️ arXiv.org
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address privacy constraints, data heterogeneity, and sparsity arising from multi-institutional traffic data silos, this paper proposes FedTPS—a novel federated framework for privacy-preserving synthetic trajectory generation. FedTPS pioneers the federation of diffusion models for spatiotemporal trajectory synthesis and introduces a hybrid spatiotemporal graph neural network that jointly models dynamic road networks and long-range temporal dependencies via integrated temporal and graph attention mechanisms. Built upon the cross-silo federated learning paradigm, FedTPS is rigorously evaluated on large-scale real-world ride-hailing data. Experimental results demonstrate a 12.7% reduction in mean absolute error (MAE) for global traffic flow prediction, significantly outperforming state-of-the-art federated baselines. Key contributions include: (i) the first federated diffusion-based generative mechanism for trajectory synthesis; (ii) a dual-attention collaborative modeling architecture unifying temporal dynamics and topological structure; and (iii) end-to-end privacy preservation with cross-domain data synergy and utility enhancement.

Technology Category

Application Category

📝 Abstract
Deep-learning based traffic prediction models require vast amounts of data to learn embedded spatial and temporal dependencies. The inherent privacy and commercial sensitivity of such data has encouraged a shift towards decentralised data-driven methods, such as Federated Learning (FL). Under a traditional Machine Learning paradigm, traffic flow prediction models can capture spatial and temporal relationships within centralised data. In reality, traffic data is likely distributed across separate data silos owned by multiple stakeholders. In this work, a cross-silo FL setting is motivated to facilitate stakeholder collaboration for optimal traffic flow prediction applications. This work introduces an FL framework, referred to as FedTPS, to generate synthetic data to augment each client's local dataset by training a diffusion-based trajectory generation model through FL. The proposed framework is evaluated on a large-scale real world ride-sharing dataset using various FL methods and Traffic Flow Prediction models, including a novel prediction model we introduce, which leverages Temporal and Graph Attention mechanisms to learn the Spatio-Temporal dependencies embedded within regional traffic flow data. Experimental results show that FedTPS outperforms multiple other FL baselines with respect to global model performance.
Problem

Research questions and friction points this paper is trying to address.

Addresses data scarcity in traffic flow prediction models
Enhances privacy and collaboration in decentralized data environments
Improves global model performance using synthetic data augmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning for decentralized traffic data analysis
Synthetic data augmentation using diffusion-based trajectory generation
Temporal and Graph Attention mechanisms for traffic prediction
🔎 Similar Papers
No similar papers found.
F
Fermin Orozco
University of Exeter, Exeter, UK
P
Pedro Porto Buarque de Gusmao
University of Surrey, Surrey, UK
Hongkai Wen
Hongkai Wen
University of Warwick
Machine LearningML/AI SystemsCyber-Physical Systems
Johan Wahlström
Johan Wahlström
University of Exeter
Sensor FusionSignal Processing
M
Man Luo
University of Exeter, Exeter, UK