Potential failures of physics-informed machine learning in traffic flow modeling: theoretical and experimental analysis

📅 2025-05-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Physical-informed machine learning (PIML) for traffic flow modeling often underperforms both purely data-driven and purely physics-based models—a failure mode not fully explained by residual magnitude alone. Method: We develop a PIML framework grounded in the LWR and ARZ macroscopic traffic models, integrating Hessian-dominated directional perturbation analysis, gradient angle quantification, and experiments with sparse and time-averaged data. Contribution/Results: We identify the core failure mechanism: ML and physics gradients must jointly lie within a conical region aligned with the quasi-true gradient—failure occurs when they misalign, not merely when residuals are large. We introduce the CFL condition as the first dataset suitability criterion for PIML, demonstrating that strong physical constraints can degrade performance via residual-induced bias, and that time averaging distorts residuals. We further establish higher inherent error bounds for higher-order models like ARZ. Empirically, CFL compliance is validated as a critical success indicator for PIML deployment.

Technology Category

Application Category

📝 Abstract
This study critically examines the performance of physics-informed machine learning (PIML) approaches for traffic flow modeling, defining the failure of a PIML model as the scenario where it underperforms both its purely data-driven and purely physics-based counterparts. We analyze the loss landscape by perturbing trained models along the principal eigenvectors of the Hessian matrix and evaluating corresponding loss values. Our results suggest that physics residuals in PIML do not inherently hinder optimization, contrary to a commonly assumed failure cause. Instead, successful parameter updates require both ML and physics gradients to form acute angles with the quasi-true gradient and lie within a conical region. Given inaccuracies in both the physics models and the training data, satisfying this condition is often difficult. Experiments reveal that physical residuals can degrade the performance of LWR- and ARZ-based PIML models, especially under highly physics-driven settings. Moreover, sparse sampling and the use of temporally averaged traffic data can produce misleadingly small physics residuals that fail to capture actual physical dynamics, contributing to model failure. We also identify the Courant-Friedrichs-Lewy (CFL) condition as a key indicator of dataset suitability for PIML, where successful applications consistently adhere to this criterion. Lastly, we observe that higher-order models like ARZ tend to have larger error lower bounds than lower-order models like LWR, which is consistent with the experimental findings of existing studies.
Problem

Research questions and friction points this paper is trying to address.

Evaluates PIML failures in traffic flow modeling compared to data-driven and physics-based models
Identifies conditions for successful PIML optimization involving ML and physics gradients alignment
Analyzes impact of data sampling and physics inaccuracies on PIML model performance
Innovation

Methods, ideas, or system contributions that make the work stand out.

Analyzing loss landscape via Hessian matrix perturbations
Defining conical region for successful parameter updates
Using CFL condition as dataset suitability indicator
🔎 Similar Papers
No similar papers found.
Yuan-Zheng Lei
Yuan-Zheng Lei
University of Maryland
transportationoptimizationmachine learning
Yaobang Gong
Yaobang Gong
University of Maryland
Dianwei Chen
Dianwei Chen
University of Maryland, College Park
Reinforcement LearningAutonomous VehicleTransportation
Y
Yao Cheng
University of Maryland, College Park, MD 20742, United States
X
Xianfeng Terry Yang
University of Maryland, College Park, MD 20742, United States