A Gravity-informed Spatiotemporal Transformer for Human Activity Intensity Prediction

📅 2025-06-16
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Human activity intensity prediction faces two key challenges: the absence of physical constraints in spatiotemporal modeling and oversmoothing in graph neural networks. To address these, we propose the Gravitation-guided Spatiotemporal Transformer (GST-Former), the first model to explicitly embed Newton’s law of universal gravitation into the attention mechanism—thereby encoding mass-dependent interaction strength and distance-decaying spatial influence. GST-Former introduces a decoupled, physics-consistent attention matrix and synergistically integrates parallel spatiotemporal graph convolutions with the Transformer architecture to enable closed-form spatial interaction modeling. Evaluated on six large-scale real-world datasets, it significantly outperforms state-of-the-art methods. Crucially, its learned attention weights strictly adhere to the geographic distance decay principle, achieving both superior predictive accuracy and explicit physical interpretability.

Technology Category

Application Category

📝 Abstract
Human activity intensity prediction is a crucial to many location-based services. Although tremendous progress has been made to model dynamic spatiotemporal patterns of human activity, most existing methods, including spatiotemporal graph neural networks (ST-GNNs), overlook physical constraints of spatial interactions and the over-smoothing phenomenon in spatial correlation modeling. To address these limitations, this work proposes a physics-informed deep learning framework, namely Gravity-informed Spatiotemporal Transformer (Gravityformer) by refining transformer attention to integrate the universal law of gravitation and explicitly incorporating constraints from spatial interactions. Specifically, it (1) estimates two spatially explicit mass parameters based on inflow and outflow, (2) models the likelihood of cross-unit interaction using closed-form solutions of spatial interactions to constrain spatial modeling randomness, and (3) utilizes the learned spatial interaction to guide and mitigate the over-smoothing phenomenon in transformer attention matrices. The underlying law of human activity can be explicitly modeled by the proposed adaptive gravity model. Moreover, a parallel spatiotemporal graph convolution transformer structure is proposed for achieving a balance between coupled spatial and temporal learning. Systematic experiments on six real-world large-scale activity datasets demonstrate the quantitative and qualitative superiority of our approach over state-of-the-art benchmarks. Additionally, the learned gravity attention matrix can be disentangled and interpreted based on geographical laws. This work provides a novel insight into integrating physical laws with deep learning for spatiotemporal predictive learning.
Problem

Research questions and friction points this paper is trying to address.

Overcoming physical constraints in human activity prediction
Addressing over-smoothing in spatial correlation modeling
Integrating gravity law with deep learning frameworks
Innovation

Methods, ideas, or system contributions that make the work stand out.

Integrates gravity law into transformer attention
Estimates mass parameters from inflow and outflow
Uses parallel spatiotemporal graph convolution transformer
🔎 Similar Papers
No similar papers found.
Y
Yi Wang
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China
Z
Zhenghong Wang
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China
F
Fan Zhang
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China
C
Chengling Tang
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China
Chaogui Kang
Chaogui Kang
China University of Geosciences
GIScienceTravel BehaviorBuilt EnvironmentSocial Inequality
D
Di Zhu
Department of Geography, Environment and Society, University of Minnesota, Twin Cities, Minneapolis, MN 55455, USA
Zhongfu Ma
Zhongfu Ma
University of Minnesota Twin Cities
Geographic information science (GIScience)Human mobilitySpatial evolution
Sijie Ruan
Sijie Ruan
Assistant Professor, Beijing Institute of Technology
Spatio-temporal Data MiningVolunteered Geographic InformationUrban Computing
W
Weiyu Zhang
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China
Y
Yu Zheng
JD iCity, JD Technology, Beijing 100176, China
Philip S. Yu
Philip S. Yu
Professor of Computer Science, University of Illinons at Chicago
Data miningDatabasePrivacy
Y
Yu Liu
Institute of Remote Sensing and Geographic Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China