π€ AI Summary
This study addresses the challenges of modeling uncertainty and biased parameter estimation arising from data sparsity in short time series across multiple domains. To overcome the limitations of single-sequence modeling, the authors propose a multi-source time series fusion enhancement method grounded in statistical homogeneity matching. By integrating multiple sparse yet distributionally similar short sequences, the approach jointly models the data using a combination of Hawkes and Poisson processes. Empirical evaluation on pain event data from sickle cell disease patients demonstrates that the proposed method substantially improves the accuracy of parameter estimation and the reliability of model selection, while effectively uncovering the dynamic evolution patterns of patientsβ subjective pain experiences.
π Abstract
Researchers across different fields, including but not limited to ecology, biology, and healthcare, often face the challenge of sparse data. Such sparsity can lead to uncertainties, estimation difficulties, and potential biases in modeling. Here we introduce a novel data augmentation method that combines multiple sparse time series datasets when they share similar statistical properties, thereby improving parameter estimation and model selection reliability. We demonstrate the effectiveness of this approach through validation studies comparing Hawkes and Poisson processes, followed by application to subjective pain dynamics in patients with sickle cell disease (SCD), a condition affecting millions worldwide, particularly those of African, Mediterranean, Middle Eastern, and Indian descent.