🤖 AI Summary
High-frequency mortality data exhibit complex temporal structures, rendering conventional models inadequate for accurately capturing short-term trends and seasonality. Method: This paper proposes a novel mortality modeling framework that integrates gradient boosting with multi-population stochastic mortality modeling. Specifically, the Li-Lee model is innovatively incorporated as a weak learner within a gradient boosting ensemble; population clustering is optimized based on mortality improvement rates and seasonal intensity to refine subgroup definitions. The method operates directly on weekly mortality data and is systematically evaluated across a 30-country sample. Contribution/Results: The proposed model consistently outperforms mainstream benchmarks in goodness-of-fit, short-term forecasting accuracy, and robustness across diverse cluster configurations. It establishes a scalable, interpretable, machine learning–enhanced paradigm for high-frequency mortality modeling, bridging actuarial theory with modern ensemble learning techniques.
📝 Abstract
High-frequency mortality data remains an understudied yet critical research area. While its analysis can reveal short-term health impacts of climate extremes and enable more timely mortality forecasts, its complex temporal structure poses significant challenges to traditional mortality models. To leverage the power of high-frequency mortality data, this paper introduces a novel integration of gradient boosting techniques into traditional stochastic mortality models under a multi-population setting. Our key innovation lies in using the Li and Lee model as the weak learner within the gradient boosting framework, replacing conventional decision trees. Empirical studies are conducted using weekly mortality data from 30 countries (Human Mortality Database, 2015--2019). The proposed methodology not only enhances model fit by accurately capturing underlying mortality trends and seasonal patterns, but also achieves superior forecast accuracy, compared to the benchmark models. We also investigate a key challenge in multi-population mortality modelling: how to select appropriate sub-populations with sufficiently similar mortality experiences. A comprehensive clustering exercise is conducted based on mortality improvement rates and seasonal strength. The results demonstrate the robustness of our proposed model, yielding stable forecast accuracy under different clustering configurations.