Hostname: page-component-5db58dd55d-jnbmb Total loading time: 0 Render date: 2026-05-31T11:45:35.111Z Has data issue: false hasContentIssue false

Fine-grained mortality forecasting with deep learning

Published online by Cambridge University Press:  12 December 2025

Huiling Zheng*
Affiliation:
Department of Statistical Science, University College London, London, UK
Hai Wang
Affiliation:
Department of Statistical Science, University College London, London, UK
Rui Zhu
Affiliation:
Faculty of Actuarial Science and Insurance, Bayes Business School, City St George’s, University of London, London, UK
Jing-Hao Xue
Affiliation:
Department of Statistical Science, University College London, London, UK
*
Corresponding author: Huiling Zheng; Email: huiling.zheng.16@ucl.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

Fine-grained mortality forecasting has gained momentum in actuarial research due to its ability to capture localized, short-term fluctuations in death rates. This paper introduces MortFCNet, a deep-learning method that predicts weekly death rates using region-specific weather inputs. Unlike traditional Serfling-based methods and gradient-boosting models that rely on predefined fixed Fourier terms and manual feature engineering, MortFCNet automatically learns patterns from raw time-series data without needing explicitly defined Fourier terms or manual feature engineering. Extensive experiments across over 200 NUTS-3 regions in France, Italy, and Switzerland demonstrate that MortFCNet consistently outperforms both a standard Serfling-type baseline and XGBoost in terms of predictive accuracy. Our ablation studies further confirm its ability to uncover complex relationships in the data without feature engineering. Moreover, this work underscores a new perspective on exploring deep learning for advancing fine-grained mortality forecasting.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Institute and Faculty of Actuaries
Figure 0

Table 1. Weather variables retrieved from the E-OBS gridded meteorological dataset on the CDS

Figure 1

Table 2. Weekly averages of daily weather variables

Figure 2

Table 3. Weekly averages of daily extreme weather variables and seasonal indicators

Figure 3

Figure 1. Diagram of MortFCNet. The model consists of a gated recurrent unit (GRU) followed by three fully connected layers. The inputs $\xi _{t,v}^{(g)}$ and $\xi _{t,v-1}^{(g)}$ are processed by the GRU, producing the hidden state $h_v$. The fully connected layers sequentially transform $h_v$ into $f^1$, $f^2$, and $f^3$; a final linear layer then generates the output $\hat {r}_{t,v}^{(g)}$. Here, $C_{\xi }$ represents the input feature dimension, $C_m$ is the feature dimension after transformation, and $R$ represents the domain.

Figure 4

Figure 2. Structure of the gated recurrent unit (GRU) (Riaz et al., 2020). The input consists of feature vectors from the previous and current time steps, denoted as $\xi _{v-1}$ and $\xi _{v}$, respectively. At time step $v$, the GRU processes $\xi _{v}$ and the previous hidden state $h_{v-1}$ using the update gate $z_{v}$ and the reset gate $r_{v}$ to regulate information flow, ultimately producing the updated hidden state $ h_{v}$. The candidate’s hidden state is indicated as $\hat {h}_{v}$. Here, $\sigma$ represents the sigmoid activation function, and $\tanh$ is the hyperbolic tangent function.

Figure 5

Table 4. MSE results for Lee-Carter, baseline, XGBoost, and MortFCNet (seed = 1996). The training years 2013–2018, the test year 2019, for CH (Switzerland), FR (France), and IT (Italy)

Figure 6

Figure 3. The boxplots of test MSE by country for MortFCNet, compared with baseline and XGBoost, over 15 random seeds.

Figure 7

Figure 4. Map of mainland regions of France, Italy, and Switzerland. The colors indicate the country-specific regions (peach for France, green for Italy, and blue for Switzerland), while the pink highlights denote the selected regions.

Figure 8

Figure 5. Side-by-side mortality and squared error comparison for region ITC18.

Figure 9

Figure 6. Side-by-side mortality and squared error comparison for region CH011.

Figure 10

Figure 7. Side-by-side mortality and squared error comparison for region FRK25.

Figure 11

Table 5. Test (2019) MSE for the Lee-Carter, baseline, XGBoost, and MortFCNet models across regions

Figure 12

Figure 8. MSE for log of mortality by NUTS3 region for France, Italy, and Switzerland.

Figure 13

Figure 9. SHAP summary plot for MortFCNet predictions, computed by using Kernel SHAP. The color gradient represents the feature value, and the position along the x-axis reflects the impact on the model output.

Figure 14

Table 6. Top 10 features ranked by mean absolute SHAP values for MortFCNet

Figure 15

Table 7. MSE performance summary for ablation studies. The best-performing setting on the test data is in bold

Figure 16

Table 8. Per-country counts of total rows and unique NUTS_ID regions in the merged France–Italy–Switzerland and Scandinavian dataset

Figure 17

Table 9. MSE for baseline, XGBoost, and MortFCNet by country and overall on the training data (2013–2018) and the test data (2019) of the merged data. The lowest MSE is in bold

Figure 18

Table 10. Overall performance comparison with and without engineered features