Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-04-21T02:34:46.219Z Has data issue: false hasContentIssue false

POINT AND INTERVAL FORECASTS OF DEATH RATES USING NEURAL NETWORKS

Published online by Cambridge University Press:  03 December 2021

Simon Schnürch*
Affiliation:
Department of Financial Mathematics Fraunhofer Institute for Industrial Mathematics ITWM Fraunhofer-Platz 1 67663 Kaiserslautern, Germany Department of Mathematics University of Kaiserslautern Gottlieb-Daimler-Straße 48 67663 Kaiserslautern, Germany E-Mail: simon.schnuerch@itwm.fraunhofer.de
Ralf Korn
Affiliation:
Department of Financial Mathematics Fraunhofer Institute for Industrial Mathematics ITWM Fraunhofer-Platz 1 67663 Kaiserslautern, Germany Department of Mathematics University of Kaiserslautern Gottlieb-Daimler-Straße 48 67663 Kaiserslautern, Germany E-Mail: simon.schnuerch@itwm.fraunhofer.de
Rights & Permissions [Opens in a new window]

Abstract

The Lee–Carter model has become a benchmark in stochastic mortality modeling. However, its forecasting performance can be significantly improved upon by modern machine learning techniques. We propose a convolutional neural network (NN) architecture for mortality rate forecasting, empirically compare this model as well as other NN models to the Lee–Carter model and find that lower forecast errors are achievable for many countries in the Human Mortality Database. We provide details on the errors and forecasts of our model to make it more understandable and, thus, more trustworthy. As NN by default only yield point estimates, previous works applying them to mortality modeling have not investigated prediction uncertainty. We address this gap in the literature by implementing a bootstrapping-based technique and demonstrate that it yields highly reliable prediction intervals for our NN model.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of The International Actuarial Association
Figure 0

Figure 1: Illustration of an input matrix for the CNN with $x_1 = 0$, $x_A = 100$, $\tau = 10$, as a heat map displaying the log-transformed death rates of English and Welsh females. The colors range from blue (low death rates) to red (high death rates).

Figure 1

Figure 2: CNN with input size (1,101,10) consisting of a convolutional layer of size (10,99,8), a pooling layer of size (10,49,4), a dense layer of size 50 and a dense output layer of size 30. Figure produced using the tool by LeNail (2019).

Figure 2

Figure 3: The process for choosing hyperparameters, training and evaluating models.

Figure 3

Table 1 Goodness-of-fit measures for 54 populations, ages 60–89, years 1997–2006 (models trained on years up to 2006). The best value in each column is marked in bold.

Figure 4

Figure 4: Coefficients of a log-linear global surrogate model for the CNN.

Figure 5

Table 2 Out-of-sample error measures for 54 populations, ages 60–89, years 2007–2016 (models trained on years up to 2006). The best value in each column is marked in bold.

Figure 6

Figure 5: MdAPE by age, year and population of CNN (red circles), FFNN (brown squares), RNN (blue inverted triangles), ACF (green diamonds) and LC20 (magenta triangles).

Figure 7

Figure 6: CNN (red, long dash) and LC20 (magenta, dash) forecasts and ground truth (black, solid) from 2007 to 2016 for Japanese females aged 60, 65, 71, 77, 83, 89.

Figure 8

Table 3 Prediction interval measures over 54 populations, ages 60–89, years 2007–2016 (models trained on years up to 2006).

Figure 9

Figure 7: PICP by age, year and population of CNN (red circles), FFNN (brown squares), RNN (blue inverted triangles) ACF (green diamonds) and LC20 (magenta triangles).

Figure 10

Figure 8: Estimated variances by age, year and population for the forecasts of CNN (red circles), FFNN (brown squares) and RNN (blue inverted triangles).

Figure 11

Figure 9: CNN (red, long dash) and LC20 (magenta, dash) forecasts along with prediction intervals ($a=0.95$; dot for CNN, in (b) also dot and dash for LC20) and ground truth (black, solid) from 2007 to 2016 for ages 60, 65, 71, 77, 83, 89.

Figure 12

Table 4 Robustness check. Out-of-sample error measures for 50 populations, ages 60–89, years 1997–2016 (models trained on years up to 1996). The best value in each column, where applicable, is marked in bold.

Figure 13

Figure 10: Log death rate forecasts of different mortality models and, where available, ground truth for the years 2007 (red, solid), 2016 (lime green, dash), 2026 (blue, dot) and 2036 (violet, long dash; including prediction intervals) for England and Wales.

Supplementary material: PDF

Schnürch and Korn supplementary material

Schnürch and Korn supplementary material

Download Schnürch and Korn supplementary material(PDF)
PDF 5.7 MB