Hostname: page-component-77f85d65b8-g98kq Total loading time: 0 Render date: 2026-03-29T03:27:34.407Z Has data issue: false hasContentIssue false

Mortality forecasting using a Lexis-based state-space model

Published online by Cambridge University Press:  11 September 2020

Patrik Andersson*
Affiliation:
Department of Statistics, Uppsala University, Uppsala, Sweden
Mathias Lindholm
Affiliation:
Department of Mathematics, Stockholm University, Stockholm, Sweden
*
*Corresponding author. E-mail: patrik.andersson@statistics.uu.se
Rights & Permissions [Opens in a new window]

Abstract

A new method of forecasting mortality is introduced. The method is based on the continuous-time dynamics of the Lexis diagram, which given weak assumptions implies that the death count data are Poisson distributed. The underlying mortality rates are modelled with a hidden Markov model (HMM) which enables a fully likelihood-based inference. Likelihood inference is done by particle filter methods, which avoids approximating assumptions and also suggests natural model validation measures. The proposed model class contains as special cases many previous models with the important difference that the HMM methods make it possible to estimate the model efficiently. Another difference is that the population and latent variable variability can be explicitly modelled and estimated. Numerical examples show that the model performs well and that inefficient estimation methods can severely affect forecasts.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© Institute and Faculty of Actuaries 2020
Figure 0

Figure 1 Example of a Lexis diagram.

Figure 1

Figure 2 Trace of the estimate of $\psi^l$ from the SAEM algorithm, Algorithm 3, fitted to Swedish male data from 1930 to 1960. Each line corresponds to a component of the respective vector/matrix of parameters.

Figure 2

Table 1. Calculated values of $\text{R}^{2}_{\text{Dev}}$ for model (M2) fitted to Swedish male data for the years 1930–1960

Figure 3

Figure 3 Fig. 3(a)–(c): First three EPCA components, Swedish males, 1930–1960. In Fig. 3(d)–(f), the number of EPCA components, 1–5, are indicated by lines that are solid/brown, short dashed/dark green, dotted/light green, dash-dotted/dark brown and long dashed/light brown, respectively. Fig. 3(d)–(f) shows, from left to right, $\text{R}^{2,*}_{\text{Dev}}$, $\operatorname{MAE}$ and $\operatorname{MIS}$, calculated in-sample for the period 1930–1960 for Swedish males. Fig. 3(g): In-sample variance produced by the model for simulated mortality rates, Swedish males, age 40, three EPCA components; total variance (solid line), population variance (dashed line). Fig. 3(h): 95% yearly confidence levels for the simulated mortality rates $M^*$ for Swedish males using three EPCA components (grey area), median (solid line), observed mortality rates, $\widehat m$, (circles). Fig. 3(i): Same as in Fig. 3(h), but for the simulated latent M-process.

Figure 4

Figure 4 Fig. 4(a)–(c) shows the out-of-sample analog of Fig. 3(d)–(f), where all parameters have been estimated based on Swedish male data from 1930 to 1960, and the predictions are made for the period 1961 to 2016. Fig. 4(d)–(f): $\text{R}^{2,*}_{\text{Dev}}$ where solid lines correspond to in-sample performance and dashed lines correspond to out-of-sample performance when using three EPCA components – models fitted using data from 1930 to 1960, 1930 to 1990 and 1970 to 2000, respectively. Fig. 4(g)–(i): 95% yearly confidence/prediction intervals (grey area) for simulated mortality rates $M^*$, Swedish males, age 80, three EPCA components, median (solid/dashed line), observed mortality rates, $\widehat m$, (circles) – models fitted using data from 1930 to 1960, 1930 to 1990 and 1970 to 2000, respectively.

Figure 5

Figure 5 In all figures, 95% confidence/prediction intervals for the simulated centralised mortality rates (grey area), median (solid/dashed line) and observed centralised mortality rates (circles). In all figures, from left to right, age 10, age 40 and age 80, respectively. Fig. 5(a)–(c): Swedish males, three EPCA components, model fitted using data from 1970 to 2000. Fig. 5(d)–(f): Swedish females, five EPCA components, model fitted using data from 1950 to 1980. Fig. 5(g)–(i): US females, three EPCA components, model fitted using data from 1950 to 1980.