Hostname: page-component-6766d58669-mzsfj Total loading time: 0 Render date: 2026-05-14T11:05:56.546Z Has data issue: false hasContentIssue false

Modeling mortality with Kernel Principal Component Analysis (KPCA) method

Published online by Cambridge University Press:  03 December 2024

Yuanqi Wu
Affiliation:
School of Physical & Mathematical Sciences, Nanyang Technological University, Singapore
Andrew Chen
Affiliation:
Department of Electrical and Electronic Engineering, Imperial College London, London, UK
Yanbin Xu
Affiliation:
Division of Banking & Finance, Nanyang Business School, Nanyang Technological University, Singapore
Guangming Pan
Affiliation:
School of Physical & Mathematical Sciences, Nanyang Technological University, Singapore
Wenjun Zhu*
Affiliation:
Division of Banking & Finance, Nanyang Business School, Nanyang Technological University, Singapore
*
Corresponding author: Wenjun Zhu; Email: wjzhu@ntu.edu.sg
Rights & Permissions [Opens in a new window]

Abstract

As the global population continues to age, effective management of longevity risk becomes increasingly critical for various stakeholders. Accurate mortality forecasting serves as a cornerstone for addressing this challenge. This study proposes to leverage Kernel Principal Component Analysis (KPCA) to enhance mortality rate predictions. By extending the traditional Lee-Carter model with KPCA, we capture nonlinear patterns and complex relationships in mortality data. The newly proposed KPCA Lee-Carter algorithm is empirically tested and demonstrates superior forecasting performance. Furthermore, the model’s robustness was tested during the COVID-19 pandemic, showing that the KPCA Lee-Carter algorithm effectively captures increased uncertainty during extreme events while maintaining narrower prediction intervals. This makes it a valuable tool for mortality forecasting and risk management. Our findings contribute to the growing body of literature where actuarial science intersects with statistical learning, offering practical solutions to the challenges posed by an aging world population.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Institute and Faculty of Actuaries
Figure 0

Figure 1 Life expectancy at birth (total) of the world and different regions (Source: World Bank).

Figure 1

Algorithm 1: KPCA Lee–Carter Algorithm for Mortality Forecasting

Figure 2

Figure 2 Log-mortality rate by year and age, with different colors representing age groups. The oldest age group is shown in violet, while the youngest is highlighted in red.

Figure 3

Table 1. Validation result for Kernel Principal Component Analysis 1 principal component model

Figure 4

Table 2. Validation result for Kernel Principal Component Analysis 2 principal component model

Figure 5

Table 3. Mean absolute percentage error (MAPE) for out-sample test set on all models

Figure 6

Table 4. Proportion of variance for Kernel Principal Component Analysis 2 principal component model

Figure 7

Figure 3 $\beta _x$ estimated across models.

Figure 8

Table 5. Projected $\kappa _t$ at 2021

Figure 9

Figure 4 $\kappa _t$ estimated across models.

Figure 10

Figure 5 Fan chart of $\kappa _t$ estimated across models, 95% CI.

Figure 11

Figure 6 $\beta ^{(1)}$ and $\beta ^{(2)}$ estimated by 2 principal components Kernel Principal Component Analysis model.

Figure 12

Figure 7 Fan plots of $\kappa _t^{(1)}$ and $\kappa _t^{(2)}$ estimated by 2 principal components Kernel Principal Component Analysis model, 95% CI.

Figure 13

Table 6. Model comparison for other countries

Figure 14

Table 7. Projected remaining life expectancy at age 60

Figure 15

Figure 8 Projected remaining life expectancy based on COVID inclusion and exclusion datasets.