Hostname: page-component-5db58dd55d-ggg9q Total loading time: 0 Render date: 2026-05-30T15:34:54.813Z Has data issue: false hasContentIssue false

What KAN mortality say: smooth and interpretable mortality modeling using Kolmogorov−Arnold networks

Published online by Cambridge University Press:  28 November 2025

Lianzeng Zhang
Affiliation:
Nankai-Taikang College of Insurance and Actuarial Science Nankai University Tianjin 300350, P. R. China
Yuan Zhuang*
Affiliation:
School of Risk and Actuarial Studies UNSW Sydney Sydney, Australia Department of Actuarial Science, School of Finance Nankai University Tianjin 300350, P. R. China
*
Corresponding author: Yuan Zhuang; Email: yuan.zhuang4@unsw.edu.au
Rights & Permissions [Opens in a new window]

Abstract

In machine learning-based mortality models, interpretation methods are well established, and they can reveal structures resembling the age or time effects in traditional mortality models. However, in the reverse direction, using such traditional components to guide the initialization of a neural network remains highly challenging due to information loss during model interpretation. This study addresses this gap by exploring how components from pre-fitted traditional mortality models can be used to initialize neural networks, enabling structural information to be incorporated into a deep learning framework. We introduce Kolmogorov–Arnold Networks (KAN) and first construct two shallow models, KAN[2,1] and ARIMAKAN, to examine their applicability to mortality modeling. We then extend the Combined Actuarial Neural Network (CANN) into a KAN-based Actuarial Neural Network (KANN), in which classical model components calibrated via generalized nonlinear models or generalized additive models are naturally used for initialization. Three KANN variants, namely KANN[2,1], KANNLC, and KANNAPC, are proposed. In these models, neural networks assist in improving the accuracy of traditional models and help refine the original parameter estimates. All KANN-based models can also produce smooth mortality curves as well as smooth age, period, and cohort effects through simple regularization. Experiments on 34 populations demonstrate that KAN-based approaches achieve stable performance while balancing interpretability, smoothness, and predictive accuracy.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of The International Actuarial Association
Figure 0

Figure 1. Sample KAN with three KAN layers (four node layers).

Figure 1

Figure 2. KANN[2,1] with a deep part of KAN[2,8,8,1]. The dashed arrows indicate the components of KANN. $\phi_{\text{age}}(x)$ and $\phi_{\text{year}}(t)$ are connected to the nodes in the final layer, forming an additive relationship with the deep part.

Figure 2

Figure 3. KANNLC architecture. $\phi_{\text{age}_2}(x)$ and $\phi_{\text{year}}(t)$ are multiplied together and connected to the final layer.

Figure 3

Figure 4. KANNAPC architecture. Age x is transformed into $-x$ after passing through $\phi_{\text{age}_2}(x)$, while year t remains t after passing through $\phi_{\text{year}_2}(t)$. The two are then added to obtain the birth year $\gamma$. The birth year (cohort) is transformed by $\phi_{\text{cohort}}(\gamma)$ and connected to the nodes in the final layer.

Figure 4

Figure 5. $\lambda$’s impact on fitted mortality from KAN[2,1] on 1975 US female population.

Figure 5

Table 1. Hyperparameter configurations for KAN-based models. The “Candidate Hyperparameters” column lists the hyperparameters to be tuned; if an entry is NA, no tuning is performed. The “Fixed Hyperparameters” column lists those kept constant throughout training.

Figure 6

Figure 6. Interpretation and outputs of KAN[2,1] and KANN[2,1], fitted on female population of Netherlands. Panel (a) shows the estimated age effects $\phi_{\text{age}}(x)$ from KAN[2,1] and KANN[2,1], compared with the GAM-based $f_{\text{age}}(x)$. Panel (b) presents the estimated time effects $\phi_{\text{year}}(t)$ from KAN[2,1] and KANN[2,1], along with the GAM-based $f_{\text{year}}(t)$. Panel (c) illustrates the deep-part outputs of KANN[2,1] across ages and years. Panel (d) compares the predicted mortality curves for 2019 (the last year of the test set) from five models. For KAN[2,1], KANN[2,1], and LSTM, the results are based on a single run.

Figure 7

Figure 7. Decomposition of KAN[2,1]’s trainable activation functions, fitted on Netherlands female mortality; $\lambda = 10^{-4}$.

Figure 8

Figure 8. Interpretation and outputs of KANNLC and KANNAPC. Panel (a) and panel (b) show the shallow-part outputs. Panel (c) presents the deep-part heatmap of KANNLC (left) and KANNAPC (right). Panels (d) and (e) provide predictions in 2019 with zoomed-in views on age interval 10–25.

Figure 9

Table 2. Model rankings and parameter complexity. Mean ranks are reported with standard deviations in parentheses; mean ranks close to 1 indicate stronger overall performance across populations. Parameter counts reflect the architectures ultimately adopted in mortality forecasting on test set, rather than the smaller or larger values that may have arisen during validation but were not retained.

Figure 10

Figure 9. Model stability across random initialization.

Figure 11

Figure 10. Predicted mortality curves from different KAN-based models for Italian female population in 2019.

Figure 12

Table A1. Countries and regions selected from Human Mortality Database.

Supplementary material: File

Zhang and Zhuang supplementary material 1

Zhang and Zhuang supplementary material
Download Zhang and Zhuang supplementary material 1(File)
File 2.1 MB
Supplementary material: File

Zhang and Zhuang supplementary material 2

Zhang and Zhuang supplementary material
Download Zhang and Zhuang supplementary material 2(File)
File 679 KB