Hostname: page-component-89b8bd64d-n8gtw Total loading time: 0 Render date: 2026-05-09T13:00:15.172Z Has data issue: false hasContentIssue false

Individual claims reserving using the Aalen–Johansen estimator

Published online by Cambridge University Press:  03 December 2024

Martin Bladt
Affiliation:
Department of Mathematical Sciences, University of Copenhagen, Copenhagen, Denmark
Gabriele Pittarello*
Affiliation:
Department of Statistical Sciences, Sapienza University of Rome, Rome, Italy
*
Corresponding author: Gabriele Pittarello; Email: gabriele.pittarello@uniroma1.it
Rights & Permissions [Opens in a new window]

Abstract

We propose an individual claims reserving model based on the conditional Aalen–Johansen estimator, as developed in Bladt and Furrer ((2023a) arXiv:2303.02119.). In our approach, we formulate a multi-state problem, where the underlying variable is the individual claim size, rather than time. The states in this model represent development periods, and we estimate the cumulative density function of individual claim sizes using the conditional Aalen–Johansen method as transition probabilities to an absorbing state. Our methodology reinterprets the concept of multi-state models and offers a strategy for modeling the complete curve of individual claim sizes. To illustrate our approach, we apply our model to both simulated and real datasets. Having access to the entire dataset enables us to support the use of our approach by comparing the predicted total final cost with the actual amount, as well as evaluating it in terms of the continuously ranked probability score.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of The International Actuarial Association
Figure 0

Figure 1. Example of multi-state model for claims reserving, with $k=5$. We interpret time spent in each state as increasing claims size clock, instead of calendar time. The states represent the development periods (DP’s) of the individual claims.

Figure 1

Figure 2. Comparison of the true severity curve (dark grey dotted line) to the fitted severity curve for different data sizes in the 4 simulated scenarios. The red curve is fitted on $120 (k-1)$ RBNS claims, the blue curve is fitted on 1200 RBNS claims. Z is rounded by millions.

Figure 2

Figure 3. Box plots of EI for AJ and CL over the 20 simulations, for each value of k, in the Alpha scenario (left column) and the Beta scenario (right column).

Figure 3

Table 1. Results for scenario Alpha. For each value of k (column one) we present the average results over the 20 simulations. Each row of the table corresponds to a different AJ model specification (column three). The table includes the (average) actual Y$^{\texttt{TOT}}$ simulated total cost (column four) and the error incidence for the AJ and the CL (columns five and six). In columns seven and eight, we reported the coefficients of variation of Y$^{\texttt{TOT}}$. The results for the CRPS are reported in column nine.

Figure 4

Table 2. Results for scenario Beta. For each value of k (column one), we present the average results over the 20 simulations. Each row of the table corresponds to a different AJ model specification (column three). The table includes the (average) actual simulated total cost (column four) and the error incidence for the AJ and the CL (columns five and six). In columns seven and eight, we reported the coefficients of variation of Y$^{\texttt{TOT}}$. The results for the CRPS, relative to the AJ model with features, are reported in column nine.

Figure 5

Table 3. Description of the real dataset.

Figure 6

Figure 4. Exploratory data analysis on the real dataset that we use in this section. We show the relative frequencies by type of claim Figure (a), the distribution of the number of payments Figure (b), the density plot of the total individual claim size Figure (c) and the distribution of the settlement delay from accident Figure (d).

Figure 7

Table 4. For different choices of k (column one), we fit a model with and without $\texttt{claim_type}$ (column two). The target Y$^{\texttt{TOT}}$ is shown in column three. The EI is shown in columns four and five. The CV of Y$^{\texttt{TOT}}$ are displayed in columns seven and eight for the AJ and CL respectively. The CRPS is shown in column nine.

Figure 8

Figure 5. For each different dimension $k=4,5,6,7,8$, we provide the individual claim size curves for our observations by covariate value $\texttt{claim_type}$. The x-axis represents Z and we scale it by $\texttt{log2}$ to ease the plot visualization.

Figure 9

Table 5. We selected for each data k (column one), the best-performing model in terms of CRPS and present the reserve (column one) and the standard deviation of the reserve (column two).

Figure 10

Table 6. For the dataset with depth 5 accident periods, we fit the AJ model both including and excluding the $\texttt{claim_type}$ feature (column two). The target Y$^{\texttt{TOT}}$ is displayed in column three. The EI for the AJ and the CL can be found in columns four and five. The coefficient of variation of Y$^{\texttt{TOT}}$ is displayed in columns six and seven. The CRPS is displayed in column eight.

Figure 11

Table A1. Results for scenario Beta. For each value of k (column one) we present the average results over the 20 simulations. Each row of the table corresponds to a different model (column two). The table includes the (average) actual simulated total cost (column three) and the error incidence for the AJ, the CL, and $\texttt{hirem}$ (columns four). In columns five we show the average relative variation and in column six we reported the average CRPS relative to the AJ CRPS.

Figure 12

Table A2. Description of the $\texttt{hirem}$ data.

Figure 13

Table A3. The hirem package includes four data generators in four different scenarios with $k=10$ (column one). We compare our models (AJ with and without features, column two) to the model in Crevecoeur et al. (2023) and the CL (column three). The actual Y$^{\texttt{TOT}}$ target is reported in column four. We show the EI and the CRPS results in columns five and seven, respectevely. The predicted relative variation of Y$^{\texttt{TOT}}$ is shown in column six.