Hostname: page-component-89b8bd64d-x2lbr Total loading time: 0 Render date: 2026-05-07T10:33:05.984Z Has data issue: false hasContentIssue false

Nonparametric intercept regularization for insurance claim frequency regression models

Published online by Cambridge University Press:  05 January 2024

Gee Y. Lee*
Affiliation:
Department of Statistics and Probability, Department of Mathematics, Michigan State University, East Lansing, MI, USA
Himchan Jeong
Affiliation:
Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, Canada
*
Corresponding author: Gee Y. Lee; Email: leegee@msu.edu
Rights & Permissions [Opens in a new window]

Abstract

In a subgroup analysis for an actuarial problem, the goal is for the investigator to classify the policyholders into unique groups, where the claims experience within each group are made as homogenous as possible. In this paper, we illustrate how the alternating direction method of multipliers (ADMM) approach for subgroup analysis can be modified so that it can be more easily incorporated into an insurance claims analysis. We present an approach to penalize adjacent coefficients only and show how the algorithm can be implemented for fast estimation of the parameters. We present three different cases of the model, depending on the level of dependence among the different coverage groups within the data. In addition, we provide an interpretation of the credibility problem using both random effects and fixed effects, where the fixed effects approach corresponds to the ADMM approach to subgroup analysis, while the random effects approach represents the classic Bayesian approach. In an empirical study, we demonstrate how these approaches can be applied to real data using the Wisconsin Local Government Property Insurance Fund data. Our results show that the presented approach to subgroup analysis could provide a classification of the policyholders that improves the prediction accuracy of the claim frequencies in case other classifying variables are unavailable in the data.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Institute and Faculty of Actuaries
Figure 0

Figure 1. The SCAD and MCP penalty functions with different $\kappa$ values.

Figure 1

Algorithm 1: A modified ADMM for fast and stable parameter estimation

Figure 2

Table 1. Out-of-sample validation results with simulated Coverage 1

Figure 3

Table 2. Out-of-sample validation results with simulated Coverage 2

Figure 4

Table 3. Out-of-sample validation results with simulated Coverage 3

Figure 5

Table 4. Summary of coverage amounts by year in millions of dollars

Figure 6

Table 5. Summary of claim frequencies by year

Figure 7

Table 6. Number of observations by the entity type variable

Figure 8

Figure 2. Relationship between coverage amounts and claim frequencies.

Figure 9

Table 7. Number of policies with positive coverage each year

Figure 10

Figure 3. Histogram plots of the Cox–Snell residuals for a basic GLM.

Figure 11

Table 8. Spearman correlations with the hold out sample frequencies

Figure 12

Table 9. Number of unique fixed-effect coefficients

Figure 13

Table 10. Number of groups in the intercepts of the individual effects model

Figure 14

Figure 4. Solution paths for the six coverage groups without entity type predictors.

Figure 15

Figure 5. Solution paths for the six coverage groups with entity type predictors.

Figure 16

Figure A.1. An illustration of the potential difficulties of direct optimization.

Figure 17

Algorithm 0: A naive ADMM algorithm for parameter estimation

Figure 18

Table E.1. Computation time for simulated Coverage 1 (in sec)

Figure 19

Table E.2. Computation time for simulated Coverage 2 (in sec)

Figure 20

Table E.3. Computation time for simulated Coverage 3 (in sec)