Hostname: page-component-6766d58669-fx4k7 Total loading time: 0 Render date: 2026-05-16T03:51:44.724Z Has data issue: false hasContentIssue false

A NEW MULTIVARIATE ZERO-INFLATED HURDLE MODEL WITH APPLICATIONS IN AUTOMOBILE INSURANCE

Published online by Cambridge University Press:  07 January 2022

Pengcheng Zhang
Affiliation:
School of Insurance, Shandong University of Finance and Economics, Jinan 250014, China, E-Mail: qingdaozpc@163.com
David Pitt
Affiliation:
Centre for Actuarial Studies, Department of Economics, The University of Melbourne, Melbourne, VIC 3010, Australia, E-Mail: david.pitt@unimelb.edu.au
Xueyuan Wu*
Affiliation:
Centre for Actuarial Studies, Department of Economics, The University of Melbourne, Melbourne, VIC 3010, Australia, E-Mail: xueyuanw@unimelb.edu.au
Rights & Permissions [Opens in a new window]

Abstract

The fact that a large proportion of insurance policyholders make no claims during a one-year period highlights the importance of zero-inflated count models when analyzing the frequency of insurance claims. There is a vast literature focused on the univariate case of zero-inflated count models, while work in the area of multivariate models is considerably less advanced. Given that insurance companies write multiple lines of insurance business, where the claim counts on these lines of business are often correlated, there is a strong incentive to analyze multivariate claim count models. Motivated by the idea of Liu and Tian (Computational Statistics and Data Analysis, 83, 200–222; 2015), we develop a multivariate zero-inflated hurdle model to describe multivariate count data with extra zeros. This generalization offers more flexibility in modeling the behavior of individual claim counts while also incorporating a correlation structure between claim counts for different lines of insurance business. We develop an application of the expectation–maximization (EM) algorithm to enable the statistical inference necessary to estimate the parameters associated with our model. Our model is then applied to an automobile insurance portfolio from a major insurance company in Spain. We demonstrate that the model performance for the multivariate zero-inflated hurdle model is superior when compared to several alternatives.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Author(s), 2022. Published by Cambridge University Press on behalf of The International Actuarial Association
Figure 0

Table 1. The description for explanatory variables.

Figure 1

Table 2. The empirical joint distribution of $Z_1$ and $Z_2$.

Figure 2

Table 3. Goodness-of-fit of marginal models.

Figure 3

Table 4. Estimates and t-ratios associated with the covariates of $\pi_j$ in MZIH model.

Figure 4

Table 5. Estimates and t-ratios associated with the covariates of $W_j$ in MZIH model.

Figure 5

Table 6. Observed and expected frequencies of MZIH model for the joint distribution of $Z_1$ and $Z_2$.

Figure 6

Table 7. Information criteria of four fitted models.

Figure 7

Table 8. Predicted frequencies of four models under four different scenarios.

Figure 8

Table 9. Information criteria of six candidate models.