Hostname: page-component-77f85d65b8-hzqq2 Total loading time: 0 Render date: 2026-04-18T09:07:12.898Z Has data issue: false hasContentIssue false

Interpretable zero-inflated neural network models for predicting admission counts

Published online by Cambridge University Press:  26 March 2024

Alex Jose*
Affiliation:
School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK Maxwell Institute for Mathematical Sciences, Edinburgh, UK
Angus S. Macdonald
Affiliation:
School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK Maxwell Institute for Mathematical Sciences, Edinburgh, UK
George Tzougas
Affiliation:
School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK Maxwell Institute for Mathematical Sciences, Edinburgh, UK
George Streftaris
Affiliation:
School of Mathematical and Computer Sciences, Heriot-Watt University, Edinburgh, UK Maxwell Institute for Mathematical Sciences, Edinburgh, UK
*
Corresponding author: Alex Jose; Email: aj61@hw.ac.uk
Rights & Permissions [Opens in a new window]

Abstract

In this paper, we construct interpretable zero-inflated neural network models for modeling hospital admission counts related to respiratory diseases among a health-insured population and their dependants in the United States. In particular, we exemplify our approach by considering the zero-inflated Poisson neural network (ZIPNN), and we follow the combined actuarial neural network (CANN) approach for developing zero-inflated combined actuarial neural network (ZIPCANN) models for modeling admission rates, which can accommodate the excess zero nature of admission counts data. Furthermore, we adopt the LocalGLMnet approach (Richman & Wüthrich (2023). Scandinavian Actuarial Journal, 2023(1), 71–95.) for interpreting the ZIPNN model results. This facilitates the analysis of the impact of a number of socio-demographic factors on the admission rates related to respiratory disease while benefiting from an improved predictive performance. The real-life utility of the methodologies developed as part of this work lies in the fact that they facilitate accurate rate setting, in addition to offering the potential to inform health interventions.

Information

Type
Original Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Institute and Faculty of Actuaries
Figure 0

Table 1. Frequency of number of admissions related to respiratory diseases

Figure 1

Table 2. Description of variables in the admission data set

Figure 2

Table 3. Summary of the variable selection process of the logistic component of the ZIP regression model

Figure 3

Table 4. List of covariates and the corresponding coefficient parameters in both the components of the ZIP regression model

Figure 4

Table 5. Predictive performance of Poisson and ZIP regression models and network-based models with and without an additional layer for interpretation based on NLL. The empirical mean of the observed data is 0.0027

Figure 5

Figure 1 An illustration of a sample ZIPNN model with three hidden layers and 20,15,10,2 neurons in each layer.

Figure 6

Listing A1. Structure of the ZIPCANN model.

Figure 7

Figure 2 An illustration of a sample ZIPCANN model with skip connections, one-hot encoding, and 20,15,10,2 neurons in each of the layers.

Figure 8

Listing A2. Negative log-likelihood loss function used for training zero-inflated neural network models.

Figure 9

Figure 3 An illustration of a sample LocalGLMnet model with LocalGLM layer, one-hot encoding, and 20,15,10 neurons in each of the layers.

Figure 10

Figure 4 Graphical representation of covariate contribution from LocalGLMnet model for the testing set (a) age variable; (b) male, female; the blue line indicates a spline fit approximate curve and the yellow line has been added as a reference line at levels -0.25 and 0.25.

Figure 11

Figure 5 Age-wise crude rates of admission due to respiratory diseases for male and female patients for the entire data set.

Figure 12

Figure 6 An illustration of an interpretable ZIPNN model with regression attention, one-hot encoding, and 20,15,10 neurons in each of the layers.

Figure 13

Table 6. Coefficient estimates based on the ZIP regression model with the significance codes ("***,""**,""*," ".,"" ") indicating the level of significance of the estimates at levels (0, 0.001,0.01, 0.05, 0.1, 1 )

Figure 14

Figure 7 Graphical representation of covariate contributions for parameter $\lambda$ in the ZIPNN model: (a) AGE; (b) SEX; (c) UR; (d) REGION; (e) EECLASS; (f) EESTATU.

Figure 15

Figure 7 (continued). Graphical representation of covariate contributions for parameter $\lambda$ in the ZIPNN model: (a) EMPREL; (b) PLANTYP.

Figure 16

Figure 8 Graphical representation of covariate contributions for parameter $p$ in the ZIPNN model: (a) AGE; (b) SEX; (c) UR; (d) REGION; (e) EECLASS; (f) EESTATU.

Figure 17

Figure 8 (continued). Graphical representation of covariate contributions for parameter $p$ in the ZIPNN model: (a) EMPREL; (b) PLANTYP.

Figure 18

Table A.1. The testing loss, learning loss, and average fitted mean of the ZIPNN and ZIPCANN models with (50,35,25), (35,25,20), (25,20,15), (20,15,10), and (15,10,5) neurons in the initial three hidden layers.

Figure 19

Listing A3. Code for implementing ZIPNN.

Figure 20

Listing A4. Code for implementing ZIPCANN.

Figure 21

Listing A5. Structure of the ZIPCANN model.

Figure 22

Listing A6. Code for implementing LocalGLMnet.

Figure 23

Listing A7. Code for interpretable ZIPNN model.

Figure 24

Listing A8. Structure of the interpretable ZIPNN model.