Hostname: page-component-6766d58669-88psn Total loading time: 0 Render date: 2026-05-18T00:25:24.598Z Has data issue: false hasContentIssue false

Applying the zero-inflated Poisson model with random effects to detect abnormal rises in school absenteeism indicating infectious diseases outbreak

Published online by Cambridge University Press:  30 May 2018

X. X. Song
Affiliation:
School of Public Health, Fudan University, Shanghai, China Key Laboratory of Public Health Safety (Fudan University), Ministry of Education, Shanghai, China
Q. Zhao
Affiliation:
School of Public Health, Fudan University, Shanghai, China Key Laboratory of Public Health Safety (Fudan University), Ministry of Education, Shanghai, China
T. Tao
Affiliation:
School of Public Health, Fudan University, Shanghai, China Key Laboratory of Public Health Safety (Fudan University), Ministry of Education, Shanghai, China
C. M. Zhou
Affiliation:
School of Public Health, Fudan University, Shanghai, China Key Laboratory of Public Health Safety (Fudan University), Ministry of Education, Shanghai, China
V. K. Diwan
Affiliation:
Division of Global Health (IHCAR), Department of Public Health Sciences, Karolinska Institutet, Stockholm, Sweden
B. Xu*
Affiliation:
School of Public Health, Fudan University, Shanghai, China Key Laboratory of Public Health Safety (Fudan University), Ministry of Education, Shanghai, China Division of Global Health (IHCAR), Department of Public Health Sciences, Karolinska Institutet, Stockholm, Sweden
*
Author for correspondence: Biao Xu, E-mail: bxu@shmu.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Records of absenteeism from primary schools are valuable data for infectious diseases surveillance. However, the analysis of the absenteeism is complicated by the data features of clustering at zero, non-independence and overdispersion. This study aimed to generate an appropriate model to handle the absenteeism data collected in a European Commission granted project for infectious disease surveillance in rural China and to evaluate the validity and timeliness of the resulting model for early warnings of infectious disease outbreak. Four steps were taken: (1) building a ‘well-fitting’ model by the zero-inflated Poisson model with random effects (ZIP-RE) using the absenteeism data from the first implementation year; (2) applying the resulting model to predict the ‘expected’ number of absenteeism events in the second implementation year; (3) computing the differences between the observations and the expected values (O–E values) to generate an alternative series of data; (4) evaluating the early warning validity and timeliness of the observational data and model-based O–E values via the EARS-3C algorithms with regard to the detection of real cluster events. The results indicate that ZIP-RE and its corresponding O–E values could improve the detection of aberrations, reduce the false-positive signals and are applicable to the zero-inflated data.

Information

Type
Original Paper
Copyright
Copyright © Cambridge University Press 2018 
Figure 0

Fig. 1. Flowchart of identifying and evaluating ZIP-RE model to detect abnormal in school absenteeism.

Figure 1

Fig. 2. Histogram of daily absence data from April 2012 to March 2013 Note: The percentages in the figure showed the level of zero% in each month.

Figure 2

Table 1. Model selection results from ZIP-RE and ZINB-RE based on four information criteria and two random effects

Figure 3

Fig. 3. The time trends and key indicators of the observational data and O–E values analysed via EARS-3C Note: The topmost plot showed the time series of the raw observations of the numbers of absences (O) and the new index values (O–E). The red points indicated the starting dates of 18 cluster events. The EARS-3C results were shown in the bottom three plots (EARS-C1, EARS-C2 and EARS-C3). The red lines represented the thresholds for an alarm signal (3, 3 and 2). The open circles in the figure highlighted an inconsistency between the time series.

Figure 4

Table 2. Comparison of the early warning abilities between the observational data and the generated O–E values using the EARS-3C with regard to the real cluster events

Supplementary material: File

Song et al. supplementary material

Appendix A

Download Song et al. supplementary material(File)
File 22 KB