Hostname: page-component-6766d58669-nf276 Total loading time: 0 Render date: 2026-05-20T07:55:27.065Z Has data issue: false hasContentIssue false

Analysis of a SARIMA-XGBoost model for hand, foot, and mouth disease in Xinjiang, China

Published online by Cambridge University Press:  04 December 2023

Haojie Man
Affiliation:
School of Mathematics and Statistics, Beijing Institute of Technology, Beijing, China
Hanting Huang
Affiliation:
School of Mathematical Sciences, Beihang University, Beijing, China
Zhuangyan Qin
Affiliation:
College of Mathematics and System Science, Xinjiang University, Urumqi, China
Zhiming Li*
Affiliation:
College of Mathematics and System Science, Xinjiang University, Urumqi, China
*
Corresponding author: Zhiming Li; Email: zmli@xju.edu.cn
Rights & Permissions [Opens in a new window]

Abstract

Hand, foot, and mouth disease (HFMD) is a common childhood infectious disease. The incidence of HFMD has a pronounced seasonal tendency and is closely related to meteorological factors such as temperature, rainfall, and wind speed. In this paper, we propose a combined SARIMA-XGBoost model to improve the prediction accuracy of HFMD in 15 regions of Xinjiang, China. The SARIMA model is used for seasonal trends, and the XGBoost algorithm is applied for the nonlinear effects of meteorological factors. The geographical and temporal weighted regression model is designed to analyze the influence of meteorological factors from temporal and spatial perspectives. The analysis results show that the HFMD exhibits seasonal characteristics, peaking from May to August each year, and the HFMD incidence has significant spatial heterogeneity. The meteorological factors affecting the spread of HFMD vary among regions. Temperature and daylight significantly impact the transmission of the disease in most areas. Based on the verification experiment of forecasting, the proposed SARIMA-XGBoost model is superior to other models in accuracy, especially in regions with a high incidence of HFMD.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2023. Published by Cambridge University Press
Figure 0

Figure 1. The HFMD incidence in various regions of Xinjiang province.

Figure 1

Figure 2. The HFMD cases by the month of illness onset, standardized by the number of annual cases.

Figure 2

Figure 3. Monthly average values of meteorological indicators in Xinjiang regions.

Figure 3

Table 1. Spatial autocorrelation analysis of HFMD in Xinjiang from 2008 to 2018

Figure 4

Table 2. The mean values of GTWR standardized coefficients of each meteorological variable

Figure 5

Figure 4. The flow chart of the combined SARIMA-XGBoost model.

Figure 6

Table 3. Diagnosis of SARIMA (p, d, q) × (P, D, Q)s model

Figure 7

Table 4. The feature important and percentage of meteorological variables for each region

Figure 8

Figure 5. The fitting result graph for the training set in Urumqi.

Figure 9

Figure 6. The prediction results of Urumqi in five models.

Figure 10

Figure 7. Evaluation results of different models for the prediction of 15 regions in Xinjiang.