Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T07:48:58.981Z Has data issue: false hasContentIssue false

Estimating lengths-of-stay of hospitalised COVID-19 patients using a non-parametric model: a case study in Galicia (Spain)

Published online by Cambridge University Press:  27 April 2021

Ana López-Cheda
Affiliation:
Universidade da Coruña, CITIC, MODES, A Coruña, Spain
María-Amalia Jácome*
Affiliation:
Universidade da Coruña, CITIC, MODES, A Coruña, Spain
Ricardo Cao
Affiliation:
Universidade da Coruña, CITIC, ITMATI, MODES, A Coruña, Spain
Pablo M. De Salazar
Affiliation:
Department of Epidemiology, Center for Communicable Disease Dynamics, Harvard School of Public Health, Boston, USA
*
Author for correspondence: María-Amalia Jácome, E-mail: maria.amalia.jacome@udc.es
Rights & Permissions [Opens in a new window]

Abstract

Estimating the lengths-of-stay (LoS) of hospitalised COVID-19 patients is key for predicting the hospital beds’ demand and planning mitigation strategies, as overwhelming the healthcare systems has critical consequences for disease mortality. However, accurately mapping the time-to-event of hospital outcomes, such as the LoS in the intensive care unit (ICU), requires understanding patient trajectories while adjusting for covariates and observation bias, such as incomplete data. Standard methods, such as the Kaplan-Meier estimator, require prior assumptions that are untenable given current knowledge. Using real-time surveillance data from the first weeks of the COVID-19 epidemic in Galicia (Spain), we aimed to model the time-to-event and event probabilities of patients’ hospitalised, without parametric priors and adjusting for individual covariates. We applied a non-parametric mixture cure model and compared its performance in estimating hospital ward (HW)/ICU LoS to the performances of commonly used methods to estimate survival. We showed that the proposed model outperformed standard approaches, providing more accurate ICU and HW LoS estimates. Finally, we applied our model estimates to simulate COVID-19 hospital demand using a Monte Carlo algorithm. We provided evidence that adjusting for sex, generally overlooked in prediction models, together with age is key for accurately forecasting HW and ICU occupancy, as well as discharge or death outcomes.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press
Figure 0

Fig. 1. Estimates of the survival function of LoS using NP-MCM (thick black line), KM with the complete dataset (thin light grey line), KM with the reduced dataset (thin dark grey line) and the E estimator (red line) for all the COVID-19 hospitalised cases (n = 2453) in Galicia (Spain), when the LoS is the time of hospitalisation both in HW and ICU (top left), time in HW until admission to ICU (top right), time in HW until death in HW (middle left), time in HW until discharge (middle right), time in ICU until death in ICU (bottom left) and time in ICU until discharge from ICU (bottom right). The NP-MCM and KM estimates give the same result for the time of hospitalisation, and are represented with a single thick black line (top left).

Figure 1

Table 1. Estimated probabilities of the different medical events for the COVID-19 patients in Galicia (Spain) using NP-MCM and empirical estimators

Figure 2

Fig. 2. Generalised product-limit estimator [13] of the conditional survival function S(t|x) for the time of hospitalisation, both in HW and ICU (top) and the time in ICU (bottom), incorporating the effect of the sex (male = black line, female = red line) and the ages 40 years (left) and 70 years (right) for all the COVID-19 hospitalised cases (n = 2453) in Galicia (Spain).

Figure 3

Fig. 3. Number of patients in HW (top left), ICU (top right), deaths (bottom left) and discharges (bottom right) computed from 1000 simulated COVID-19 outbreaks for a period of 200 days, when the LoS are simulated depending on age and sex (blue), and unconditionally ignoring age and sex dependence (red). The real case counts of inpatients in the COVID-19 dataset from March 6th (day 1) to May 7th 2020 (day 62), rescaled to N = 1000 infected people, are also included for reference (solid black line).

Figure 4

Fig. 4. Number of days when the demand for beds is above the capacity in HW (left) and ICU (right), for different possible capacities and computed from 1000 simulated COVID-19 outbreaks. The demand was simulated conditionally depending on age and sex (blue), and unconditionally ignoring age and sex dependence (red).

Supplementary material: File

López-Cheda et al. supplementary material

López-Cheda et al. supplementary material

Download López-Cheda et al. supplementary material(File)
File 320 KB