Hostname: page-component-6766d58669-7cz98 Total loading time: 0 Render date: 2026-05-17T10:11:08.676Z Has data issue: false hasContentIssue false

Development and validation of a clinical and genetic model for predicting risk of severe COVID-19

Published online by Cambridge University Press:  02 July 2021

Gillian S. Dite*
Affiliation:
Genetic Technologies Ltd, Fitzroy, Victoria, Australia
Nicholas M. Murphy
Affiliation:
Genetic Technologies Ltd, Fitzroy, Victoria, Australia
Richard Allman
Affiliation:
Genetic Technologies Ltd, Fitzroy, Victoria, Australia
*
Author for correspondence: Gillian S. Dite, E-mail: gillian.dite@gtglabs.com
Rights & Permissions [Opens in a new window]

Abstract

Clinical and genetic risk factors for severe coronavirus disease 2019 (COVID-19) are often considered independently and without knowledge of the magnitudes of their effects on risk. Using severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2) positive participants from the UK Biobank, we developed and validated a clinical and genetic model to predict risk of severe COVID-19. We used multivariable logistic regression on a 70% training dataset and used the remaining 30% for validation. We also validated a previously published prototype model. In the validation dataset, our new model was associated with severe COVID-19 (odds ratio per quintile of risk = 1.77, 95% confidence interval (CI) 1.64–1.90) and had acceptable discrimination (area under the receiver operating characteristic curve = 0.732, 95% CI 0.708–0.756). We assessed calibration using logistic regression of the log odds of the risk score, and the new model showed no evidence of over- or under-estimation of risk (α = −0.08; 95% CI −0.21−0.05) and no evidence or over-or under-dispersion of risk (β = 0.90, 95% CI 0.80–1.00). Accurate prediction of individual risk is possible and will be important in regions where vaccines are not widely available or where people refuse or are disqualified from vaccination, especially given uncertainty about the extent of infection transmission among vaccinated people and the emergence of SARS-CoV-2 variants of concern.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press
Figure 0

Table 1. Characteristics of cases and controls in the training and validation datasets for the variables considered for inclusion in the new model

Figure 1

Table 2. Adjusted ORs and odds per adjusted standard deviation for the risk factors in the age and sex model for risk of severe COVID-19 in the training dataset and adjusted ORs in the validation dataset

Figure 2

Table 3. Adjusted ORs and odds per adjusted standard deviation for the risk factors in the new model for risk of severe COVID-19 in the training dataset and adjusted ORs in the validation dataset

Figure 3

Fig. 1. Flow chart of risk factor selection for the development of the new model in the training dataset.

Figure 4

Fig. 2. Receiver operating characteristic curves for the age and sex model and the new model in the validation dataset. The new model has an area under the curve (AUC) of 0.732 (95% CI 0.708–0.756), and the age and sex model has an AUC of 0.671 (95% CI 0.646–0.696).

Figure 5

Fig. 3. Calibration plots for the (a) age and sex model and (b) new model in the validation dataset.

Figure 6

Fig. 4. Distribution of probability of severe COVID-19 in all of UK Biobank for (a) the age and sex model and (b) the new model.

Supplementary material: File

Dite et al. supplementary material

Dite et al. supplementary material

Download Dite et al. supplementary material(File)
File 112.6 KB