Hostname: page-component-6766d58669-76mfw Total loading time: 0 Render date: 2026-05-15T20:26:14.137Z Has data issue: false hasContentIssue false

Attach importance of the bootstrap t test against Student's t test in clinical epidemiology: a demonstrative comparison using COVID-19 as an example

Published online by Cambridge University Press:  30 April 2021

Shi Zhao*
Affiliation:
JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, Hong Kong Shenzhen Research Institute of Chinese University of Hong Kong, Shenzhen, China
Zuyao Yang
Affiliation:
JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, Hong Kong
Salihu S. Musa
Affiliation:
Department of Applied Mathematics, Hong Kong Polytechnic University, Hung Hom, Hong Kong
Jinjun Ran
Affiliation:
School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, China
Marc K. C. Chong
Affiliation:
JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, Hong Kong Shenzhen Research Institute of Chinese University of Hong Kong, Shenzhen, China
Mohammad Javanbakht
Affiliation:
Nephrology and Urology Research Center, Baqiyatallah University of Medical Sciences, Tehran, Iran
Daihai He
Affiliation:
Department of Applied Mathematics, Hong Kong Polytechnic University, Hung Hom, Hong Kong
Maggie H. Wang
Affiliation:
JC School of Public Health and Primary Care, Chinese University of Hong Kong, Shatin, Hong Kong Shenzhen Research Institute of Chinese University of Hong Kong, Shenzhen, China
*
Authors for correspondence: Shi Zhao, E-mail: zhaoshi.cmsa@gmail.com; Daihai He, E-mail: daihai.he@polyu.edu.hk
Rights & Permissions [Opens in a new window]

Abstract

Student's t test is valid for statistical inference under the normality assumption or asymptotically. By contrast, although the bootstrap t test was proposed in 1993, it is seldom adopted in medical research. We aim to demonstrate that the bootstrap t test outperforms Student's t test under normality in data. Using random data samples from normal distributions, we evaluated the testing performance, in terms of true-positive rate (TPR) and false-positive rate and diagnostic abilities, in terms of the area under the curve (AUC), of the bootstrap t test and Student's t test. We explore the AUC of both tests with varying sample size and coefficient of variation. We compare the testing outcomes using the COVID-19 serial interval (SI) data in Shenzhen and Hong Kong, China, for demonstration. With fixed TPR, the bootstrap t test maintained the equivalent accuracy in TPR, but significantly improved the true-negative rate from the Student's t test. With varying TPR, the diagnostic ability of bootstrap t test outperformed or equivalently performed as Student's t test in terms of the AUC. The equivalent performances are possible but rarely occur in practice. We find that the bootstrap t test outperforms by successfully detecting the difference in COVID-19 SI, which is defined as the time interval between consecutive transmission generations, due to sex and non-pharmaceutical interventions against the Student's t test. We demonstrated that the bootstrap t test outperforms Student's t test, and it is recommended to replace Student's t test in medical data analysis regardless of sample size.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - SA
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike licence (http://creativecommons.org/licenses/by-nc-sa/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the same Creative Commons licence is included and the original work is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use.
Copyright
Copyright © The Author(s), 2021. Published by Cambridge University Press
Figure 0

Fig. 1. Relations between the testing accuracies of the bootstrap t test (blue) and Student's t test (black), including informedness, TPR and TNR, and the features of the data samples including sample size and CV. Panels (a) and (b) show the relations between informedness ( = TPR + TNR − 1) and sample size and CV respectively. Panels (c) and (d) show the relations between TPR and sample size and CV respectively. Panels (e) and (f) show the relations between TNR and sample size and CV respectively. The CVs (of the data samples) were determined by $\sqrt n \times t_{P = 0.975, {\rm df} = n}^\ast$, where n denotes the sample size, t* is the quantile of the t distribution and ‘df’ is the degree of freedom, in panels (a), (c) and (e). The sample size was fixed to be 30 in panels (b), (d) and (f). The level of α was fixed to be 5% in all panels. The vertical bars in each panel represent the 95% CIs.

Figure 1

Fig. 2. ROC curves and AUCs of the bootstrap t test (blue) and Student's t test (black) with varying sample sizes, n, and CVs of the data samples. The diagonal dashed lines show the testing performance of a random classifier.

Figure 2

Table 1. Summary of the situations to be tested and the recommendation of Student's or Bootstrap t tests

Figure 3

Fig. 3. SIs of the COVID-19 transmission pairs in Shenzhen and Hong Kong, China, during the early outbreaks. The SI with male or female primary case is represented by upward or downward triangle, respectively. The hollow or filled (red for female and blue for male) triangle represents the SI data excluded or included in the t tests, respectively. The green shading area highlights the CLNY.

Supplementary material: File

Zhao et al. supplementary material

Zhao et al. supplementary material

Download Zhao et al. supplementary material(File)
File 13.3 KB