Hostname: page-component-89b8bd64d-mmrw7 Total loading time: 0 Render date: 2026-05-05T11:07:02.287Z Has data issue: false hasContentIssue false

Sunspot activity and influenza pandemics: a statistical assessment of the purported association

Published online by Cambridge University Press:  29 August 2017

S. TOWERS*
Affiliation:
Simon A. Levin Mathematical, Computational and Modelling Sciences Center, Arizona State University, Tempe, AZ, USA
*
*Author for correspondence: S. Towers, Simon A. Levin Mathematical, Computational and Modelling Sciences Center, Arizona State University, Tempe, AZ, USA. (Email: smtowers@asu.edu)
Rights & Permissions [Opens in a new window]

Summary

Since 1978, a series of papers in the literature have claimed to find a significant association between sunspot activity and the timing of influenza pandemics. This paper examines these analyses, and attempts to recreate the three most recent statistical analyses by Ertel (1994), Tapping et al. (2001), and Yeung (2006), which all have purported to find a significant relationship between sunspot numbers and pandemic influenza. As will be discussed, each analysis had errors in the data. In addition, in each analysis arbitrary selections or assumptions were also made, and the authors did not assess the robustness of their analyses to changes in those arbitrary assumptions. Varying the arbitrary assumptions to other, equally valid, assumptions negates the claims of significance. Indeed, an arbitrary selection made in one of the analyses appears to have resulted in almost maximal apparent significance; changing it only slightly yields a null result. This analysis applies statistically rigorous methodology to examine the purported sunspot/pandemic link, using more statistically powerful un-binned analysis methods, rather than relying on arbitrarily binned data. The analyses are repeated using both the Wolf and Group sunspot numbers. In all cases, no statistically significant evidence of any association was found. However, while the focus in this particular analysis was on the purported relationship of influenza pandemics to sunspot activity, the faults found in the past analyses are common pitfalls; inattention to analysis reproducibility and robustness assessment are common problems in the sciences, that are unfortunately not noted often enough in review.

Information

Type
Review
Copyright
Copyright © Cambridge University Press 2017 
Figure 0

Table 1. Summary of influenza outbreaks from 1700 to 1977 labelled as pandemics, listed by Morens and Taubenberger, Mamelund, Lattanzi, Hampson and Mackenzie, Potter, Garrett, Beveridge, Kilbourne, Pyle, and Patterson [4–13]. The pandemic year of 2009 was also included in the data

Figure 1

Fig. 1. Wolf and Group sunspots by year from 1700 to 2014. Overlaid is the timing of purported pandemics, as listed by Morens and Taubenberger, Mamelund, Lattanzi, Hampson and Mackenzie, Potter, Garrett, Beveridge, Kilbourne, Pyle, and Patterson [4–13], with the points coloured and sized relative to the number of reviewers agreeing on the date.

Figure 2

Fig. 2. The results of the analyses of pandemic data as listed by Morens and Taubenberger, Mamelund, Lattanzi, Hampson and Mackenzie, Potter, Garrett, Beveridge, Kilbourne, Pyle, and Patterson [4–13], assessed using the Wolf and Group sunspot numbers (top row and bottom row of plots, respectively). The plots show the P-values assessed by the analyses, vs. the minimum number of reviewers agreeing that an outbreak or pandemic occurred. The first plot in the top and bottom rows shows the Binomial probability of the observed fraction of outbreak years within ±1 year of a maximum in sunspot activity, given the expected fraction for all years (and similarly for the years within ±1 year of an extremum in sunspot activity). The plots in the second, third, and fourth columns, respectively, show the re-creation of the Ertel, Tapping et al., and Yeung analyses [25–27], respectively, with the use of the Kolmogorov–Smirnov and Anderson–Darling tests to assess significance.

Figure 3

Table 2. Summary of influenza pandemics from 1700 to 1977, listed by Morens and Taubenberger, Mamelund, Lattanzi, Hampson and Mackenzie, Potter, Garrett, Beveridge, Kilbourne, Pyle, and Patterson [4–13]. In this analysis, the pandemic year of 2009 is also included in the data

Figure 4

Table 3. Re-creation of Table 1 of Ertel [25]. The data were taken from Assaad, Beveridge et al., Hoyle, Collier's Encyclopedia, Creighton, Hope-Simpson, Patterson, Pyle, Silverstein, and Tschijewsky [2, 12, 13, 20, 33, 37–39, 62, 63]. The */** before/after a year indicates years that Ertel [25] identified to be within ±1 year of a minimum/maximum in solar activity. However, note that there are several errors in the data transcription from the sources (see text for details), and several sources list both pandemics and outbreaks, not just pandemics. Correctly transcribed data for pandemics only are shown in Table 2

Figure 5

Table 4. Re-creation of Table 1 in Tapping et al. [26], showing the years they considered as pandemic years in their analysis. In several cases, shown in blue, outbreaks clearly designated by the source as not being a pandemic year were included in the data. In addition, in the several instances indicated in red, the phase, ϕ, was incorrectly calculated

Figure 6

Table 5. Re-creation of Table 1 in Yeung [27], showing the years Yeung considered as pandemic years in the analysis. In several cases, indicated in red, the data are mis-transcribed from the original sources

Figure 7

Fig. 3. Apparent significance of the results of the Yeung [27] analysis, as a function of the percentile value used to assess the cutoff in sunspot number, SSN. The result of Yeung [27] was obtained with an SSN percentile cutoff of 60% (0·6), which resulted in almost maximal apparent significance. Note that the Group sunspot numbers do not yield a significant result for any percentile used to assess the cutoff in SSN.