To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
As the COVID-19 pandemic continues to escalate and place pressure on hospital system resources, a proper screening and risk stratification score is essential. We aimed to develop a risk score to identify patients with increased risk of COVID-19, allowing proper identification and allocation of limited resources. A retrospective study was conducted of 338 patients who were admitted to the hospital from the emergency room to regular floors and tested for COVID-19 at an acute care hospital in the Metropolitan Washington D.C. area. The dataset was split into development and validation sets with a ratio of 6:4. Demographics, presenting symptoms, sick contact, triage vital signs, initial laboratory and chest X-ray results were analysed to develop a prediction model for COVID-19 diagnosis. Multivariable logistic regression was performed in a stepwise fashion to develop a prediction model, and a scoring system was created based on the coefficients of the final model. Among 338 patients admitted to the hospital from the emergency room, 136 (40.2%) patients tested positive for COVID-19 and 202 (59.8%) patients tested negative. Sick contact with suspected or confirmed COVID-19 case (3 points), nursing facility residence (3 points), constitutional symptom (1 point), respiratory symptom (1 point), gastrointestinal symptom (1 point), obesity (1 point), hypoxia at triage (1 point) and leucocytosis (−1 point) were included in the prediction score. A risk score for COVID-19 diagnosis achieved area under the receiver operating characteristic curve of 0.87 (95% confidence interval (CI) 0.82–0.92) in the development dataset and 0.85 (95% CI 0.78–0.92) in the validation dataset. A risk prediction score for COVID-19 can be used as a supplemental tool to assist clinical decision to triage, test and quarantine patients admitted to the hospital from the emergency room.
We propose a new approach to mortality prediction under survival energy hypothesis (SEH). We assume that a human is born with initial energy, which changes stochastically in time and the human dies when the energy vanishes. Then, the time of death is represented by the first hitting time of the survival energy (SE) process to zero. This study assumes that SE follows a time-inhomogeneous diffusion process and defines the mortality function, which is the first hitting time distribution function of the SE process. Although SEH is a fictitious construct, we illustrate that this assumption has the potential to yield a good parametric family of cumulative probability of death, and the parametric family yields surprisingly good predictions for future mortality rates.
This study used hospital records from two time periods to understand the implication of COVID-19 on hospital-based deaths in Burundi. The place of COVID-19 symptoms was sought among deaths that occurred from January to May 2020 (during the pandemic) vs. January to May 2019 (before the pandemic). First, death proportions were tested to seize differences between mortality rates for each month in 2020 vs. 2019. In the second time, we compared mean time-to-death between the two periods using the Kaplan–Meier survival curve. Finally, a logistic regression was fitted to assess the likelihood of dying from COVID-19 symptoms between the two periods. We found statistical evidence of a higher death rate in May 2020 as compared to May 2019. Moreover, death occurred faster in 2020 (mean = 6.7 days, s.d. = 8.9) than in 2019 (mean = 7.8 days, s.d. = 10.9). Unlike in 2019, being a male was significantly associated with a much lower likelihood of dying with one or more COVID-19 symptom(s) in 2020 (odds ratio 0.35, 95% confidence interval 0.14–0.87). This study yielded some evidence for a possible COVID-19-related hospital-based mortality trend for May 2020. However, considering the time-constraint of the study, further similar studies over a longer period of time need to be conducted to trace a clearer picture on COVID-19 implication on hospital-based deaths in Burundi.
One of the largest nationwide bursts of the first COVID-19 outbreak occurred in Spain, where infection expanded in densely populated areas through March 2020. We analyse the cumulative growth curves of reported cases and deaths in all Spain and two highly populated regions, Madrid and Catalonia, identifying changes and sudden shifts in their exponential growth rate through segmented Poisson regressions. We associate these breakpoints with a timeline of key events and containment measures, and data on policy stringency and citizen mobility. Results were largely consistent for infections and deaths in all territories, showing four major shifts involving 19–71% reductions in growth rates originating from infections before 3 March and on 5–8, 10–12 and 14–18 March, but no identifiable effect of the strengthened lockdown of 29–30 March. Changes in stringency and mobility were only associated to the latter two shifts, evidencing an early deceleration in COVID-19 spread associated to personal hygiene and social distancing recommendations, followed by a stronger decrease when lockdown was enforced, leading to the contention of the outbreak by mid-April. This highlights the importance of combining public health communication strategies and hard confinement measures to contain epidemics.
We analyze a mean field game model of SIR dynamics (Susceptible, Infected, and Recovered) where players choose when to vaccinate. We show that this game admits a unique mean field equilibrium (MFE) that consists in vaccinating at a maximal rate until a given time and then not vaccinating. The vaccination strategy that minimizes the total cost has the same structure as the MFE. We prove that the vaccination period of the MFE is always smaller than the one minimizing the total cost. This implies that, to encourage optimal vaccination behavior, vaccination should always be subsidized. Finally, we provide numerical experiments to study the convergence of the equilibrium when the system is composed by a finite number of agents ($N$) to the MFE. These experiments show that the convergence rate of the cost is $1/N$ and the convergence of the switching curve is monotone.
The relationship between zoo animals, particularly nonhuman primates, and visitors is complex and varies by species. Adding complexity to this relationship is the trend for zoos to host events outside of normal operating hours. Here, we explored whether a late-night haunted-house style event influenced the behavior of spider monkeys. We conducted behavioral observations both on event nights and nights without the event. The spider monkeys were active and outside more frequently on event nights compared to the control nights indicating that their typical nighttime behavior was altered. However, it is difficult to definitively conclude whether the behavioral changes were a result of the event being aversive or enriching. Our findings suggest that zoos should conduct behavioral observations of and collect physiological data from their animals, especially if they are sensitive to environmental changes, when implementing new events, including those occurring outside of normal operating hours to ensure high levels of animal welfare.
The Erdős–Simonovits stability theorem states that for all ε > 0 there exists α > 0 such that if G is a Kr+1-free graph on n vertices with e(G) > ex(n, Kr+1)– α n2, then one can remove εn2 edges from G to obtain an r-partite graph. Füredi gave a short proof that one can choose α = ε. We give a bound for the relationship of α and ε which is asymptotically sharp as ε → 0.
Spatial analysis ranges from simple univariate descriptive statistics to complex multivariate analyses and is typically used to investigate spatial patterns or to identify spatially linked consumer behaviours in insurance. This paper investigates if the incorporation of publicly available spatially linked demographic census data at population level is useful in modelling customers’ lapse behaviour (i.e. stopping payment of premiums) in life insurance policies, based on data provided by an insurance company in Ireland. From the insurance company’s perspective, identifying and assessing such lapsing risks in advance permit engagement to prevent such incidents, saving money by re-evaluating customer acquisition channels and improving capital reserve calculation and preparation. Incorporating spatial analysis in lapse modelling is expected to improve lapse prediction. Therefore, a hybrid approach to lapse prediction is proposed – spatial clustering using census data is used to reveal the underlying spatial structure of customers of the Irish life insurer, in conjunction with traditional statistical models for lapse prediction based on the company data. The primary contribution of this work is to consider the spatial characteristics of customers for life insurance lapse behaviour, via the integration of reliable government provided census demographics, which has not been considered previously in actuarial literature. Company decision-makers can use the insights gleaned from this analysis to identify customer subsets to target with personalized promotions to reduce lapse rates, and to reduce overall company risk.
This paper establishes central limit theorems (CLTs) and proposes how to perform valid inference in factor models. We consider a setting where many counties/regions/assets are observed for many time periods, and when estimation of a global parameter includes aggregation of a cross-section of heterogeneous microparameters estimated separately for each entity. The CLT applies for quantities involving both cross-sectional and time series aggregation, as well as for quadratic forms in time-aggregated errors. This paper studies the conditions when one can consistently estimate the asymptotic variance, and proposes a bootstrap scheme for cases when one cannot. A small simulation study illustrates performance of the asymptotic and bootstrap procedures. The results are useful for making inferences in two-step estimation procedures related to factor models, as well as in other related contexts. Our treatment avoids structural modeling of cross-sectional dependence but imposes time-series independence.
We propose a multiplex interdependent durations model and study its empirical content. The model considers an empirical stopping game of multiple agents making optimal timing decisions with incomplete information. We characterize the unique Bayesian Nash equilibrium of the stopping game in a system of simultaneous equations involving the conditional distribution of each duration with a moderate strategic interaction condition. The system of nonlinear simultaneous equations allows us to obtain constructive identification results of the interaction effects and other nonparametric model primitives. We propose two consistent semiparametric estimation methods based on different parameterizations of modeling components with right-censored duration data.
In this paper, we discuss how the notion of subgeometric ergodicity in Markov chain theory can be exploited to study stationarity and ergodicity of nonlinear time series models. Subgeometric ergodicity means that the transition probability measures converge to the stationary measure at a rate slower than geometric. Specifically, we consider suitably defined higher-order nonlinear autoregressions that behave similarly to a unit root process for large values of the observed series but we place almost no restrictions on their dynamics for moderate values of the observed series. Results on the subgeometric ergodicity of nonlinear autoregressions have previously appeared only in the first-order case. We provide an extension to the higher-order case and show that the autoregressions we consider are, under appropriate conditions, subgeometrically ergodic. As useful implications, we also obtain stationarity and$\beta $-mixing with subgeometrically decaying mixing coefficients.
This book looks at how numbers and statistics have been used to underpin quality in news reporting. In doing so, the aim is to challenge some common assumptions about how journalists engage and use statistics in their quest for quality news. It seeks to improve our understanding about the usage of data and statistics as a primary means for the construction of social reality. This is a task, in our view, that is urgent in times of 'post-truth' politics and the rise of 'fake news'. In this sense, the quest to produce 'quality' news, which seems to require incorporating statistics and engaging with data, as laudable and straightforward as it sounds, is instead far more problematic and complex than what is often accounted for.
Building upon the success of the first edition, Statistics Using Stata uses the latest version of Stata to meet the needs of today's students. Engaging and accessible for students from a variety of mathematical backgrounds, this textbook integrates statistical concepts with the Stata (version 16) software package. It aligns Stata commands with examples based on real data, enabling students to understand statistics in a way that reflects statistical practice. Capitalizing on Stata's menu-driven 'point and click' and program syntax interface, the chapters guide students from the comfortable 'point and click' environment to the beginnings of statistical programming. Its coverage of essential topics gives instructors flexibility in curriculum planning and provides students with more advanced material to prepare for future work. Online resources - including solutions to exercises, PowerPoint slides, and Stata syntax (do-files) for each chapter - allow students to review independently and adapt code to analyze new problems.
The prevalence of Chagas disease has decreased in the Americas region due to vector control measures. However, non-vectorial transmission through blood transfusions and organ transplantation has gained importance in recent years. Screening among blood and organ donors are essential to reduce Trypanosoma cruzi transmission and could provide information to estimate population prevalence. We conducted a cross-sectional study on the prevalence of immunoglobulin G (IgG) antibodies against T. cruzi in healthy blood donors, solid organ donors and heart transplant recipients from 2012 to 2019. We found a total of 99 357 IgG T. cruzi results during the study period. The cumulative seroprevalence in healthy blood donors was 0.13% (95% confidence interval (CI) 0.10–0.15), in organ donors was 0.53% (95% CI 0.06–1.92) and in heart transplant recipients was 3.03 (95% CI 0.07–15.75). Seroprevalence trend in healthy blood donors showed annual increase between 2012 and 2015, decreasing in the following years. No trend was seen in organ donors neither heart recipients. Adjusted rates did not show difference by sex and age among blood donors. No significant increases in seroprevalence T. cruzi were found during the study period. T. cruzi transmission remains low.
Although the progression of invasive aspergillosis (IA) shares some risk factors in the development of active pulmonary tuberculosis (PTB), however, the prevalence of IA in suspected PTB remains unclear. During a period of 1 year (from January 2016 to December 2016), consecutive patients with suspected PTB were included in a referral TB hospital. Data, including demographic information and underlying diseases, were collected from medical records. PTB were all confirmed by mycobacterial culture (Lowenstein–Jensen medium). IA were diagnosed as proven or probable according to the criteria of the 2008 EORTC/MSG definitions. A descriptive analysis was performed to estimate the corresponding prevalence. During the study year, 1507 patients have a positive mycobacterial culture, with a mean age of 45.6 (s.d. 19.9) years old and a female:male ratio of 1:4. Among the 82 patients with non-tuberculous mycobacterial diseases, two patients (2.44%, 95% CI 0.67–8.46%) were diagnosed as IA (one proven and one probable); two probable IA patients (0.15%, 95% CI 0.04–0.55%) were diagnosed in PTB patients (n = 1315), and all were retreatment cases. In addition, all four IA patients (100%) exhibited cavities in both lobes on radiograph. In China, the prevalence of IA is low in active PTB patients. However, when high-risk factors for IA are encountered in PTB patients, further investigations are required and empirically treatment for IA might be warranted.
Drift analysis is one of the state-of-the-art techniques for the runtime analysis of randomized search heuristics (RSHs) such as evolutionary algorithms (EAs), simulated annealing, etc. The vast majority of existing drift theorems yield bounds on the expected value of the hitting time for a target state, for example the set of optimal solutions, without making additional statements on the distribution of this time. We address this lack by providing a general drift theorem that includes bounds on the upper and lower tail of the hitting time distribution. The new tail bounds are applied to prove very precise sharp-concentration results on the running time of a simple EA on standard benchmark problems, including the class of general linear functions. On all these problems, the probability of deviating by an r-factor in lower-order terms of the expected time decreases exponentially with r. The usefulness of the theorem outside the theory of RSHs is demonstrated by deriving tail bounds on the number of cycles in random permutations. All these results handle a position-dependent (variable) drift that was not covered by previous drift theorems with tail bounds. Finally, user-friendly specializations of the general drift theorem are given.
The aim was to analyse invasive pneumococcal disease (IPD) serotypes in children aged ⩽17 years according to clinical presentation and antimicrobial susceptibility. We conducted a prospective study (January 2012–June 2016). IPD cases were diagnosed by culture and/or real-time polymerase chain reaction (PCR). Demographic, microbiological and clinical data were analysed. Associations were assessed using the odds ratio (OR) and 95% confidence intervals (CI). Of the 253 cases, 34.4% were aged <2 years, 38.7% 2–4 years and 26.9% 5–17 years. Over 64% were 13-valent pneumococcal conjugate vaccine (PCV13) serotypes. 48% of the cases were diagnosed only by real-time PCR. Serotypes 3 and 1 were associated with complicated pneumonia (P < 0.05) and non-PCV13 serotypes with meningitis (OR 7.32, 95% CI 2.33–22.99) and occult bacteraemia (OR 3.6, 95% CI 1.56–8.76). Serotype 19A was more frequent in children aged <2 years and serotypes 3 and 1 in children aged 2–4 years and 5–17 years, respectively. 36.1% of cases were not susceptible to penicillin and 16.4% were also non-susceptible to cefotaxime. Serotypes 14, 24F and 23B were associated with non-susceptibility to penicillin (P < 0.05) and serotypes 11, 14 and 19A to cefotaxime (P < 0.05). Serotype 19A showed resistance to penicillin (P = 0.002). In conclusion, PCV13 serotypes were most frequent in children aged ⩽17 years, mainly serotypes 3, 1 and 19A. Non-PCV13 serotypes were associated with meningitis and occult bacteraemia and PCV13 serotypes with pneumonia. Non-susceptibility to antibiotics of non-PCV13 serotypes should be monitored.
We prove two estimates for the expectation of the exponential of a complex function of a random permutation or subset. Using this theory, we find asymptotic expressions for the expected number of copies and induced copies of a given graph in a uniformly random graph with degree sequence(d1, …, dn) as n→ ∞. We also determine the expected number of spanning trees in this model. The range of degrees covered includes dj= λn + O(n1/2+ε) for some λ bounded away from 0 and 1.
Vaccination has reduced the disease burden of vaccine-preventable diseases. However, the extent to which seasonal cycles of immunity could influence vaccine-induced immunity is not well understood. A national cross-sectional serosurveillance study performed in the Netherlands (Pienter-2) yielded data to investigate whether season of vaccination was associated with antibody responses induced by DT-IPV (diphtheria, tetanus and poliomyelitis), MMR (measles, mumps and rubella) and meningococcus C (MenC) vaccines in children. In total, 434 children met the inclusion criteria to study DT-IPV immunity, 811 for MMR and 311 for MenC. Differences in log(antibody levels) by season of vaccination were investigated with linear multivariable regression analyses. Seroconversion rates varied according to season of vaccination for rubella (90% of autumn-vaccinated children vs. 99% of winter-vaccinated had concentrations above cut-off levels). Summer-vaccinated boys showed a slower decline of tetanus antibodies (6% per month), in comparison with winter-vaccinated boys. In conclusion, season of vaccination showed little association with immunological protection. However, a number of associations were seen with a P-value of about 0.03; and adding data from a just-completed nationwide serological study might add more power to the current study. Further immunological and longitudinal investigations could help understand the mechanisms of seasonal influence in vaccine-induced responses.