To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
The COVID-19 pandemic creates a challenge for actuaries analysing experience data that include mortality shocks. Without sufficient local flexibility in the time dimension, any analysis based on the most recent data will be biased by the temporarily higher mortality. Also, depending on where the shocks sit in the exposure period, any attempt to identify mortality trends will be distorted. We present a methodology for analysing portfolio mortality data that offer local flexibility in the time dimension. The approach permits the identification of seasonal variation, mortality shocks and occurred-but-not reported deaths (OBNR). The methodology also allows actuaries to measure portfolio-specific mortality improvements. Finally, the method assists actuaries in determining a representative mortality level for long-term applications like reserving and pricing, even in the presence of mortality shocks. Results are given for a mature annuity portfolio in the UK, which suggest that the Bayesian information criterion is better for actuarial model selection in this application than Akaike’s information criterion.
A retrial queue with classical retrial policy, where each blocked customer in the orbit retries for service, and general retrial times is modeled by a piecewise deterministic Markov process (PDMP). From the extended generator of the PDMP of the retrial queue, we derive the associated martingales. These results are used to derive the conditional expected number of customers in the orbit in the transient regime.
Data have played a role in urban mobility policy planning for decades, especially in forecasting demand, but much less in policy evaluations and assessments. The surge in availability and openness of (big) data in the last decade seems to provide new opportunities to meet demand for evidence-based policymaking. This paper reviews how different types of data are employed in assessments published in academic journals by analyzing 74 cases. Our review finds that (a) academic literature has currently provided limited insight in new data developments in policy practice; (b) research shows that the new types of big data provide new opportunities for evidence-based policy-making; however, (c) they cannot replace traditional data usage (surveys and statistics). Instead, combining big data with survey and Geographic Information System data in ex-ante assessments, as well as in developing decision support tools, is found to be the most effective. This could help policymakers not only to get much more insight from policy assessments, but also to help avoid the limitations of one certain type of data. Finally, current research projects are rather data supply-driven. Future research should engage with policy practitioners to reveal best practices, constraints, and potential of more demand-driven data use in mobility policy assessments in practice.
Given a fixed graph H that embeds in a surface $\Sigma$, what is the maximum number of copies of H in an n-vertex graph G that embeds in $\Sigma$? We show that the answer is $\Theta(n^{f(H)})$, where f(H) is a graph invariant called the ‘flap-number’ of H, which is independent of $\Sigma$. This simultaneously answers two open problems posed by Eppstein ((1993) J. Graph Theory17(3) 409–416.). The same proof also answers the question for minor-closed classes. That is, if H is a $K_{3,t}$ minor-free graph, then the maximum number of copies of H in an n-vertex $K_{3,t}$ minor-free graph G is $\Theta(n^{f'(H)})$, where f′(H) is a graph invariant closely related to the flap-number of H. Finally, when H is a complete graph we give more precise answers.
The association between the ABO blood group and the risk of malaria during pregnancy has not been clearly established. The present study summarised relevant knowledge and reassessed the association through meta-analysis. Articles in MEDICINE and PubMed published before 30 November 2021 were searched. Five studies satisfied the inclusion criteria and were enrolled in the meta-analysis. It was shown that primiparae with different ABO blood group, multiparae with blood group A and non-A, AB and non-AB had a comparable risk of malaria. However, multiparae with blood group B had a significantly higher risk than non-B group [odds ratio (OR) = 1.23, 95% confidence interval (CI) was 1.01 to 1.50, P = 0.04], while multiparae with blood group O had a significantly lower risk than non-O group (OR = 0.78, 95% CI was 0.63 to 0.97, P = 0.03). Therefore, the ABO blood group may not result in a different risk of malaria in primiparae. Blood group B is potentially a risk factor while blood group O is a protective factor for multiparae.
We consider a collection of Markov chains that model the evolution of multitype biological populations. The state space of the chains is the positive orthant, and the boundary of the orthant is the absorbing state for the Markov chain and represents the extinction states of different population types. We are interested in the long-term behavior of the Markov chain away from extinction, under a small noise scaling. Under this scaling, the trajectory of the Markov process over any compact interval converges in distribution to the solution of an ordinary differential equation (ODE) evolving in the positive orthant. We study the asymptotic behavior of the quasi-stationary distributions (QSD) in this scaling regime. Our main result shows that, under conditions, the limit points of the QSD are supported on the union of interior attractors of the flow determined by the ODE. We also give lower bounds on expected extinction times which scale exponentially with the system size. Results of this type when the deterministic dynamical system obtained under the scaling limit is given by a discrete-time evolution equation and the dynamics are essentially in a compact space (namely, the one-step map is a bounded function) have been studied by Faure and Schreiber (2014). Our results extend these to a setting of an unbounded state space and continuous-time dynamics. The proofs rely on uniform large deviation results for small noise stochastic dynamical systems and methods from the theory of continuous-time dynamical systems.
In general, QSD for Markov chains with absorbing states and unbounded state spaces may not exist. We study one basic family of binomial-Poisson models in the positive orthant where one can use Lyapunov function methods to establish existence of QSD and also to argue the tightness of the QSD of the scaled sequence of Markov chains. The results from the first part are then used to characterize the support of limit points of this sequence of QSD.
Established methods of recruiting population controls for case–control studies in infectious disease outbreak investigations are resource- and time-intensive, and are often subject to bias. The online panel have recently gained interest as an easy and timely method to select controls. We examined the feasibility, suitability and reliability of using an online panel to select controls for case–control studies as part of investigations of diffuse food and waterborne outbreaks. In January 2019, we deployed a web survey by email to the 277 members of a non-probabilistic online panel in Lower Saxony, Germany. We questioned them on basic sociodemographic characteristics and eating habits. They were frequency matched to cases on sex and age. Their food exposures were compared to those of traditionally recruited controls of four historical case–controls studies, which successfully investigated food and waterborne outbreaks. We used logistic regressions to assess the association between the food exposures and the disease (odds ratios). The use of a control panel successfully led to the identification of the food items in three of the four historical outbreak investigations, and their recruitment benefitted from increased speed and limited costs. Timely outbreak investigations would enable rapidly implementing control measures. We recommend the further evaluation of using panellists as controls in parallel case–control studies and case–panel studies.
Whole-genome sequencing (WGS) has shown tremendous potential in rapid diagnosis of drug-resistant tuberculosis (TB). In the current study, we performed WGS on drug-resistant Mycobacterium tuberculosis isolates obtained from Shanghai (n = 137) and Russia (n = 78). We aimed to characterise the underlying and high-frequency novel drug-resistance-conferring mutations, and also create valuable combinations of resistance mutations with high predictive sensitivity to predict multidrug- and extensively drug-resistant tuberculosis (MDR/XDR-TB) phenotype using a bootstrap method. Most strains belonged to L2.2, L4.2, L4.4, L4.5 and L4.8 lineages. We found that WGS could predict 82.07% of phenotypically drug-resistant domestic strains. The prediction sensitivity for rifampicin (RIF), isoniazid (INH), ethambutol (EMB), streptomycin (STR), ofloxacin (OFL), amikacin (AMK) and capreomycin (CAP) was 79.71%, 86.30%, 76.47%, 88.37%, 83.33%, 70.00% and 70.00%, respectively. The mutation combination with the highest sensitivity for MDR prediction was rpoB S450L + rpoB H445A/P + katG S315T + inhA I21T + inhA S94A, with a sensitivity of 92.17% (0.8615, 0.9646), and the mutation combination with highest sensitivity for XDR prediction was rpoB S450L + katG S315T + gyrA D94G + rrs A1401G, with a sensitivity of 92.86% (0.8158, 0.9796). The molecular information presented here will be of particular value for the rapid clinical detection of MDR- and XDR-TB isolates through laboratory diagnosis.
Serosurveillance is an important epidemiologic tool for severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), used to estimate infection rates and the degree of population immunity. There is no general agreement on which antibody biomarker(s) should be used, especially with the rollout of vaccines globally. Here, we used random forest models to demonstrate that a single spike or receptor-binding domain (RBD) antibody was adequate for classifying prior infection, while a combination of two antibody biomarkers performed better than any single marker for estimating time-since-infection. Nucleocapsid antibodies performed worse than spike or RBD antibodies for classification, but can be useful for estimating time-since-infection, and in distinguishing infection-induced from vaccine-induced responses. Our analysis has the potential to inform the design of serosurveys for SARS-CoV-2, including decisions regarding a number of antibody biomarkers measured.
This paper addresses the task of modeling severity losses using segmentation when the data distribution does not fall into the usual regression frameworks. This situation is not uncommon in lines of business such as third-party liability insurance, where heavy-tails and multimodality often hamper a direct statistical analysis. We propose to use regression models based on phase-type distributions, regressing on their underlying inhomogeneous Markov intensity and using an extension of the expectation–maximization algorithm. These models are interpretable and tractable in terms of multistate processes and generalize the proportional hazards specification when the dimension of the state space is larger than 1. We show that the combination of matrix parameters, inhomogeneity transforms, and covariate information provides flexible regression models that effectively capture the entire distribution of loss severities.
The fact that a large proportion of insurance policyholders make no claims during a one-year period highlights the importance of zero-inflated count models when analyzing the frequency of insurance claims. There is a vast literature focused on the univariate case of zero-inflated count models, while work in the area of multivariate models is considerably less advanced. Given that insurance companies write multiple lines of insurance business, where the claim counts on these lines of business are often correlated, there is a strong incentive to analyze multivariate claim count models. Motivated by the idea of Liu and Tian (Computational Statistics and Data Analysis, 83, 200–222; 2015), we develop a multivariate zero-inflated hurdle model to describe multivariate count data with extra zeros. This generalization offers more flexibility in modeling the behavior of individual claim counts while also incorporating a correlation structure between claim counts for different lines of insurance business. We develop an application of the expectation–maximization (EM) algorithm to enable the statistical inference necessary to estimate the parameters associated with our model. Our model is then applied to an automobile insurance portfolio from a major insurance company in Spain. We demonstrate that the model performance for the multivariate zero-inflated hurdle model is superior when compared to several alternatives.
This study investigated the characteristics of transmission routes of COVID-19 cluster infections (⩾10 linked cases within a short period) in Gangwon Province between 22 February 2020 and 31 May 2021. Transmission routes were divided into five major categories and 35 sub-categories according to the relationship between the infector and the infectee and the location of transmission. A total of 61 clusters occurred during the study period, including 1741 confirmed cases (55.7% of all confirmed cases (n = 3125)). The the five major routes of transmission were as follows: ‘using (staying in) the same facility (50.7%), ‘cohabiting family members’ (23.3%), ‘social gatherings with acquaintances’ (10.8%), ‘other transmission routes’ (7.0%), and ‘social gatherings with non-cohabiting family members/relatives’ (5.5%). For transmission caused by using (staying in) the same facility, the highest number of confirmed cases was associated with churches, followed by medical institutions (inpatient), sports facilities, military bases, offices, nightlife businesses, schools, restaurants, day-care centres and kindergarten, and service businesses. Our analysis highlights specific locations with frequent transmission of infections, and transmission routes that should be targeted in situations where adherence to disease control rules is difficult.
Since the start of the coronavirus disease-2019 (COVID-19) pandemic, there has been interest in using wastewater monitoring as an approach for disease surveillance. A significant uncertainty that would improve the interpretation of wastewater monitoring data is the intensity and timing with which individuals shed RNA from severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) into wastewater. By combining wastewater and case surveillance data sets from a university campus during a period of heightened surveillance, we inferred that individual shedding of RNA into wastewater peaks on average 6 days (50% uncertainty interval (UI): 6–7; 95% UI: 4–8) following infection, and that wastewater measurements are highly overdispersed [negative binomial dispersion parameter, k = 0.39 (95% credible interval: 0.32–0.48)]. This limits the utility of wastewater surveillance as a leading indicator of secular trends in SARS-CoV-2 transmission during an epidemic, and implies that it could be most useful as an early warning of rising transmission in areas where transmission is low or clinical testing is delayed or of limited capacity.
It is unclear if – after symptom onset of a primary case of coronavirus disease-2019 (COVID-19) in a household – ensuing chains of transmissions among household members occur and if household epidemiology of COVID-19 is modified by the different circulating variants. We analysed data of 52 774 household clusters to investigate the day of symptom onset of ensuing cases in households relative to the symptom onset of the primary case within the household. Irrespective of cluster size or age of the primary case, 95% of all secondary household cases had symptom onset within 14 days after the symptom onset of the primary case. Stratification by variant showed that the mean interval from symptom onset of the primary case to the symptom onset of secondary cases decreased significantly from 4.8 days (wildtype) to 4.5 days (alpha) and 4.0 days (delta). Similarly, the cumulative proportion of 95% of secondary cases occurred within 14 days (wild type), 12 days (alpha) and 10 days (delta). Our findings suggest that during dominant delta circulation – apart from rare individual constellations – a 10-day household quarantine after symptom onset of the primary case is sufficient for household contacts who remain COVID-free.
Rabies, a fatal and vaccine-preventable disease, is endemic throughout Africa. In 2016, a rabies outbreak occurred in black-backed jackals (Canis mesomelas) along the western boundary of Gauteng Province, South Africa. We investigated the possible drivers of the 2016 outbreak and established its origin. Using spatio-temporal locations of cases, we applied logistic regression and Geographic Information System techniques to investigate environmental covariates driving occurrences of emerging rabies cases in Gauteng Province. About 53.8% of laboratory-confirmed lyssaviruses in Gauteng Province in 2016 originated from jackals. Phylogenetic trees reconstructed from a partial region of the glycoprotein gene of these and historical rabies viruses (RABVs) demonstrated the lyssaviruses to be of canid origin with 97.7% nucleotide sequence similarity. The major cluster comprised jackal RABVs from the 2012 KwaZulu/Natal outbreak and the 2016 outbreak in Gauteng Province. The second cluster was composed of both jackal and dog RABVs. Both clusters correlated with independent RABV introductions into Gauteng by dogs and jackals, respectively. This study demonstrated an expansion of a jackal rabies cycle from north-west Province into Gauteng Province during the 2016 dry period, as jackals ranged widely in search for food resources leading to increased jackal-dog interactions, reminiscent of the intricate links of domestic and wildlife rabies cycles in South Africa.
This essential reference for students and scholars in the input-output research and applications community has been fully revised and updated to reflect important developments in the field. Expanded coverage includes construction and application of multiregional and interregional models, including international models and their application to global economic issues such as climate change and international trade; structural decomposition and path analysis; linkages and key sector identification and hypothetical extraction analysis; the connection of national income and product accounts to input-output accounts; supply and use tables for commodity-by-industry accounting and models; social accounting matrices; non-survey estimation techniques; and energy and environmental applications. Input-Output Analysis is an ideal introduction to the subject for advanced undergraduate and graduate students in many scholarly fields, including economics, regional science, regional economics, city, regional and urban planning, environmental planning, public policy analysis and public management.
We determine the asymptotics of the number of independent sets of size $\lfloor \beta 2^{d-1} \rfloor$ in the discrete hypercube $Q_d = \{0,1\}^d$ for any fixed $\beta \in (0,1)$ as $d \to \infty$, extending a result of Galvin for $\beta \in (1-1/\sqrt{2},1)$. Moreover, we prove a multivariate local central limit theorem for structural features of independent sets in $Q_d$ drawn according to the hard-core model at any fixed fugacity $\lambda>0$. In proving these results we develop several general tools for performing combinatorial enumeration using polymer models and the cluster expansion from statistical physics along with local central limit theorems.
In this article, we focus on data trust and data privacy, and how attitudes may be changing during the COVID-19 period. On balance, it appears that Australians are more trusting of organizations with regards to data privacy and less concerned about their own personal information and data than they were prior to the spread of COVID-19. The major determinant of this change in trust with regards to data was changes in general confidence in government institutions. Despite this improvement in trust with regards to data privacy, trust levels are still low.