To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Supreme audit institutions (SAIs) are touted as an integral component to anticorruption efforts in developing nations. SAIs review governmental budgets and report fiscal discrepancies in publicly available audit reports. These documents contain valuable information on budgetary discrepancies, missing resources, or may even report fraud and corruption. Existing research on anticorruption efforts relies on information published by national-level SAIs while mostly ignoring audits from subnational SAIs because their information is not published in accessible formats. I collect publicly available audit reports published by a subnational SAI in Mexico, the Auditoria Superior del Estado de Sinaloa, and build a pipeline for extracting the monetary value of discrepancies detected in municipal budgets. I systematically convert scanned documents into machine-readable text using optical character recognition, and I then train a classification model to identify paragraphs with relevant information. From the relevant paragraphs, I extract the monetary values of budgetary discrepancies by developing a named entity recognizer that automates the identification of this information. In this paper, I explain the steps for building the pipeline and detail the procedures for replicating it in different contexts. The resulting dataset contains the official amounts of discrepancies in municipal budgets for the state of Sinaloa. This information is useful to anticorruption policymakers because it quantifies discrepancies in municipal spending potentially motivating reforms that mitigate misappropriation. Although I focus on a single state in Mexico, this method can be extended to any context where audit reports are publicly available.
Persons experiencing homelessness (PEH) or rough sleeping are a vulnerable population, likely to be disproportionately affected by the coronavirus disease 2019 (COVID-19) pandemic. The impact of COVID-19 infection on this population is yet to be fully described in England. We present a novel method to identify COVID-19 cases in this population and describe its findings. A phenotype was developed and validated to identify PEH or rough sleeping in a national surveillance system. Confirmed COVID-19 cases in England from March 2020 to March 2022 were address-matched to known homelessness accommodations and shelters. Further cases were identified using address-based indicators, such as NHS pseudo postcodes. In total, 1835 cases were identified by the phenotype. Most were <39 years of age (66.8%) and male (62.8%). The proportion of cases was highest in London (29.8%). The proportion of cases of a minority ethnic background and deaths were disproportionality greater in this population, compared to all COVID-19 cases in England. This methodology provides an approach to track the impact of COVID-19 on a subset of this population and will be relevant to policy making. Future surveillance systems and studies may benefit from this approach to further investigate the impact of COVID-19 and other diseases on select populations.
A quantification of the financial implications of the design of a funded, collective defined contribution (CDC) pension scheme is presented and illustrated. It is done through an attribution analysis, which allows the importance of various elements of CDC scheme design to be determined. The model of a CDC scheme analysed is based lightly on the first CDC scheme set to be approved in the UK. In the CDC scheme analysed, contributions are fixed and the initial benefit accrued by each contribution is fixed. Once accrued, benefits are subsequently adjusted annually in response to changes in assumptions and returns. An attribution of the benefit payments shows that this design gives higher benefits to the first generations and lower benefits to the last generations, for a scheme which starts with no members. The contributions paid also affect the balance of benefits paid between generations. Too high a contribution is to the advantage of the first generations. Too low a contribution is in the interests of the later generations. The conclusion, within the simple model considered, is that a constant benefit accrual is an important design choice. Its financial consequences across all generations should be carefully analysed, if it is intended to be implemented. Additionally, contributions should be reviewed regularly in such a CDC scheme, to ensure that cross-subsidies are not borne excessively by particular generations.
The notion of ordered system signature, originally defined for independent and identical coherent systems, is first extended to the case of independent and non-identical coherent systems, and then some key properties that help simplify its computation are established. Through its use, a dynamic ordered system signature is defined next, which facilitates a systematic study of dynamic properties of several coherent systems under a life test. The theoretical results established here are then illustrated through some specific examples. Finally, the usefulness in the evaluation of aging used systems of the concepts introduced is demonstrated.
Quantifying tail dependence is an important issue in insurance and risk management. The prevalent tail dependence coefficient (TDC), however, is known to underestimate the degree of tail dependence and it does not capture non-exchangeable tail dependence since it evaluates the limiting tail probability only along the main diagonal. To overcome these issues, two novel tail dependence measures called the maximal tail concordance measure (MTCM) and the average tail concordance measure (ATCM) are proposed. Both measures are constructed based on tail copulas and possess clear probabilistic interpretations in that the MTCM evaluates the largest limiting probability among all comparable rectangles in the tail, and the ATCM is a normalized average of these limiting probabilities. In contrast to the TDC, the proposed measures can capture non-exchangeable tail dependence. Analytical forms of the proposed measures are also derived for various copulas. A real data analysis reveals striking tail dependence and tail non-exchangeability of the return series of stock indices, particularly in periods of financial distress.
Today, technological developments are ever-growing yet fragmented. Alongside inconsistent digital approaches and attitudes across city administrations, such developments have made it difficult to reap the benefits of city digital twins. Bringing together experiences from five research projects, this paper discusses these digital twins based on two digital integration methodologies—systems and semantic integration. We revisit the nature of the underlying technologies, and their implications for interoperability and compatibility in the context of planning processes and smart urbanism. Semantic approaches present a new opportunity for bidirectional data flows that can inform both governance processes and technological systems to co-create, cross-pollinate, and support optimal outcomes. Building on this opportunity, we suggest that considering the technological dimension as a new addition to the trifecta of economic, environmental, and social sustainability goals that guide planning processes, can aid governments to address this conundrum of fragmentation, interoperability, and compatibility.
The principle of maximum entropy is a well-known approach to produce a model for data-generating distributions. In this approach, if partial knowledge about the distribution is available in terms of a set of information constraints, then the model that maximizes entropy under these constraints is used for the inference. In this paper, we propose a new three-parameter lifetime distribution using the maximum entropy principle under the constraints on the mean and a general index. We then present some statistical properties of the new distribution, including hazard rate function, quantile function, moments, characterization, and stochastic ordering. We use the maximum likelihood estimation technique to estimate the model parameters. A Monte Carlo study is carried out to evaluate the performance of the estimation method. In order to illustrate the usefulness of the proposed model, we fit the model to three real data sets and compare its relative performance with respect to the beta generalized Weibull family.
Let $\{X_n\}_{n\in{\mathbb{N}}}$ be an ${\mathbb{X}}$-valued iterated function system (IFS) of Lipschitz maps defined as $X_0 \in {\mathbb{X}}$ and for $n\geq 1$, $X_n\;:\!=\;F(X_{n-1},\vartheta_n)$, where $\{\vartheta_n\}_{n \ge 1}$ are independent and identically distributed random variables with common probability distribution $\mathfrak{p}$, $F(\cdot,\cdot)$ is Lipschitz continuous in the first variable, and $X_0$ is independent of $\{\vartheta_n\}_{n \ge 1}$. Under parametric perturbation of both F and $\mathfrak{p}$, we are interested in the robustness of the V-geometrical ergodicity property of $\{X_n\}_{n\in{\mathbb{N}}}$, of its invariant probability measure, and finally of the probability distribution of $X_n$. Specifically, we propose a pattern of assumptions for studying such robustness properties for an IFS. This pattern is implemented for the autoregressive processes with autoregressive conditional heteroscedastic errors, and for IFS under roundoff error or under thresholding/truncation. Moreover, we provide a general set of assumptions covering the classical Feller-type hypotheses for an IFS to be a V-geometrical ergodic process. An accurate bound for the rate of convergence is also provided.
Surveillance is a key public health function to enable early detection of infectious disease events and inform public health action. Data linkage may improve the depth of data for response to infectious disease events. This study aimed to describe the uses of linked data for infectious disease events. A systematic review was conducted using Pubmed, CINAHL and Web of Science. Studies were included if they used data linkage for an acute infectious disease event (e.g. outbreak of disease). We summarised the event, study aims and designs; data sets; linkage methods; outcomes reported; and benefits and limitations. Fifty-four studies were included. Uses of linkage for infectious disease events included assessment of severity of disease and risk factors; improved case finding and contact tracing; and vaccine uptake, safety and effectiveness. The ability to conduct larger scale population level studies was identified as a benefit, in particular for rarer exposures, risk factors or outcomes. Limitations included timeliness, data quality and inability to collect additional variables. This review demonstrated multiple uses of data linkage for infectious disease events. As infectious disease events occur without warning, there is a need to establish pre-approved protocols and the infrastructure for data-linkage to enhance information available during an event.
Under the influence of γ-quanta (60Co, P = 9.276 rad/s, T = 300 K), the amount, formation rate, and radiation-chemical yield of molecular hydrogen obtained from the radiolysis process that changes the mass of water (m = 0.0001 ÷ 0.8 g) have been defined in the created nano-SiO2/H2O system with m = 0.2 g mass and d = 20 nm particle size. It was determined that the radiation-chemical yield of molecular hydrogen obtained from the water radiolysis process in the nano-SiO2/H2O system created by the adsorption of water on the nanoparticle surface had a low value. In systems created with the addition of water, the radiation-chemical yield of molecular hydrogen obtained from its radiolysis increased in direct proportion to the water mass. This proves that due to ionizing rays, the yield of electrons emitted from the nanoparticle surface into the water and solvated there increases. Therefore, the radiation-chemical yield of molecular hydrogen is higher than that of the adsorbed system.
Oral rotavirus vaccine efficacy estimates from randomised controlled trials are highly variable across settings. Although the randomised study design increases the likelihood of internal validity of findings, results from trials may not always apply outside the context of the study due to differences between trial participants and the target population. Here, we used a weight-based method to transport results from a monovalent rotavirus vaccine clinical trial conducted in Malawi between 2005 and 2008 to a target population of all trial-eligible children in Malawi, represented by data from the 2015–2016 Malawi Demographic and Health Survey (DHS). We reweighted trial participants to reflect the population characteristics described by the Malawi DHS. Vaccine efficacy was estimated for 1008 trial participants after applying these weights such that they represented trial-eligible children in Malawi. We also conducted subgroup analyses to examine the heterogeneous treatment effects by stunting and tuberculosis vaccination status at enrolment. In the original trial, the estimates of one-year vaccine efficacy against severe rotavirus gastroenteritis and any-severity rotavirus gastroenteritis in Malawi were 49.2% (95% CI 15.6%–70.3%) and 32.1% (95% CI 2.5%–53.1%), respectively. After weighting trial participants to represent all trial-eligible children in Malawi, vaccine efficacy increased to 62.2% (95% CI 35.5%–79.0%) against severe rotavirus gastroenteritis and 38.9% (95% CI 11.4%–58.5%) against any-severity rotavirus gastroenteritis. Rotavirus vaccine efficacy may differ between trial participants and target populations when these two populations differ. Differences in tuberculosis vaccination status between the trial sample and DHS population contributed to varying trial and target population vaccine efficacy estimates.
The resurgence and outbreaks of mumps occur frequently in many countries worldwide in recent years, even in countries with high vaccination coverage. In this study, a descriptive and spatiotemporal clustering analysis at the township level was conducted to explore the dynamic spatiotemporal aggregation and epidemiological characteristics of mumps in Wuhan. During 2005 and 2019, there were 40 685 cases reported in Wuhan, with an average annual morbidity of 28.11 per 100 000 populations. The morbidity showed a fluctuating tendency, and peaked in 2010 and 2018. Bimodal seasonality was found, with a large peak between May and July, and a mild peak from November to January in the following year. Male students aged 5–9-year-old were the main risk group of mumps infection. Significant global spatial auto-correlation was detected except in 2007, 2009 and 2015. The spatial and temporal scan statistics indicated that the hot-spots mainly located at the western and southern areas of Wuhan with variations almost every year. Our findings could assist the public health authorities to develop and improve targeted health strategies, and allocate health resources rationally.
Imagine, you enter a grocery store to buy food. How many people do you overlap with in this store? How much time do you overlap with each person in the store? In this paper, we answer these questions by studying the overlap times between customers in the infinite server queue. We compute in closed form the steady-state distribution of the overlap time between a pair of customers and the distribution of the number of customers that an arriving customer will overlap with. Finally, we define a residual process that counts the number of overlapping customers that overlap in the queue for at least $\delta$ time units and compute its distribution.
We consider the asset price as the weak solution to a stochastic differential equation driven by both a Brownian motion and the counting process martingale whose predictable compensator follows shot-noise and Hawkes processes. In this framework, we discuss the Esscher martingale measure where the conditions for its existence are detailed. This generalizes certain relationships not yet encountered in the literature.
The tail index is an important parameter that measures how extreme events occur. In many practical cases, this tail index depends on covariates. In this paper,we assume that it takes a finite number of values over a partition of the covariate space. This article proposes a tail index partition-based rules extraction method that is able to construct estimates of the partition subsets and estimates of the tail index values. The method combines two steps: first an additive tree ensemble based on the Gamma deviance is fitted, and second a hierarchical clustering with spatial constraints is used to estimate the subsets of the partition. We also propose a global tree surrogate model to approximate the partition-based rules while providing an explainable model from the initial covariates. Our procedure is illustrated on simulated data. A real case study on wind property damages caused by tornadoes is finally presented.
This article proposes a continuous time mortality model based on calendar years. Mortality rates belong to a mean-reverting random field indexed by time and age. In order to explain the improvement of life expectancies, the reversion level of mortality rates is the product of a deterministic function of age and of a decreasing jump-diffusion process driving the evolution of longevity. We provide a general closed-form expression for survival probabilities and develop it when the mean reversion level of mortality rates is proportional to a Gompertz–Makeham law. We develop an econometric estimation method and validate the model on the Belgian population.
The association between time to positivity (TTP) of blood culture and the clinical prognosis of patients with Klebsiella pneumoniae bloodstream infection (BSI) remains unclear. A retrospective study of 148 inpatients with BSI caused by K. pneumoniae was performed at Shanghai Tongji Hospital, China, from October 2016–2020. The total in-hospital fatality rate was 32%. The median TTP was 11.0 (7.7–16.1) h and the optimal cutoff for prediction of in-hospital mortality was 9.4 h according to the ROC curve. Early TTP (<9.4 h) was a risk factor for in-hospital mortality by univariate analysis (OR = 2.5, 95% CI 1.2–5.0, P = 0.01), but not by multivariate analysis (OR = 2.7, 95% CI 1.0–7.4, P = 0.06). Old age, serum creatinine, white blood cells, and C-reactive protein values were risk factors for in-hospital mortality by multivariate analysis. Early TTP was not a risk factor for septic shock (OR = 1.8, 95% CI 0.6–5.1, P = 0.27) or ICU admission (OR = 1.0, 95% CI 1.0–1.0, P = 0.32). In conclusion, the in-hospital fatality rate of patients with K. pneumoniae BSI was relatively high and associated with an early TTP of blood cultures. However, no increased risk of mortality, septic shock or ICU admission was evident in early TTP patients.
We study an optimal reinsurance problem for a diffusion model, in which the drift of the claim follows an Ornstein–Uhlenbeck process. The aim of the insurer is to maximize the expected exponential utility of its terminal wealth. We consider two cases: full information and partial information. Full information occurs when the insurer directly observes the drift; partial information occurs when the insurer observes only its claims. By applying stochastic control and by solving the corresponding Hamilton–Jacobi–Bellman equations, we find the value function and the optimal reinsurance strategy under both full and partial information. We determine a relationship between the value function and reinsurance strategy under full information with the value function and reinsurance strategy under partial information.
Our study population consisted of all children and adolescents, with laboratory-confirmed SARS-Co-V-2 infection, hospitalised from February 2020 through February 2022, among residents of the Tel Aviv (TA) District, Israel. There were 491 children and adolescents hospitalised with Sars-CoV-2 infection. Among them, 281 (57%) admitted with coronavirus disease 2019 (COVID-19) as the primary cause of admission (rate of 39 per 100 000). Among all children and adolescents in the TA District, the highest hospitalisation rates were observed among infants and children below the age of 4 years (rate of 311 per 100 000 population). Severe disease was observed mostly among children with multiple underlying medical conditions. Admission rates were also elevated among residents of the ultra-orthodox community (rate ratio (RR) compared to the rest of the district; 95% confidence interval (CI) 2.38–3.82). Admission rates with COVID-19 as primary cause of admission were higher during Omicron compared to Delta predominance period (RR 1.7; 95% CI 1.22–2.32). Targeted social and public health policies should be put in place when rates of disease start to increase, such as encouraging vaccine uptake for eligible children and social distancing when necessary, taking into account already existing social and learning gaps, in order to reduce the burden of disease.