To save content items to your account,
please confirm that you agree to abide by our usage policies.
If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account.
Find out more about saving content to .
To save content items to your Kindle, first ensure no-reply@cambridge.org
is added to your Approved Personal Document E-mail List under your Personal Document Settings
on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part
of your Kindle email address below.
Find out more about saving to your Kindle.
Note you can select to save to either the @free.kindle.com or @kindle.com variations.
‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi.
‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.
Polling has become very difficult. People do not respond, and pollsters use methods that are far removed from the random sampling tools that built the field. This chapter introduces the book by outlining the main challenges facing polling today, how conventional tools fail to fully meet these challenges and how a new paradigm and new methods can more directly take on the full spectrum of nonresponse bias given contemporary polling practices.
This chapter highlights the critical importance of having the right kind of data for selection models that address nonignorable nonresponse. In general, we need a variable that is included in our response model and excluded from our outcome model. The best approach is creating a randomized response instrument that affects whether someone responds, but does not affect the content of their response. In many polling contexts, it is easy to create randomized response instruments. The pollster simply needs to figure out some protocol that affects response rate and then randomize it. Section 10.1 makes it clear that knowing the correct functional form is not enough to save a selection model. Section 10.2 highlights the difficulty of using observational response instruments. Section 10.3 discusses how and why to create randomized response instruments. Section 10.4 shows how to use randomized response instruments in a simple test for diagnosing nonignorable nonresponse. Section 10.5 shows how randomized response instruments enable us to use the full suite of selection models even when we do not observe data for nonrespondents.
This paper proposes a nonparametric test to assess whether there exist heterogeneous quantile treatment effects (QTEs) of an intervention on the outcome of interest across different sub-populations defined by covariates of interest. Specifically, a consistent test statistic based on the Cramér–von Mises type criterion is developed to test if the treatment has a constant quantile effect for all sub-populations defined by covariates of interest. Under some regularity conditions, the asymptotic behaviors of the proposed test statistic are investigated under both the null and alternative hypotheses. Furthermore, a nonparametric Bootstrap procedure is suggested to approximate the finite-sample null distribution of the proposed test; then, the asymptotic validity of the proposed Bootstrap test is theoretically justified. Through Monte Carlo simulations, we demonstrate the power properties of the test in finite samples. Finally, the proposed testing approach is applied to investigate whether there exists heterogeneity for the QTE of maternal smoking during pregnancy on infant birth weight across different age groups of mothers.
We study a skew Ornstein–Uhlenbeck process with zero being a sticky reflecting boundary, which is defined as the weak solution to a stochastic differential equation (SDE) system involving local time. The main results obtained include: (i) the existence and uniqueness of solutions to the SDE system, (ii) the scale function and speed measure, and (iii) the distributional properties regarding the transition density and the first hitting times. On the application side, we apply the process to interest rate modeling and obtain the explicit pricing formula for zero-coupon bonds. Numerical examples illustrate the impacts on bond yields of skewness and stickiness parameters.
We show that for every $\eta \gt 0$ every sufficiently large $n$-vertex oriented graph $D$ of minimum semidegree exceeding $(1+\eta )\frac k2$ contains every balanced antidirected tree with $k$ edges and bounded maximum degree, if $k\ge \eta n$. In particular, this asymptotically confirms a conjecture of the first author for long antidirected paths and dense digraphs.
Further, we show that in the same setting, $D$ contains every $k$-edge antidirected subdivision of a sufficiently small complete graph, if the paths of the subdivision that have length $1$ or $2$ span a forest. As a special case, we can find all antidirected cycles of length at most $k$.
Finally, we address a conjecture of Addario-Berry, Havet, Linhares Sales, Reed, and Thomassé for antidirected trees in digraphs. We show that this conjecture is asymptotically true in $n$-vertex oriented graphs for all balanced antidirected trees of bounded maximum degree and of size linear in $n$.
A result of Gyárfás [12] exactly determines the size of a largest monochromatic component in an arbitrary $r$-colouring of the complete $k$-uniform hypergraph $K_n^k$ when $k\geq 2$ and $k\in \{r-1,r\}$. We prove a result which says that if one replaces $K_n^k$ in Gyárfás’ theorem by any ‘expansive’ $k$-uniform hypergraph on $n$ vertices (that is, a $k$-uniform hypergraph $G$ on $n$ vertices in which $e(V_1, \ldots, V_k)\gt 0$ for all disjoint sets $V_1, \ldots, V_k\subseteq V(G)$ with $|V_i|\gt \alpha$ for all $i\in [k]$), then one gets a largest monochromatic component of essentially the same size (within a small error term depending on $r$ and $\alpha$). As corollaries we recover a number of known results about large monochromatic components in random hypergraphs and random Steiner triple systems, often with drastically improved bounds on the error terms.
Gyárfás’ result is equivalent to the dual problem of determining the smallest possible maximum degree of an arbitrary $r$-partite $r$-uniform hypergraph $H$ with $n$ edges in which every set of $k$ edges has a common intersection. In this language, our result says that if one replaces the condition that every set of $k$ edges has a common intersection with the condition that for every collection of $k$ disjoint sets $E_1, \ldots, E_k\subseteq E(H)$ with $|E_i|\gt \alpha$, there exists $(e_1, \ldots, e_k)\in E_1\times \cdots \times E_k$ such that $e_1\cap \cdots \cap e_k\neq \emptyset$, then the smallest possible maximum degree of $H$ is essentially the same (within a small error term depending on $r$ and $\alpha$). We prove our results in this dual setting.
For a graph $H$ and a hypercube $Q_n$, $\textrm{ex}(Q_n, H)$ is the largest number of edges in an $H$-free subgraph of $Q_n$. If $\lim _{n \rightarrow \infty } \textrm{ex}(Q_n, H)/|E(Q_n)| \gt 0$, $H$ is said to have a positive Turán density in a hypercube or simply a positive Turán density; otherwise, it has zero Turán density. Determining $\textrm{ex}(Q_n, H)$ and even identifying whether $H$ has a positive or zero Turán density remains a widely open question for general $H$. By relating extremal numbers in a hypercube and certain corresponding hypergraphs, Conlon found a large class of graphs, ones having so-called partite representation, that have zero Turán density. He asked whether this gives a characterisation, that is, whether a graph has zero Turán density if and only if it has partite representation. Here, we show that, as suspected by Conlon, this is not the case. We give an example of a class of graphs which have no partite representation, but on the other hand, have zero Turán density. In addition, we show that any graph whose every block has partite representation has zero Turán density in a hypercube.
In this paper, the ordering properties of convex and increasing convex orders of the dependent random variables are studied. Some closure properties of the convex and increasing convex orders under independent random variables are extended to the dependent random variables under the Archimedean copula. Two applications are provided to illustrate our results.
Tao and Vu showed that every centrally symmetric convex progression $C\subset \mathbb{Z}^d$ is contained in a generalized arithmetic progression of size $d^{O(d^2)} \# C$. Berg and Henk improved the size bound to $d^{O(d\log d)} \# C$. We obtain the bound $d^{O(d)} \# C$, which is sharp up to the implied constant and is of the same form as the bound in the continuous setting given by John’s theorem.
In February 2021, a cluster of Beta variant (B.1.351) coronavirus disease 2019 (COVID-19) cases were identified in an apartment building located in Northern Ontario, Canada. Most cases had no known contact with each other. Objectives of this multi-component outbreak investigation were to better understand the social and environmental factors that facilitated the transmission of COVID-19 through this multi-unit residential building (MURB). A case–control study examined building-specific exposures and resident behaviours that may have increased the odds of being a case. A professional engineer assessed the building’s heating, ventilation, and air-conditioning (HVAC) systems. Whole-genome sequencing and an in-depth genomic analysis were performed. Forty-five outbreak-confirmed cases were identified. From the case–control study, being on the upper floors (OR: 10.4; 95% CI: 1.63–66.9) and within three adjacent vertical lines (OR: 28.3; 3.57–225) were both significantly associated with being a case of COVID-19, after adjusting for age. There were no significant differences in reported behaviours, use of shared spaces, or precautions taken between cases and controls. An assessment of the building’s ventilation found uncontrolled air leakage between apartment units. A single genomic cluster was identified, where most sequences were identical to one another. Findings from the multiple components of this investigation are suggestive of aerosol transmission between units.
In this paper, we introduce a slight variation of the dominated-coupling-from-the-past (DCFTP) algorithm of Kendall, for bounded Markov chains. It is based on the control of a (typically non-monotonic) stochastic recursion by another (typically monotonic) one. We show that this algorithm is particularly suitable for stochastic matching models with bounded patience, a class of models for which the steady-state distribution of the system is in general unknown in closed form. We first show that the Markov chain of this model can easily be controlled by an infinite-server queue. We then investigate the particular case where patience times are deterministic, and this control argument may fail. In that case we resort to an ad-hoc technique that can also be seen as a control (this time, by the arrival sequence). We then compare this algorithm to the primitive coupling-from-the-past (CFTP) algorithm and to control by an infinite-server queue, and show how our perfect simulation results can be used to estimate and compare, for instance, the loss probabilities of various systems in equilibrium.
Migrants in Europe face a disproportionate burden of HIV infection; however, it remains unclear if this can be prevented through public health interventions in host countries. We undertake a systematic review and meta-analysis to estimate post-migration HIV acquisition (PMHA) as a proportion of all HIV cases in European migrants. MEDLINE, EMBASE, Global Health, HMIC, and Cochrane Library were searched with terms capturing ‘HIV’, ‘migration’, and ‘Europe’. Data relating to the proportion of HIV acquired following migration were extracted and random-effects model (REM) meta-analysis was undertaken to calculate a pooled estimate for the proportion of PMHA in European countries. Subgroup meta-analysis was undertaken for PMHA by migrant demographic characteristics and host country. Fifteen articles were included for systematic review following retrieval and screening of 2,320 articles. A total of 47,182 migrants in 11 European countries were included in REM meta-analysis, showing an overall PMHA proportion of 0.30 (95% CI: 0.23–0.38). Subgroup analysis showed no significant difference in PMHA between host country and migrant demographic characteristics. This work illustrates that migrants continue to be at high risk of HIV acquisition in Europe. This indicates the need for targeted screening and HIV prevention interventions, ensuring resources are appropriately directed to combat the spread of HIV.
Publicly funded data-driven innovation programmes frequently involve partnerships between small and medium enterprises (SMEs) and municipal authorities utilizing citizen data. The intention of these projects is to benefit citizens. However, few such projects achieve success or impact within the project timeframe. This may result in benefit accruing mainly to the SME partner, who gains both learning and data, engendering questions of data justice around whether citizen data are being exploited without sufficient benefit returning to citizens. Through case studies composed of interviews and document analysis, we examine how benefits for citizens are conceived and achieved in the publicly funded data-driven air quality projects Data Pitch and Smart Cities Innovation Framework Implementation. We find the differences between the programme funders’ policies had a clear influence on the citizen engagement elements. There are also a number of ways in which the desired citizen engagement and benefit becomes diluted, including through misalignment of incentives and focus, a lack of prioritization and ownership, and power imbalances between citizens and the other actors in the quadruple helix model. To retain the focus on ensuring citizens benefit from data-driven innovation programmes using citizen data, we propose the use of data Justice plans. More work is required to specify the content and mechanisms of such plans for application in such programmes.
This paper studies the input-queued switch operating under the MaxWeight algorithm when the arrivals are according to a Markovian process. We exactly characterize the heavy-traffic scaled mean sum queue length in the heavy-traffic limit, and show that it is within a factor of less than 2 from a universal lower bound. Moreover, we obtain lower and upper bounds that are applicable in all traffic regimes and become tight in the heavy-traffic regime.
We obtain these results by generalizing the drift method recently developed for the case of independent and identically distributed arrivals to the case of Markovian arrivals. We illustrate this generalization by first obtaining the heavy-traffic mean queue length and its distribution in a single-server queue under Markovian arrivals and then applying it to the case of an input-queued switch.
The key idea is to exploit the geometric mixing of finite-state Markov chains, and to work with a time horizon that is chosen so that the error due to mixing depends on the heavy-traffic parameter.
Survey research is in a state of crisis. People have become less willing to respond to polls and recent misses in critical elections have undermined the field's credibility. Pollsters have developed many tools for dealing with the new environment, an increasing number of which rely on risky opt-in samples. Virtually all of these tools require that respondents in each demographic category are a representative sample of all people in each demographic category, something that is unlikely to be reliably true. Polling at a Crossroads moves beyond such strong limitations, providing tools that work even when survey respondents are unrepresentative in complex ways. This book provides case studies that show how to avoid underestimating Trump support and how conventional polls exaggerate partisan differences. This book also helps us think in clear and sometimes counterintuitive ways and points toward simple, low-cost changes that can better address contemporary polling challenges.
Different modalities for communicating subscore information, across varied testing purposes and audiences, are illustrated through excerpts from real score reports. Connections to general score reporting best practices and development are discussed.