Search results for Statistics and Probability

Part III - Fighting Nonignorable Nonresponse
Michael A. Bailey, Georgetown University, Washington DC
Book:

Polling at a Crossroads

Published online:

29 February 2024

Print publication:

07 March 2024, pp 135-136
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

1 - Modern Polling
from Part I - Polling in Context
Michael A. Bailey, Georgetown University, Washington DC
Book:

Polling at a Crossroads

Published online:

29 February 2024

Print publication:

07 March 2024, pp 3-22
- Chapter
- - You have access
- PDF
- Export citation
Summary

Polling has become very difficult. People do not respond, and pollsters use methods that are far removed from the random sampling tools that built the field. This chapter introduces the book by outlining the main challenges facing polling today, how conventional tools fail to fully meet these challenges and how a new paradigm and new methods can more directly take on the full spectrum of nonresponse bias given contemporary polling practices.

Part IV - Applications
Michael A. Bailey, Georgetown University, Washington DC
Book:

Polling at a Crossroads

Published online:

29 February 2024

Print publication:

07 March 2024, pp 211-212
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

10 - Randomized Response Instruments
from Part III - Fighting Nonignorable Nonresponse
Michael A. Bailey, Georgetown University, Washington DC
Book:

Polling at a Crossroads

Published online:

29 February 2024

Print publication:

07 March 2024, pp 183-197
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter highlights the critical importance of having the right kind of data for selection models that address nonignorable nonresponse. In general, we need a variable that is included in our response model and excluded from our outcome model. The best approach is creating a randomized response instrument that affects whether someone responds, but does not affect the content of their response. In many polling contexts, it is easy to create randomized response instruments. The pollster simply needs to figure out some protocol that affects response rate and then randomize it. Section 10.1 makes it clear that knowing the correct functional form is not enough to save a selection model. Section 10.2 highlights the difficulty of using observational response instruments. Section 10.3 discusses how and why to create randomized response instruments. Section 10.4 shows how to use randomized response instruments in a simple test for diagnosing nonignorable nonresponse. Section 10.5 shows how randomized response instruments enable us to use the full suite of selection models even when we do not observe data for nonrespondents.

A NONPARAMETRIC TEST OF HETEROGENEITY IN CONDITIONAL QUANTILE TREATMENT EFFECTS
Zongwu Cai, Ying Fang, Ming Lin, Shengfang Tang
Journal:

Econometric Theory / Volume 41 / Issue 3 / June 2025

Published online by Cambridge University Press:

07 March 2024, pp. 660-687
- Article
- - You have access
  - Open access
- PDF
- Export citation
This paper proposes a nonparametric test to assess whether there exist heterogeneous quantile treatment effects (QTEs) of an intervention on the outcome of interest across different sub-populations defined by covariates of interest. Specifically, a consistent test statistic based on the Cramér–von Mises type criterion is developed to test if the treatment has a constant quantile effect for all sub-populations defined by covariates of interest. Under some regularity conditions, the asymptotic behaviors of the proposed test statistic are investigated under both the null and alternative hypotheses. Furthermore, a nonparametric Bootstrap procedure is suggested to approximate the finite-sample null distribution of the proposed test; then, the asymptotic validity of the proposed Bootstrap test is theoretically justified. Through Monte Carlo simulations, we demonstrate the power properties of the test in finite samples. Finally, the proposed testing approach is applied to investigate whether there exists heterogeneity for the QTE of maternal smoking during pregnancy on infant birth weight across different age groups of mothers.

Acknowledgments
Michael A. Bailey, Georgetown University, Washington DC
Book:

Polling at a Crossroads

Published online:

29 February 2024

Print publication:

07 March 2024, pp xix-xx
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Skew Ornstein–Uhlenbeck processes with sticky reflection and their applications to bond pricing
Part of
Shiyu Song, Guangli Xu
Journal:

Journal of Applied Probability / Volume 61 / Issue 4 / December 2024

Published online by Cambridge University Press:

06 March 2024, pp. 1172-1195

Print publication:

December 2024
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
We study a skew Ornstein–Uhlenbeck process with zero being a sticky reflecting boundary, which is defined as the weak solution to a stochastic differential equation (SDE) system involving local time. The main results obtained include: (i) the existence and uniqueness of solutions to the SDE system, (ii) the scale function and speed measure, and (iii) the distributional properties regarding the transition density and the first hitting times. On the application side, we apply the process to interest rate modeling and obtain the explicit pricing formula for zero-coupon bonds. Numerical examples illustrate the impacts on bond yields of skewness and stickiness parameters.

Antidirected subgraphs of oriented graphs
Part of
- Graph theory
Maya Stein, Camila Zárate-Guerén
Journal:

Combinatorics, Probability and Computing / Volume 33 / Issue 4 / July 2024

Published online by Cambridge University Press:

06 March 2024, pp. 446-466
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
We show that for every $\eta \gt 0$ every sufficiently large $n$-vertex oriented graph $D$ of minimum semidegree exceeding $(1+\eta )\frac k2$ contains every balanced antidirected tree with $k$ edges and bounded maximum degree, if $k\ge \eta n$. In particular, this asymptotically confirms a conjecture of the first author for long antidirected paths and dense digraphs.
Further, we show that in the same setting, $D$ contains every $k$-edge antidirected subdivision of a sufficiently small complete graph, if the paths of the subdivision that have length $1$ or $2$ span a forest. As a special case, we can find all antidirected cycles of length at most $k$.
Finally, we address a conjecture of Addario-Berry, Havet, Linhares Sales, Reed, and Thomassé for antidirected trees in digraphs. We show that this conjecture is asymptotically true in $n$-vertex oriented graphs for all balanced antidirected trees of bounded maximum degree and of size linear in $n$.

Large monochromatic components in expansive hypergraphs
Part of
- Extremal combinatorics
- Graph theory
Deepak Bal, Louis DeBiasio
Journal:

Combinatorics, Probability and Computing / Volume 33 / Issue 4 / July 2024

Published online by Cambridge University Press:

05 March 2024, pp. 467-483
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
A result of Gyárfás [12] exactly determines the size of a largest monochromatic component in an arbitrary $r$-colouring of the complete $k$-uniform hypergraph $K_n^k$ when $k\geq 2$ and $k\in \{r-1,r\}$. We prove a result which says that if one replaces $K_n^k$ in Gyárfás’ theorem by any ‘expansive’ $k$-uniform hypergraph on $n$ vertices (that is, a $k$-uniform hypergraph $G$ on $n$ vertices in which $e(V_1, \ldots, V_k)\gt 0$ for all disjoint sets $V_1, \ldots, V_k\subseteq V(G)$ with $|V_i|\gt \alpha$ for all $i\in [k]$), then one gets a largest monochromatic component of essentially the same size (within a small error term depending on $r$ and $\alpha$). As corollaries we recover a number of known results about large monochromatic components in random hypergraphs and random Steiner triple systems, often with drastically improved bounds on the error terms.
Gyárfás’ result is equivalent to the dual problem of determining the smallest possible maximum degree of an arbitrary $r$-partite $r$-uniform hypergraph $H$ with $n$ edges in which every set of $k$ edges has a common intersection. In this language, our result says that if one replaces the condition that every set of $k$ edges has a common intersection with the condition that for every collection of $k$ disjoint sets $E_1, \ldots, E_k\subseteq E(H)$ with $|E_i|\gt \alpha$, there exists $(e_1, \ldots, e_k)\in E_1\times \cdots \times E_k$ such that $e_1\cap \cdots \cap e_k\neq \emptyset$, then the smallest possible maximum degree of $H$ is essentially the same (within a small error term depending on $r$ and $\alpha$). We prove our results in this dual setting.

A class of graphs of zero Turán density in a hypercube
Part of
- Graph theory
- Extremal combinatorics
Maria Axenovich
Journal:

Combinatorics, Probability and Computing / Volume 33 / Issue 3 / May 2024

Published online by Cambridge University Press:

05 March 2024, pp. 404-410
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
For a graph $H$ and a hypercube $Q_n$, $\textrm{ex}(Q_n, H)$ is the largest number of edges in an $H$-free subgraph of $Q_n$. If $\lim _{n \rightarrow \infty } \textrm{ex}(Q_n, H)/|E(Q_n)| \gt 0$, $H$ is said to have a positive Turán density in a hypercube or simply a positive Turán density; otherwise, it has zero Turán density. Determining $\textrm{ex}(Q_n, H)$ and even identifying whether $H$ has a positive or zero Turán density remains a widely open question for general $H$. By relating extremal numbers in a hypercube and certain corresponding hypergraphs, Conlon found a large class of graphs, ones having so-called partite representation, that have zero Turán density. He asked whether this gives a characterisation, that is, whether a graph has zero Turán density if and only if it has partite representation. Here, we show that, as suspected by Conlon, this is not the case. We give an example of a class of graphs which have no partite representation, but on the other hand, have zero Turán density. In addition, we show that any graph whose every block has partite representation has zero Turán density in a hypercube.

Some properties of convex and increasing convex orders under Archimedean copula
Qingyuan Guan, Bing Xing Wang
Journal:

Probability in the Engineering and Informational Sciences / Volume 38 / Issue 4 / October 2024

Published online by Cambridge University Press:

05 March 2024, pp. 632-643
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
In this paper, the ordering properties of convex and increasing convex orders of the dependent random variables are studied. Some closure properties of the convex and increasing convex orders under independent random variables are extended to the dependent random variables under the Archimedean copula. Two applications are provided to illustrate our results.

Sharp bounds for a discrete John’s theorem
Part of
- Discrete geometry
- General convexity
Peter van Hintum, Peter Keevash
Journal:

Combinatorics, Probability and Computing / Volume 33 / Issue 4 / July 2024

Published online by Cambridge University Press:

05 March 2024, pp. 484-486
- Article
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Tao and Vu showed that every centrally symmetric convex progression $C\subset \mathbb{Z}^d$ is contained in a generalized arithmetic progression of size $d^{O(d^2)} \# C$. Berg and Henk improved the size bound to $d^{O(d\log d)} \# C$. We obtain the bound $d^{O(d)} \# C$, which is sharp up to the implied constant and is of the same form as the bound in the continuous setting given by John’s theorem.

COVID-19 outbreak at a residential apartment building in Northern Ontario, Canada
Dinna Lozano, Carolyn Dohoo, David Elfstrom, Kendra Carswell, Jennifer L. Guthrie
Journal:

Epidemiology & Infection / Volume 152 / 2024

Published online by Cambridge University Press:

04 March 2024, e53
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
In February 2021, a cluster of Beta variant (B.1.351) coronavirus disease 2019 (COVID-19) cases were identified in an apartment building located in Northern Ontario, Canada. Most cases had no known contact with each other. Objectives of this multi-component outbreak investigation were to better understand the social and environmental factors that facilitated the transmission of COVID-19 through this multi-unit residential building (MURB). A case–control study examined building-specific exposures and resident behaviours that may have increased the odds of being a case. A professional engineer assessed the building’s heating, ventilation, and air-conditioning (HVAC) systems. Whole-genome sequencing and an in-depth genomic analysis were performed. Forty-five outbreak-confirmed cases were identified. From the case–control study, being on the upper floors (OR: 10.4; 95% CI: 1.63–66.9) and within three adjacent vertical lines (OR: 28.3; 3.57–225) were both significantly associated with being a case of COVID-19, after adjusting for age. There were no significant differences in reported behaviours, use of shared spaces, or precautions taken between cases and controls. An assessment of the building’s ventilation found uncontrolled air leakage between apartment units. A single genomic cluster was identified, where most sequences were identical to one another. Findings from the multiple components of this investigation are suggestive of aerosol transmission between units.

Perfect sampling of stochastic matching models with reneging
Part of
Thomas Masanet, Pascal Moyal
Journal:

Advances in Applied Probability / Volume 56 / Issue 4 / December 2024

Published online by Cambridge University Press:

04 March 2024, pp. 1307-1339

Print publication:

December 2024
- Article
- - You have access
- PDF
- HTML
- Export citation
In this paper, we introduce a slight variation of the dominated-coupling-from-the-past (DCFTP) algorithm of Kendall, for bounded Markov chains. It is based on the control of a (typically non-monotonic) stochastic recursion by another (typically monotonic) one. We show that this algorithm is particularly suitable for stochastic matching models with bounded patience, a class of models for which the steady-state distribution of the system is in general unknown in closed form. We first show that the Markov chain of this model can easily be controlled by an infinite-server queue. We then investigate the particular case where patience times are deterministic, and this control argument may fail. In that case we resort to an ad-hoc technique that can also be seen as a control (this time, by the arrival sequence). We then compare this algorithm to the primitive coupling-from-the-past (CFTP) algorithm and to control by an infinite-server queue, and show how our perfect simulation results can be used to estimate and compare, for instance, the loss probabilities of various systems in equilibrium.

Post-migration HIV acquisition: A systematic review and meta-analysis
Simran Mann, Zeenathnisa Mougammadou, Jan Wohlfahrt, Rahma Elmahdi
Journal:

Epidemiology & Infection / Volume 152 / 2024

Published online by Cambridge University Press:

01 March 2024, e49
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Migrants in Europe face a disproportionate burden of HIV infection; however, it remains unclear if this can be prevented through public health interventions in host countries. We undertake a systematic review and meta-analysis to estimate post-migration HIV acquisition (PMHA) as a proportion of all HIV cases in European migrants. MEDLINE, EMBASE, Global Health, HMIC, and Cochrane Library were searched with terms capturing ‘HIV’, ‘migration’, and ‘Europe’. Data relating to the proportion of HIV acquired following migration were extracted and random-effects model (REM) meta-analysis was undertaken to calculate a pooled estimate for the proportion of PMHA in European countries. Subgroup meta-analysis was undertaken for PMHA by migrant demographic characteristics and host country. Fifteen articles were included for systematic review following retrieval and screening of 2,320 articles. A total of 47,182 migrants in 11 European countries were included in REM meta-analysis, showing an overall PMHA proportion of 0.30 (95% CI: 0.23–0.38). Subgroup analysis showed no significant difference in PMHA between host country and migrant demographic characteristics. This work illustrates that migrants continue to be at high risk of HIV acquisition in Europe. This indicates the need for targeted screening and HIV prevention interventions, ensuring resources are appropriately directed to combat the spread of HIV.

When data meets citizens: an investigation of citizen engagement in data-driven innovation programmes
Gefion Thuermer, Johanna Walker, Elena Simperl, Les Carr
Journal:

Data & Policy / Volume 6 / 2024

Published online by Cambridge University Press:

01 March 2024, e12
- Article
- - You have access
  - Open access
- PDF
- HTML
- Export citation
Publicly funded data-driven innovation programmes frequently involve partnerships between small and medium enterprises (SMEs) and municipal authorities utilizing citizen data. The intention of these projects is to benefit citizens. However, few such projects achieve success or impact within the project timeframe. This may result in benefit accruing mainly to the SME partner, who gains both learning and data, engendering questions of data justice around whether citizen data are being exploited without sufficient benefit returning to citizens. Through case studies composed of interviews and document analysis, we examine how benefits for citizens are conceived and achieved in the publicly funded data-driven air quality projects Data Pitch and Smart Cities Innovation Framework Implementation. We find the differences between the programme funders’ policies had a clear influence on the citizen engagement elements. There are also a number of ways in which the desired citizen engagement and benefit becomes diluted, including through misalignment of incentives and focus, a lack of prioritization and ownership, and power imbalances between citizens and the other actors in the quadruple helix model. To retain the focus on ensuring citizens benefit from data-driven innovation programmes using citizen data, we propose the use of data Justice plans. More work is required to specify the content and mechanisms of such plans for application in such programmes.

Heavy-traffic queue length behavior in a switch under Markovian arrivals
Part of
- Special processes
- Computer system organization
Shancong Mou, Siva Theja Maguluri
Journal:

Advances in Applied Probability / Volume 56 / Issue 3 / September 2024

Published online by Cambridge University Press:

01 March 2024, pp. 1106-1152

Print publication:

September 2024
- Article
- - You have access
- PDF
- HTML
- Export citation
This paper studies the input-queued switch operating under the MaxWeight algorithm when the arrivals are according to a Markovian process. We exactly characterize the heavy-traffic scaled mean sum queue length in the heavy-traffic limit, and show that it is within a factor of less than 2 from a universal lower bound. Moreover, we obtain lower and upper bounds that are applicable in all traffic regimes and become tight in the heavy-traffic regime.
We obtain these results by generalizing the drift method recently developed for the case of independent and identically distributed arrivals to the case of Markovian arrivals. We illustrate this generalization by first obtaining the heavy-traffic mean queue length and its distribution in a single-server queue under Markovian arrivals and then applying it to the case of an input-queued switch.
The key idea is to exploit the geometric mixing of finite-state Markov chains, and to work with a time horizon that is chosen so that the error due to mixing depends on the heavy-traffic parameter.

Polling at a Crossroads

Rethinking Modern Survey Research
Michael A. Bailey
Published online:

29 February 2024

Print publication:

07 March 2024
- Book
- - Get access
    
    Buy a print copy
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Survey research is in a state of crisis. People have become less willing to respond to polls and recent misses in critical elections have undermined the field's credibility. Pollsters have developed many tools for dealing with the new environment, an increasing number of which rely on risky opt-in samples. Virtually all of these tools require that respondents in each demographic category are a representative sample of all people in each demographic category, something that is unlikely to be reliably true. Polling at a Crossroads moves beyond such strong limitations, providing tools that work even when survey respondents are unrepresentative in complex ways. This book provides case studies that show how to avoid underestimating Trump support and how conventional polls exaggerate partisan differences. This book also helps us think in clear and sometimes counterintuitive ways and points toward simple, low-cost changes that can better address contemporary polling challenges.

2 - How Are Subscores Reported?
Shelby Haberman, Sandip Sinharay, Educational Testing Service, New Jersey, Richard A. Feinberg, National Board of Medical Examiners, Pennsylvania, Howard Wainer
Book:

Subscores

Published online:

22 February 2024

Print publication:

29 February 2024, pp 18-44
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Different modalities for communicating subscore information, across varied testing purposes and audiences, are illustrated through excerpts from real score reports. Connections to general score reporting best practices and development are discussed.

7 - Statistical Boosting for GAMLSS
from Part II - Statistical Inference in GAMLSS
Mikis D. Stasinopoulos, University of Greenwich, Thomas Kneib, Georg-August-Universität, Göttingen, Germany, Nadja Klein, Technische Universität Dortmund, Andreas Mayr, Rheinische Friedrich-Wilhelms-Universität Bonn, Gillian Z. Heller, University of Sydney
Book:

Generalized Additive Models for Location, Scale and Shape

Published online:

22 February 2024

Print publication:

29 February 2024, pp 165-194
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Statistics and Probability

Refine search

Refine search

Actions for selected content:

52346 results in Statistics and Probability

Part III - Fighting Nonignorable Nonresponse

1 - Modern Polling

Summary

Part IV - Applications

10 - Randomized Response Instruments

Summary

A NONPARAMETRIC TEST OF HETEROGENEITY IN CONDITIONAL QUANTILE TREATMENT EFFECTS

Acknowledgments

Skew Ornstein–Uhlenbeck processes with sticky reflection and their applications to bond pricing

Antidirected subgraphs of oriented graphs

Large monochromatic components in expansive hypergraphs

A class of graphs of zero Turán density in a hypercube

Some properties of convex and increasing convex orders under Archimedean copula

Sharp bounds for a discrete John’s theorem

COVID-19 outbreak at a residential apartment building in Northern Ontario, Canada

Perfect sampling of stochastic matching models with reneging

Post-migration HIV acquisition: A systematic review and meta-analysis

When data meets citizens: an investigation of citizen engagement in data-driven innovation programmes

Heavy-traffic queue length behavior in a switch under Markovian arrivals

Polling at a Crossroads

2 - How Are Subscores Reported?

Summary

7 - Statistical Boosting for GAMLSS

Statistics and Probability

Refine search

Refine search

Actions for selected content:

Save Search

52346 results in Statistics and Probability

Summary

Summary

Polling at a Crossroads

Summary