Hostname: page-component-77f85d65b8-grvzd Total loading time: 0 Render date: 2026-03-28T21:34:20.458Z Has data issue: false hasContentIssue false

Uncertainty quantification and confidence intervals for naive rare-event estimators

Published online by Cambridge University Press:  02 September 2024

Yuanlu Bai*
Affiliation:
Columbia University
Henry Lam*
Affiliation:
Columbia University
*
*Postal address: 500 West 120th Street, New York, NY, USA.
*Postal address: 500 West 120th Street, New York, NY, USA.
Rights & Permissions [Opens in a new window]

Abstract

We consider the estimation of rare-event probabilities using sample proportions output by naive Monte Carlo or collected data. Unlike using variance reduction techniques, this naive estimator does not have an a priori relative efficiency guarantee. On the other hand, due to the recent surge of sophisticated rare-event problems arising in safety evaluations of intelligent systems, efficiency-guaranteed variance reduction may face implementation challenges which, coupled with the availability of computation or data collection power, motivate the use of such a naive estimator. In this paper we study the uncertainty quantification, namely the construction, coverage validity, and tightness of confidence intervals, for rare-event probabilities using only sample proportions. In addition to the known normality, Wilson, and exact intervals, we investigate and compare them with two new intervals derived from Chernoff’s inequality and the Berry–Esseen theorem. Moreover, we generalize our results to the natural situation where sampling stops by reaching a target number of rare-event hits. Our findings show that the normality and Wilson intervals are not always valid, but they are close to the newly developed valid intervals in terms of half-width. In contrast, the exact interval is conservative, but safely guarantees the attainment of the nominal confidence level. Our new intervals, while being more conservative than the exact interval, provide useful insights into understanding the tightness of the considered intervals.

Information

Type
Original Article
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Applied Probability Trust
Figure 0

Figure 1. Comparisons of the positions of confidence upper and lower bounds under the standard setting. Here, ‘valid’ means that the CI has valid coverage in the sense that the actual coverage probability always reaches the nominal confidence level.

Figure 1

Figure 2. Comparisons of the positions of confidence upper and lower bounds under the targeted stopping setting. Here, ‘valid’ means that the CI has valid coverage in the sense that the actual coverage probability always reaches the nominal confidence level.

Figure 2

Table 1. Summary of the CIs in the standard setting ($I_1,\dots,I_n \stackrel{\mathrm{i.i.d.}}{\sim}\mathrm{Bernoulli}(p)$, $\hat{p}=({1}/{n})\sum_{i=1}^n I_i$, $\hat{s}=n\hat p$).

Figure 3

Table 2. Summary of the CIs in the targeted stopping setting ($N_1,\dots,N_{n_0}\stackrel{\mathrm{i.i.d.}}{\sim}\mathrm{Geometric}(p)$, $N=\sum_{i=1}^{n_0}N_i$, $\hat p=n_0/N$).

Figure 4

Figure 3. Average values of the confidence upper and lower bounds under the standard setting.

Figure 5

Table 3. Coverage probabilities of the CIs under the standard setting.

Figure 6

Figure 4. Average values of the confidence upper and lower bounds under the targeted stopping setting.

Figure 7

Table 4. Coverage probabilities of the CIs under the targeted stopping setting.