Search results for Statistical theory and methods

6 - Contiguity
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 85-91
- Chapter
Summary

“Contiguity” is another name for “asymptotic absolute continuity.” Contiguity arguments are a technique to obtain the limit distribution of a sequence of statistics under underlying laws $Q_n$ from a limiting distribution under laws $P_n$. Typically, the laws $P_n$ describe a null distribution under investigation, and the laws $Q_n$ correspond to an alternative hypothesis.
Likelihood Ratios
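Let $P$ and $Q$ be measures on a measurable space $(\Omega, \mathcal{A})$. Then $Q$ is absolutely continuous with respect to $P$ if $P(A) = 0$ implies $Q(A) = 0$ for every measurable set $A$; this is denoted by $Q \ll P$. Furthermore, $P$ and $Q$ are orthogonal if $\Omega$ can be partitioned as $\Omega = \Omega_P \cup \Omega_Q$ with $\Omega_P \cap \Omega_Q = \emptyset$ and $P(\Omega_Q) = 0 = Q(\Omega_P)$. Thus $P$ “charges” only $\Omega_P$ and $Q$ “lives on” the set $\Omega_Q$, which is disjoint with the “support” of $P$. Orthogonality is denoted by $P \perp Q$.
In general, two measures $P$ and $Q$ need be neither absolutely continuous nor orthogonal. The relationship between their supports can best be described in terms of densities. Suppose $P$ and $Q$ possess densities $p$ and $q$ with respect to a measure $\mu$, and consider the sets
$$\Omega_P = \{p > 0\}, \qquad \Omega_Q = \{q > 0\}.$$
See Figure 6.1. Because $P(\Omega_P^c) = \int_{p = 0} p \, d\mu = 0$, the measure $P$ is supported on the set $\Omega_P$. Similarly, $Q$ is supported on $\Omega_Q$. The intersection $\Omega_P \cap \Omega_Q$ receives positive measure from both $P$ and $Q$ provided its measure under $\mu$ is positive. The measure $Q$ can be written as the sum $Q = Q^a + Q^\perp$ of the measures
$$Q^a(A) = Q(A \cap \{p > 0\}), \qquad Q^\perp(A) = Q(A \cap \{p = 0\}). \tag{6.1}$$
As proved in the next lemma, $Q^a \ll P$ and $Q^\perp \perp P$. Furthermore, for every measurable set $A$,
$$Q^a(A) = \int_A \frac{q}{p} \, dP.$$
The decomposition $Q = Q^a + Q^\perp$ is called the Lebesgue decomposition of $Q$ with respect to $P$. The measures $Q^a$ and $Q^\perp$ are called the absolutely continuous part and the orthogonal part (or singular part) of $Q$ with respect to $P$, respectively. In view of the preceding display, the function $q/p$ is a density of $Q^a$ with respect to $P$.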
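The Lebesgue decomposition can be made concrete on a finite sample space, where densities with respect to counting measure are just point masses. The following is a minimal sketch under that assumption; the function name `lebesgue_decomposition` and the example measures are hypothetical and not taken from the book.

```python
from fractions import Fraction

def lebesgue_decomposition(p, q):
    """Split Q into its part on {p > 0} and its part on {p = 0}.

    p and q map points of a finite sample space to masses, that is,
    they are densities with respect to counting measure.
    """
    points = set(p) | set(q)
    q_abs = {x: q.get(x, 0) for x in points if p.get(x, 0) > 0}
    q_perp = {x: q.get(x, 0) for x in points if p.get(x, 0) == 0}
    # On {p > 0} the ratio q/p is a density of the absolutely continuous part.
    ratio = {x: q_abs[x] / p[x] for x in q_abs}
    return q_abs, q_perp, ratio

# Hypothetical example: P lives on {1, 2, 3}, Q lives on {2, 3, 4}.
P = {1: Fraction(1, 2), 2: Fraction(1, 4), 3: Fraction(1, 4)}
Q = {2: Fraction(1, 3), 3: Fraction(1, 3), 4: Fraction(1, 3)}
Q_abs, Q_perp, dQa_dP = lebesgue_decomposition(P, Q)

# Check that the absolutely continuous part of A equals the integral of q/p over A.
A = {2, 3}
lhs = sum(Q_abs[x] for x in A)
rhs = sum(dQa_dP[x] * P[x] for x in A)
print(lhs, rhs, sum(Q_perp.values()))
```

In this example the point 4 carries the whole orthogonal part, because P gives it no mass.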

16 - Likelihood Ratio Tests
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 227-241
- Chapter
Summary

The critical values of the likelihood ratio test are usually based on an asymptotic approximation. We derive the asymptotic distribution of the likelihood ratio statistic and investigate its asymptotic quality through its asymptotic power function and its Bahadur efficiency.
Introduction
Suppose that we observe a sample $X_1, \ldots, X_n$ from a density $p_\theta$ and wish to test the null hypothesis $H_0: \theta \in \Theta_0$ versus the alternative $H_1: \theta \in \Theta_1$. If both the null and the alternative hypotheses consist of single points, then a most powerful test can be based on the log likelihood ratio, by the Neyman-Pearson theory. If the two points are $\theta_0$ and $\theta_1$, respectively, then the optimal test statistic is given by
$$\log \prod_{i=1}^n \frac{p_{\theta_1}(X_i)}{p_{\theta_0}(X_i)}.$$
For certain special models and hypotheses, the most powerful test turns out not to depend on $\theta_1$, and the test is uniformly most powerful for a composite hypothesis $H_1$. Sometimes the null hypothesis can be extended as well, and the testing problem has a fully satisfactory solution. Unfortunately, in many situations there is no single best test, not even in an asymptotic sense (see Chapter 15). A variety of ideas lead to reasonable tests. A sensible extension of the idea behind the Neyman-Pearson theory is to base a test on the log likelihood ratio
$$\tilde\Lambda_n = \log \frac{\sup_{\theta \in \Theta_1} \prod_{i=1}^n p_\theta(X_i)}{\sup_{\theta \in \Theta_0} \prod_{i=1}^n p_\theta(X_i)}.$$
The single points are replaced by maxima over the hypotheses. As before, the null hypothesis is rejected for large values of the statistic.
Because the distributional properties of $\tilde\Lambda_n$ can be somewhat complicated, one usually replaces the supremum in the numerator by a supremum over the whole parameter set $\Theta = \Theta_0 \cup \Theta_1$. This changes the test statistic only if $\tilde\Lambda_n \le 0$, which is inessential, because in most cases the critical value will be positive. We study the asymptotic properties of the (log) likelihood ratio statistic
$$\Lambda_n = 2 \log \frac{\sup_{\theta \in \Theta} \prod_{i=1}^n p_\theta(X_i)}{\sup_{\theta \in \Theta_0} \prod_{i=1}^n p_\theta(X_i)} = 2(\tilde\Lambda_n \vee 0).$$
The most important conclusion of this chapter is that, under the null hypothesis, the sequence $\Lambda_n$ is asymptotically chi squared-distributed. The main conditions are that the model is differentiable in $\theta$ and that the null hypothesis $\Theta_0$ and the full parameter set $\Theta$ are (locally) equal to linear spaces.
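A small simulation can illustrate the chi-squared approximation. The sketch below assumes a normal location model N(theta, 1) with null hypothesis theta = 0, chosen here only for illustration; in that model twice the log likelihood ratio reduces to n times the squared sample mean. The names `lr_statistic` and `rejection_rate` are hypothetical.

```python
import random

CHI2_1_UPPER_5PCT = 3.841  # upper 5% point of the chi-square distribution with 1 degree of freedom

def lr_statistic(sample):
    """Twice the log likelihood ratio for N(theta, 1) data and H0: theta = 0.

    The maximum likelihood estimator is the sample mean, so the statistic
    simplifies to n * mean**2.
    """
    n = len(sample)
    mean = sum(sample) / n
    return n * mean ** 2

def rejection_rate(n, reps, seed=0):
    """Fraction of simulated null samples for which the test rejects."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(reps):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if lr_statistic(sample) > CHI2_1_UPPER_5PCT:
            rejections += 1
    return rejections / reps

rate = rejection_rate(n=50, reps=4000)
print(f"rejection rate under the null: {rate:.3f}")
```

The rejection rate should be close to the nominal 5% if the chi-squared critical value is appropriate.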

10 - Bayes Procedures
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 138-152
- Chapter
Summary

In this chapter Bayes estimators are studied from a frequentist perspective. Both posterior measures and Bayes point estimators in smooth parametric models are shown to be asymptotically normal.
Introduction
In Bayesian terminology the distribution $P_{n,\theta}$ of an observation $\vec{X}_n$ under a parameter $\theta$ is viewed as the conditional law of $\vec{X}_n$ given that a random variable $\bar\Theta_n$ is equal to $\theta$. The distribution $\Pi$ of the “random parameter” $\bar\Theta_n$ is called the prior distribution, and the conditional distribution of $\bar\Theta_n$ given $\vec{X}_n$ is the posterior distribution. If $\bar\Theta_n$ possesses a density $\pi$ and $P_{n,\theta}$ admits a density $p_{n,\theta}$ (relative to given dominating measures), then the density of the posterior distribution is given by Bayes’ formula
$$p_{\bar\Theta_n \mid \vec{X}_n = x}(\theta) = \frac{p_{n,\theta}(x)\,\pi(\theta)}{\int p_{n,\theta}(x)\, d\Pi(\theta)}.$$
This expression may define a probability density even if $\pi$ is not a probability density itself. A prior distribution with infinite mass is called improper.
The calculation of the posterior measure can be considered the ultimate aim of a Bayesian analysis. Alternatively, one may wish to obtain a “point estimator” for the parameter $\theta$, using the posterior distribution. The posterior mean is often used for this purpose, but other location estimators are also reasonable.
A choice of point estimator may be motivated by a loss function. The Bayes risk of an estimator $T_n$ relative to the loss function $\ell$ and prior measure $\Pi$ is defined as
$$\int E_\theta\, \ell(T_n - \theta)\, d\Pi(\theta).$$
Here the expectation $E_\theta\, \ell(T_n - \theta)$ is the risk function of $T_n$ in the usual set-up and is identical to the conditional risk $E\bigl(\ell(T_n - \bar\Theta_n) \mid \bar\Theta_n = \theta\bigr)$ in the Bayesian notation. The corresponding Bayes estimator is the estimator $T_n$ that minimizes the Bayes risk. Because the Bayes risk can be written in the form $\int\!\!\int \ell(T_n(x) - \theta)\, p_{n,\theta}(x)\, d\mu(x)\, d\Pi(\theta)$, the value $T_n = T_n(x)$ minimizes, for every fixed $x$, the “posterior risk”
$$E\bigl(\ell(T_n - \bar\Theta_n) \mid \vec{X}_n = x\bigr) = \frac{\int \ell(T_n - \theta)\, p_{n,\theta}(x)\, d\Pi(\theta)}{\int p_{n,\theta}(x)\, d\Pi(\theta)}.$$
Minimizing this expression may again be a well-defined problem even for prior densities of infinite total mass. For the loss function $\ell(y) = \|y\|^2$, the solution $T_n$ is the posterior mean; for absolute loss $\ell(y) = \|y\|$, the solution is the posterior median.
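Bayes' formula and the two point estimators can be illustrated numerically. The sketch below assumes Bernoulli observations with a uniform prior on the success probability, evaluated on a grid; both the model and the names `posterior_on_grid`, `posterior_mean`, and `posterior_median` are illustrative assumptions rather than anything prescribed by the text.

```python
def posterior_on_grid(successes, failures, grid_size=2001):
    """Posterior of a Bernoulli parameter under a uniform prior, on a grid."""
    thetas = [i / (grid_size - 1) for i in range(grid_size)]
    # Bayes' formula: the posterior is proportional to likelihood times prior.
    weights = [t ** successes * (1 - t) ** failures for t in thetas]
    total = sum(weights)
    return thetas, [w / total for w in weights]

def posterior_mean(thetas, probs):
    """Minimizer of the posterior risk under squared error loss."""
    return sum(t * p for t, p in zip(thetas, probs))

def posterior_median(thetas, probs):
    """Minimizer of the posterior risk under absolute error loss (grid version)."""
    cumulative = 0.0
    for t, p in zip(thetas, probs):
        cumulative += p
        if cumulative >= 0.5:
            return t
    return thetas[-1]

thetas, probs = posterior_on_grid(successes=7, failures=3)
mean_estimate = posterior_mean(thetas, probs)
median_estimate = posterior_median(thetas, probs)
print(mean_estimate, median_estimate)
```

With 7 successes and 3 failures the exact posterior is a Beta(8, 4) distribution, whose mean is 2/3; the grid values should be close to that.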

18 - Stochastic Convergence in Metric Spaces
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 255-264
- Chapter
Summary

This chapter extends the concepts of convergence in distribution, in probability, and almost surely from Euclidean spaces to more abstract metric spaces. We are particularly interested in developing the theory for random functions, or stochastic processes, viewed as elements of the metric space of all bounded functions.
Metric and Normed Spaces
In this section we recall some basic topological concepts and introduce a number of examples of metric spaces.
A metric space is a set $\mathbb{D}$ equipped with a metric. A metric or distance function is a map $d: \mathbb{D} \times \mathbb{D} \to [0, \infty)$ with the properties
(i) $d(x, y) = d(y, x)$;
(ii) $d(x, z) \le d(x, y) + d(y, z)$ (triangle inequality);
(iii) $d(x, y) = 0$ if and only if $x = y$.
A semimetric satisfies (i) and (ii), but not necessarily (iii). An open ball is a set of the form $\{y : d(x, y) < r\}$. A subset of a metric space is open if and only if it is the union of open balls; it is closed if and only if its complement is open. A sequence $x_n$ converges to $x$ if and only if $d(x_n, x) \to 0$; this is denoted by $x_n \to x$. The closure $\bar{A}$ of a set $A$ consists of all points that are the limit of a sequence in $A$; it is the smallest closed set containing $A$. The interior $\mathring{A}$ is the collection of all points $x$ such that $x \in G \subset A$ for some open set $G$; it is the largest open set contained in $A$. A function $f: \mathbb{D} \to \mathbb{E}$ between two metric spaces is continuous at a point $x$ if and only if $f(x_n) \to f(x)$ for every sequence $x_n \to x$; it is continuous at every $x$ if and only if the inverse image $f^{-1}(G)$ of every open set $G \subset \mathbb{E}$ is open in $\mathbb{D}$. A subset of a metric space is dense if and only if its closure is the whole space. A metric space is separable if and only if it has a countable dense subset. A subset $K$ of a metric space is compact if and only if it is closed and every sequence in $K$ has a converging subsequence.

Contents
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp vii-xii
- Chapter

22 - L-Statistics
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 316-325
- Chapter
Summary

In this chapter we prove the asymptotic normality of linear combinations of order statistics, particularly those used for robust estimation or testing, such as trimmed means. We present two methods: The projection method presumes knowledge of Chapter 11 only; the second method is based on the functional delta method of Chapter 20.
Introduction
Let $X_{n(1)}, \ldots, X_{n(n)}$ be the order statistics of a sample of real-valued random variables. A linear combination of (transformed) order statistics, or L-statistic, is a statistic of the form
$$\sum_{i=1}^n c_{ni}\, a(X_{n(i)}).$$
The coefficients $c_{ni}$ are a triangular array of constants and $a$ is some fixed function. This “score function” can without much loss of generality be taken equal to the identity function, for an L-statistic with monotone function $a$ can be viewed as a linear combination of the order statistics of the variables $a(X_1), \ldots, a(X_n)$, and an L-statistic with a function $a$ of bounded variation can be dealt with similarly, by splitting the L-statistic into two parts.
22.1 Example (Trimmed and Winsorized means). The simplest example of an L-statistic is the sample mean. More interesting are the $\alpha$-trimmed means
$$\frac{1}{n - 2\lfloor \alpha n \rfloor} \sum_{i = \lfloor \alpha n \rfloor + 1}^{n - \lfloor \alpha n \rfloor} X_{n(i)}$$
and the $\alpha$-Winsorized means
$$\frac{1}{n} \Bigl[ \lfloor \alpha n \rfloor X_{n(\lfloor \alpha n \rfloor)} + \sum_{i = \lfloor \alpha n \rfloor + 1}^{n - \lfloor \alpha n \rfloor} X_{n(i)} + \lfloor \alpha n \rfloor X_{n(n - \lfloor \alpha n \rfloor + 1)} \Bigr].$$
The $\alpha$-trimmed mean is the average of the middle $(1 - 2\alpha)$-th fraction of the observations; the $\alpha$-Winsorized mean replaces the $\alpha$-th fractions of smallest and largest data by $X_{n(\lfloor \alpha n \rfloor)}$ and $X_{n(n - \lfloor \alpha n \rfloor + 1)}$, respectively, and next takes the average. Both estimators were already used in the early days of statistics as location estimators in situations in which the data were suspected to contain outliers. Their properties were studied systematically in the context of robust estimation in the 1960s and 1970s. The estimators were shown to have good properties in situations in which the data follows a heavier tailed distribution than the normal one. Figure 22.1 shows the asymptotic variances of the trimmed means as a function of $\alpha$ for four distributions. (A formula for the asymptotic variance is given in Example 22.11.) The four graphs suggest that 10% to 15% trimming may give an improvement over the sample mean in some cases and does not cost much even for the normal distribution.
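The two estimators are easy to compute from the sorted sample. The sketch below is one possible implementation; it assumes that the k = floor(alpha * n) smallest observations are replaced by the k-th order statistic and the k largest by the (n - k + 1)-th, which is an index convention assumed here (other texts replace them by the nearest retained observation). The function names and the data are hypothetical.

```python
import math

def trimmed_mean(data, alpha):
    """Average of the order statistics with 1-based indices k+1, ..., n-k, k = floor(alpha*n)."""
    xs = sorted(data)
    n = len(xs)
    k = math.floor(alpha * n)
    middle = xs[k:n - k]
    return sum(middle) / len(middle)

def winsorized_mean(data, alpha):
    """Mean after replacing the k smallest and k largest observations.

    Assumed convention: the k smallest become the k-th order statistic and
    the k largest become the (n-k+1)-th order statistic (1-based indices).
    """
    xs = sorted(data)
    n = len(xs)
    k = math.floor(alpha * n)
    if k == 0:
        return sum(xs) / n
    low, high = xs[k - 1], xs[n - k]
    return (k * low + sum(xs[k:n - k]) + k * high) / n

# Hypothetical sample with one gross outlier.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 100]
print(sum(data) / len(data), trimmed_mean(data, 0.2), winsorized_mean(data, 0.2))
```

The single outlier pulls the sample mean far from the bulk of the data, whereas both robust estimators stay in the middle of the sample.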

Preface
- By A.W. van der Vaart
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp xiii-xiv
- Chapter
Summary

This book grew out of courses that I gave at various places, including a graduate course in the Statistics Department of Texas A&M University, Master's level courses for mathematics students specializing in statistics at the Vrije Universiteit Amsterdam, a course in the DEA program (graduate level) of Université de Paris-Sud, and courses in the Dutch AIO-netwerk (graduate level).
The mathematical level is mixed. Some parts I have used for second year courses for mathematics students (but they find it tough), other parts I would only recommend for a graduate program. The text is written both for students who know about the technical details of measure theory and probability, but little about statistics, and vice versa. This requires brief explanations of statistical methodology, for instance of what a rank test or the bootstrap is about, and there are similar excursions to introduce mathematical details. Familiarity with (higher-dimensional) calculus is necessary in all of the manuscript. Metric and normed spaces are briefly introduced in Chapter 18, when these concepts become necessary for Chapters 19, 20, 21 and 22, but I do not expect that this would be enough as a first introduction. For Chapter 25 basic knowledge of Hilbert spaces is extremely helpful, although the bare essentials are summarized at the beginning. Measure theory is implicitly assumed in the whole manuscript but can at most places be avoided by skipping proofs, by ignoring the word “measurable” or with a bit of handwaving. Because we deal mostly with i.i.d. observations, the simplest limit theorems from probability theory suffice. These are derived in Chapter 2, but prior exposure is helpful.
Sections, results or proofs that are preceded by asterisks are either of secondary importance or are out of line with the natural order of the chapters. As the chart in Figure 0.1 shows, many of the chapters are independent from one another, and the book can be used for several different courses.
A unifying theme is approximation by a limit experiment. The full theory is not developed (another writing project is on its way), but the material is limited to the “weak topology” on experiments, which in 90% of the book is exemplified by the case of smooth parameters of the distribution of i.i.d. observations.

8 - Efficiency of Estimators
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 108-124
- Chapter
Summary

One purpose of asymptotic statistics is to compare the performance of estimators for large sample sizes. This chapter discusses asymptotic lower bounds for estimation in locally asymptotically normal models. These show, among others, in what sense maximum likelihood estimators are asymptotically efficient.
Asymptotic Concentration
Suppose the problem is to estimate $\psi(\theta)$ based on observations from a model governed by the parameter $\theta$. What is the best asymptotic performance of an estimator sequence $T_n$ for $\psi(\theta)$?
To simplify the situation, we shall in most of this chapter assume that the sequence $\sqrt{n}(T_n - \psi(\theta))$ converges in distribution under every possible value of $\theta$. Next we rephrase the question as: What are the best possible limit distributions? In analogy with the Cramér-Rao theorem a “best” limit distribution is referred to as an asymptotic lower bound. Under certain restrictions the normal distribution with mean zero and covariance the inverse Fisher information is an asymptotic lower bound for estimating $\psi(\theta) = \theta$ in a smooth parametric model. This is the main result of this chapter, but it needs to be qualified.
The notion of a “best” limit distribution is understood in terms of concentration. If the limit distribution is a priori assumed to be normal, then this is usually translated into asymptotic unbiasedness and minimum variance. The statement that $\sqrt{n}(T_n - \psi(\theta))$ converges in distribution to a $N(\mu(\theta), \sigma^2(\theta))$-distribution can be roughly understood in the sense that eventually $T_n$ is approximately normally distributed with mean and variance given by
$$\psi(\theta) + \frac{\mu(\theta)}{\sqrt{n}} \qquad \text{and} \qquad \frac{\sigma^2(\theta)}{n}.$$
Because $T_n$ is meant to estimate $\psi(\theta)$, optimal choices for the asymptotic mean and variance are $\mu(\theta) = 0$ and variance $\sigma^2(\theta)$ as small as possible. These choices ensure not only that the asymptotic mean square error is small but also that the limit distribution is maximally concentrated near zero. For instance, the probability of the interval $(-a, a)$ is maximized by choosing $\sigma^2(\theta)$ minimal.
We do not wish to assume a priori that the estimators are asymptotically normal. That normal limits are best will actually be an interesting conclusion.
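The remark about concentration for normal limits can be checked directly: for a centered normal distribution the probability of an interval (-a, a) is an explicit function of the standard deviation. The sketch below evaluates it with the error function; the helper name `central_probability` is hypothetical.

```python
import math

def central_probability(a, sigma):
    """P(-a < Z < a) for Z normally distributed with mean 0 and variance sigma**2."""
    return math.erf(a / (sigma * math.sqrt(2.0)))

# The probability of (-1, 1) shrinks as the standard deviation grows.
for sigma in (0.5, 1.0, 2.0):
    print(sigma, round(central_probability(1.0, sigma), 4))
```

The printed probabilities decrease as sigma increases, which is the sense in which a smaller asymptotic variance gives a more concentrated limit.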

Dedication
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp v-vi
- Chapter

13 - Rank, Sign, and Permutation Statistics
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 173-191
- Chapter
Summary

Statistics that depend on the observations only through their ranks can be used to test hypotheses on departures from the null hypothesis that the observations are identically distributed. Such rank statistics are attractive, because they are distribution-free under the null hypothesis and need not be less efficient, asymptotically. In the case of a sample from a symmetric distribution, statistics based on the ranks of the absolute values and the signs of the observations have a similar property. Rank statistics are a special example of permutation statistics.
Rank Statistics
The order statistics $X_{N(1)} \le X_{N(2)} \le \cdots \le X_{N(N)}$ of a set of real-valued observations $X_1, \ldots, X_N$ are the values of the observations positioned in increasing order. The rank $R_{Ni}$ of $X_i$ among $X_1, \ldots, X_N$ is its position number in the order statistics. More precisely, if $X_1, \ldots, X_N$ are all different, then $R_{Ni}$ is defined by the equation
$$X_i = X_{N(R_{Ni})}.$$
If $X_i$ is tied with some other observations, this definition is invalid. Then the rank $R_{Ni}$ is defined as the average of all indices $j$ such that $X_i = X_{N(j)}$ (sometimes called the midrank), or alternatively as $\sum_{j=1}^N 1\{X_j \le X_i\}$ (which is something like an uprank).
In this section it is assumed that the random variables $X_1, \ldots, X_N$ have continuous distribution functions, so that ties in the observations occur with probability zero. We shall neglect the latter null set. The ranks and order statistics are written with double subscripts, because $N$ varies and we shall consider order statistics of samples of different sizes. The vectors of order statistics and ranks are abbreviated to $X_{N()}$ and $R_N$, respectively.
A rank statistic is any function of the ranks. A linear rank statistic is a rank statistic of the special form $\sum_{i=1}^N a_N(i, R_{Ni})$ for a given $N \times N$ matrix $(a_N(i, j))$.
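The rank definitions translate into a few lines of code. The sketch below computes midranks and upranks and evaluates a linear rank statistic from a score matrix; the function names, the data, and the Wilcoxon-type score matrix are hypothetical illustrations.

```python
def midranks(xs):
    """Ranks in which tied observations receive the average of their positions."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and xs[order[end + 1]] == xs[order[start]]:
            end += 1
        average = (start + 1 + end + 1) / 2  # mean of the 1-based positions start+1, ..., end+1
        for pos in range(start, end + 1):
            ranks[order[pos]] = average
        start = end + 1
    return ranks

def upranks(xs):
    """Number of observations less than or equal to each observation."""
    return [sum(1 for y in xs if y <= x) for x in xs]

def linear_rank_statistic(scores, ranks):
    """Sum over i of scores[i][R_i - 1] for integer ranks R_i (no ties)."""
    return sum(scores[i][r - 1] for i, r in enumerate(ranks))

tied = [3.1, 1.2, 3.1, 0.5]
print(midranks(tied), upranks(tied))

# Without ties the upranks are the ordinary ranks.
distinct = [3.1, 1.2, 4.7, 0.5]
ranks = upranks(distinct)
# Hypothetical Wilcoxon-type scores: sum the ranks of the last two observations.
N = len(distinct)
scores = [[(j + 1) if i >= 2 else 0 for j in range(N)] for i in range(N)]
print(ranks, linear_rank_statistic(scores, ranks))
```

In the second example the statistic adds the ranks of the third and fourth observations, which is the form of a two-sample rank sum.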

Frontmatter
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp i-iv
- Chapter

1 - Introduction
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 1-4
- Chapter
Summary

Why asymptotic statistics? The use of asymptotic approximations is twofold. First, they enable us to find approximate tests and confidence regions. Second, approximations can be used theoretically to study the quality (efficiency) of statistical procedures.
Approximate Statistical Procedures
To carry out a statistical test, we need to know the critical value for the test statistic. In most cases this means that we must know the distribution of the test statistic under the null hypothesis. Sometimes this is known exactly, but more often only approximations are available. This may be because the distribution of the statistic is analytically intractable, or perhaps the postulated statistical model is considered only an approximation of the true underlying distributions. In both cases the use of an approximate critical value may be fully satisfactory for practical purposes.
Consider for instance the classical t-test for location. Given a sample of independent observations $X_1, \ldots, X_n$, we wish to test a null hypothesis concerning the mean $\mu = EX$. The t-test is based on the quotient of the sample mean $\bar{X}_n$ and the sample standard deviation $S_n$. If the observations arise from a normal distribution with mean $\mu_0$, then the distribution of $\sqrt{n}(\bar{X}_n - \mu_0)/S_n$ is known exactly: It is a t-distribution with $n - 1$ degrees of freedom. However, we may have doubts regarding the normality, or we might even believe in a completely different model. If the number of observations is not too small, this does not matter too much. Then we may act as if $\sqrt{n}(\bar{X}_n - \mu_0)/S_n$ possesses a standard normal distribution. The theoretical justification is the limiting result, as $n \to \infty$,
$$\sup_x \Bigl| P_\mu\Bigl( \frac{\sqrt{n}(\bar{X}_n - \mu)}{S_n} \le x \Bigr) - \Phi(x) \Bigr| \to 0,$$
provided the variables $X_i$ have a finite second moment. This variation on the central limit theorem is proved in the next chapter. A “large sample” level $\alpha$ test is to reject $H_0: \mu = \mu_0$ if $|\sqrt{n}(\bar{X}_n - \mu_0)/S_n|$ exceeds the upper $\alpha/2$ quantile of the standard normal distribution. Table 1.1 gives the significance level of this test if the observations are either normally or exponentially distributed, and
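The quality of the normal approximation can be explored by simulation. The sketch below estimates the actual level of the large-sample test for normal and for exponential observations; the nominal level of 5%, the sample size, and the number of replications are assumptions chosen for illustration and are not the settings of Table 1.1. The function names are hypothetical.

```python
import math
import random

Z_UPPER_2_5PCT = 1.96  # upper 2.5% point of the standard normal distribution

def t_statistic(sample, mu0):
    """sqrt(n) * (sample mean - mu0) / sample standard deviation."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    return math.sqrt(n) * (mean - mu0) / math.sqrt(var)

def simulated_level(draw, mu0, n, reps, seed=0):
    """Fraction of samples drawn under the null for which the test rejects."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(reps):
        sample = [draw(rng) for _ in range(n)]
        if abs(t_statistic(sample, mu0)) > Z_UPPER_2_5PCT:
            rejections += 1
    return rejections / reps

level_normal = simulated_level(lambda rng: rng.gauss(0.0, 1.0), 0.0, n=50, reps=4000)
level_expo = simulated_level(lambda rng: rng.expovariate(1.0), 1.0, n=50, reps=4000)
print(level_normal, level_expo)
```

Both simulated levels should be in the neighborhood of the nominal 5%, with the skewed exponential case typically somewhat further away.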

3 - Delta Method
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 25-34
- Chapter
Summary

The delta method consists of using a Taylor expansion to approximate a random vector of the form $\phi(T_n)$ by the polynomial $\phi(\theta) + \phi'(\theta)(T_n - \theta) + \cdots$ in $T_n - \theta$. It is a simple but useful method to deduce the limit law of $\phi(T_n) - \phi(\theta)$ from that of $T_n - \theta$. Applications include the nonrobustness of the chi-square test for normal variances and variance stabilizing transformations.
Basic Result
Suppose an estimator $T_n$ for a parameter $\theta$ is available, but the quantity of interest is $\phi(\theta)$ for some known function $\phi$. A natural estimator is $\phi(T_n)$. How do the asymptotic properties of $\phi(T_n)$ follow from those of $T_n$?
A first result is an immediate consequence of the continuous-mapping theorem. If the sequence $T_n$ converges in probability to $\theta$ and $\phi$ is continuous at $\theta$, then $\phi(T_n)$ converges in probability to $\phi(\theta)$.
Of greater interest is a similar question concerning limit distributions. In particular, if $\sqrt{n}(T_n - \theta)$ converges weakly to a limit distribution, is the same true for $\sqrt{n}(\phi(T_n) - \phi(\theta))$? If $\phi$ is differentiable, then the answer is affirmative. Informally, we have
$$\sqrt{n}\bigl(\phi(T_n) - \phi(\theta)\bigr) \approx \phi'(\theta)\, \sqrt{n}(T_n - \theta).$$
If $\sqrt{n}(T_n - \theta) \rightsquigarrow T$ for some variable $T$, then we expect that $\sqrt{n}(\phi(T_n) - \phi(\theta)) \rightsquigarrow \phi'(\theta)\, T$. In particular, if $\sqrt{n}(T_n - \theta)$ is asymptotically normal $N(0, \sigma^2)$, then we expect that $\sqrt{n}(\phi(T_n) - \phi(\theta))$ is asymptotically normal $N(0, \phi'(\theta)^2 \sigma^2)$. This is proved in greater generality in the following theorem.
In the preceding paragraph it is silently understood that $T_n$ is real-valued, but we are more interested in considering statistics $\phi(T_n)$ that are formed out of several more basic statistics. Consider the situation that $T_n = (T_{n,1}, \ldots, T_{n,k})$ is vector-valued, and that $\phi: \mathbb{R}^k \to \mathbb{R}^m$ is a given function defined at least on a neighbourhood of $\theta$. Recall that $\phi$ is differentiable at $\theta$ if there exists a linear map (matrix) $\phi'_\theta: \mathbb{R}^k \to \mathbb{R}^m$ such that
$$\phi(\theta + h) - \phi(\theta) = \phi'_\theta(h) + o(\|h\|), \qquad h \to 0.$$
All the expressions in this equation are vectors of length $m$, and $\|h\|$ is the Euclidean norm. The linear map $h \mapsto \phi'_\theta(h)$ is sometimes called a “total derivative,” as opposed to partial derivatives.
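The one-dimensional heuristic can be checked by simulation. The sketch below assumes that the estimator is the mean of normal observations and that the transformation is the square, both chosen purely for illustration; it compares the simulated variance of the rescaled, transformed estimator with the variance suggested by the delta method. The function name is hypothetical.

```python
import math
import random

def simulate_delta_method(theta, sigma, n, reps, seed=0):
    """Empirical variance of sqrt(n) * (T_n**2 - theta**2), T_n a normal sample mean."""
    rng = random.Random(seed)
    values = []
    for _ in range(reps):
        t_n = sum(rng.gauss(theta, sigma) for _ in range(n)) / n
        values.append(math.sqrt(n) * (t_n ** 2 - theta ** 2))
    mean = sum(values) / reps
    return sum((v - mean) ** 2 for v in values) / (reps - 1)

theta, sigma = 2.0, 1.0
empirical_variance = simulate_delta_method(theta, sigma, n=100, reps=3000)
# Delta method: derivative of x**2 at theta is 2*theta, so the variance is (2*theta)**2 * sigma**2.
delta_variance = (2 * theta) ** 2 * sigma ** 2
print(empirical_variance, delta_variance)
```

The simulated variance should be close to 16, the value obtained from the derivative of the square function at 2.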

23 - Bootstrap
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 326-340
- Chapter
Summary

This chapter investigates the asymptotic properties of bootstrap estimators for distributions and confidence intervals. The consistency of the bootstrap for the sample mean implies the consistency for many other statistics by the delta method. A similar result is valid with the empirical process.
Introduction
In most estimation problems it is important to give an indication of the precision of a given estimate. A simple method is to provide an estimate of the bias and variance of the estimator; more accurate is a confidence interval for the parameter. In this chapter we concentrate on bootstrap confidence intervals and, more generally, discuss the bootstrap as a method of estimating the distribution of a given statistic.
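Let $\hat\theta_n$ be an estimator of some parameter $\theta$ attached to the distribution $P$ of the observations. The distribution of the difference $\hat\theta_n - \theta$ contains all the information needed for assessing the precision of $\hat\theta_n$. In particular, if $\xi_\alpha$ is the upper $\alpha$-quantile of the distribution of $(\hat\theta_n - \theta)/\hat\sigma_n$, then
$$P\bigl(\hat\theta_n - \xi_\beta \hat\sigma_n \le \theta \le \hat\theta_n - \xi_{1-\alpha} \hat\sigma_n \mid P\bigr) \ge 1 - \beta - \alpha.$$
Here $\hat\sigma_n$ may be arbitrary, but it is typically an estimate of the standard deviation of $\hat\theta_n$. It follows that the interval $[\hat\theta_n - \xi_\beta \hat\sigma_n, \hat\theta_n - \xi_{1-\alpha} \hat\sigma_n]$ is a confidence interval of level $1 - \beta - \alpha$. Unfortunately, in most situations the quantiles and the distribution of $\hat\theta_n - \theta$ depend on the unknown distribution $P$ of the observations and cannot be used to assess the performance of $\hat\theta_n$. They must be replaced by estimators.
If the sequence $(\hat\theta_n - \theta)/\hat\sigma_n$ tends in distribution to a standard normal variable, then the normal $N(0, \hat\sigma_n^2)$-distribution can be used as an estimator of the distribution of $\hat\theta_n - \theta$, and we can substitute the standard normal quantiles $z_\alpha$ for the quantiles $\xi_\alpha$. The weak convergence implies that the interval $[\hat\theta_n - z_\beta \hat\sigma_n, \hat\theta_n - z_{1-\alpha} \hat\sigma_n]$ is a confidence interval of asymptotic level $1 - \alpha - \beta$.
Bootstrap procedures yield an alternative. They are based on an estimate $\hat{P}$ of the underlying distribution $P$ of the observations. The distribution of $(\hat\theta_n - \theta)/\hat\sigma_n$ under $P$ can, in principle, be written as a function of $P$. The bootstrap estimator for this distribution is the “plug-in” estimator obtained by substituting $\hat{P}$ for $P$ in this function.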
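The plug-in idea can be sketched for the sample mean, taking the empirical distribution as the estimate of the underlying distribution and approximating the bootstrap distribution by Monte Carlo resampling. This is a minimal illustration under those assumptions; the function names, the exponential test data, and the number of resamples are hypothetical choices.

```python
import math
import random

def mean_and_se(sample):
    """Sample mean and its estimated standard error."""
    n = len(sample)
    mean = sum(sample) / n
    var = sum((x - mean) ** 2 for x in sample) / (n - 1)
    return mean, math.sqrt(var / n)

def upper_quantile(values, alpha):
    """Empirical upper alpha-quantile: about a fraction alpha of the values exceeds it."""
    ordered = sorted(values)
    index = min(len(ordered) - 1, max(0, math.ceil((1 - alpha) * len(ordered)) - 1))
    return ordered[index]

def bootstrap_t_interval(sample, alpha, beta, reps=2000, seed=0):
    """Interval [theta_hat - xi_beta * se, theta_hat - xi_(1-alpha) * se] with bootstrap quantiles."""
    rng = random.Random(seed)
    theta_hat, sigma_hat = mean_and_se(sample)
    roots = []
    for _ in range(reps):
        # Resample from the empirical distribution of the observations.
        resample = [rng.choice(sample) for _ in sample]
        theta_star, sigma_star = mean_and_se(resample)
        if sigma_star > 0:
            roots.append((theta_star - theta_hat) / sigma_star)
    xi_beta = upper_quantile(roots, beta)
    xi_one_minus_alpha = upper_quantile(roots, 1 - alpha)
    return theta_hat - xi_beta * sigma_hat, theta_hat - xi_one_minus_alpha * sigma_hat

data_rng = random.Random(1)
sample = [data_rng.expovariate(1.0) for _ in range(40)]
lower, upper = bootstrap_t_interval(sample, alpha=0.025, beta=0.025)
print(lower, mean_and_se(sample)[0], upper)
```

Note that the upper quantile of the bootstrap distribution determines the lower endpoint and vice versa, mirroring the displayed probability statement.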

Index
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 439-443
- Chapter

20 - Functional Delta Method
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 291-303
- Chapter
Summary

The delta method was introduced in Chapter 3 as an easy way to turn the weak convergence of a sequence of random vectors $r_n(T_n - \theta)$ into the weak convergence of transformations of the type $r_n(\phi(T_n) - \phi(\theta))$. It is useful to apply a similar technique in combination with the more powerful convergence of stochastic processes. In this chapter we consider the delta method at two levels. The first section is of a heuristic character and limited to the case that $T_n$ is the empirical distribution. The second section establishes the delta method rigorously and in general, completely parallel to the delta method for $\mathbb{R}^k$, for Hadamard differentiable maps between normed spaces.
von Mises Calculus
Let $\mathbb{P}_n$ be the empirical distribution of a random sample $X_1, \ldots, X_n$ from a distribution $P$. Many statistics can be written in the form $\phi(\mathbb{P}_n)$, where $\phi$ is a function that maps every distribution of interest into some space, which for simplicity is taken equal to the real line. Because the observations can be regained from $\mathbb{P}_n$ completely (unless there are ties), any statistic can be expressed in the empirical distribution. The special structure assumed here is that the statistic can be written as a fixed function $\phi$ of $\mathbb{P}_n$, independent of $n$, a strong assumption.
Because $\mathbb{P}_n$ converges to $P$ as $n$ tends to infinity, we may hope to find the asymptotic behavior of $\phi(\mathbb{P}_n) - \phi(P)$ through a differential analysis of $\phi$ in a neighborhood of $P$. A first-order analysis would have the form
$$\phi(\mathbb{P}_n) - \phi(P) = \phi'_P(\mathbb{P}_n - P) + \cdots,$$
where $\phi'_P$ is a “derivative” and the remainder is hopefully negligible. The simplest approach towards defining a derivative is to consider the function $t \mapsto \phi(P + tH)$ for a fixed perturbation $H$ and as a function of the real-valued argument $t$. If $\phi$ takes its values in $\mathbb{R}$, then this function is just a function from the reals to the reals.

19 - Empirical Processes
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 265-290
- Chapter
Summary

The empirical distribution of a random sample is the uniform discrete measure on the observations. In this chapter, we study the convergence of this measure and in particular the convergence of the corresponding distribution function. This leads to laws of large numbers and central limit theorems that are uniform in classes of functions. We also discuss a number of applications of these results.
Empirical Distribution Functions
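Let $X_1, \ldots, X_n$ be a random sample from a distribution function $F$ on the real line. The empirical distribution function is defined as
$$\mathbb{F}_n(t) = \frac{1}{n} \sum_{i=1}^n 1\{X_i \le t\}.$$
It is the natural estimator for the underlying distribution $F$ if this is completely unknown. Because $n\mathbb{F}_n(t)$ is binomially distributed with mean $nF(t)$, this estimator is unbiased. By the law of large numbers it is also consistent,
$$\mathbb{F}_n(t) \to F(t) \quad \text{almost surely, for every } t.$$
By the central limit theorem it is asymptotically normal,
$$\sqrt{n}\bigl(\mathbb{F}_n(t) - F(t)\bigr) \rightsquigarrow N\bigl(0, F(t)(1 - F(t))\bigr).$$
In this chapter we improve on these results by considering $t \mapsto \mathbb{F}_n(t)$ as a random function, rather than as a real-valued estimator for each $t$ separately. This is of interest on its own account but also provides a useful starting tool for the asymptotic analysis of other statistics, such as quantiles, rank statistics, or trimmed means.
The Glivenko-Cantelli theorem extends the law of large numbers and gives uniform convergence. The uniform distance $\|\mathbb{F}_n - F\|_\infty = \sup_t |\mathbb{F}_n(t) - F(t)|$ is known as the Kolmogorov-Smirnov statistic.
19.1 Theorem (Glivenko-Cantelli). If $X_1, X_2, \ldots$ are i.i.d. random variables with distribution function $F$, then $\|\mathbb{F}_n - F\|_\infty \to 0$ almost surely.
Proof. By the strong law of large numbers, both $\mathbb{F}_n(t) \to F(t)$ and $\mathbb{F}_n(t-) \to F(t-)$ almost surely, for every $t$. Given a fixed $\varepsilon > 0$, there exists a partition $-\infty = t_0 < t_1 < \cdots < t_k = \infty$ such that $F(t_i-) - F(t_{i-1}) < \varepsilon$ for every $i$. (Points at which $F$ jumps more than $\varepsilon$ are points of the partition.) Now, for $t_{i-1} \le t < t_i$,
$$\mathbb{F}_n(t) - F(t) \le \mathbb{F}_n(t_i-) - F(t_i-) + \varepsilon,$$
$$\mathbb{F}_n(t) - F(t) \ge \mathbb{F}_n(t_{i-1}) - F(t_{i-1}) - \varepsilon.$$
The convergence of $\mathbb{F}_n(t)$ and $\mathbb{F}_n(t-)$ for every fixed $t$ is certainly uniform for $t$ in the finite set $\{t_1, \ldots, t_{k-1}\}$. Conclude that $\limsup \|\mathbb{F}_n - F\|_\infty \le \varepsilon$, almost surely. This is true for every $\varepsilon > 0$ and hence the limit superior is zero.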
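The uniform convergence can be observed numerically. The sketch below builds the empirical distribution function and computes the uniform distance to a known continuous distribution function, using the fact that the supremum is attained at the jump points of the empirical distribution function. The uniform distribution on [0, 1] and all function names are assumptions made for this illustration.

```python
import bisect
import random

def ecdf(sample):
    """Return the empirical distribution function t -> (1/n) * #{i : x_i <= t}."""
    ordered = sorted(sample)
    n = len(ordered)
    def F_n(t):
        return bisect.bisect_right(ordered, t) / n
    return F_n

def kolmogorov_smirnov_distance(sample, cdf):
    """sup_t |F_n(t) - F(t)| for a continuous distribution function F."""
    ordered = sorted(sample)
    n = len(ordered)
    # At the i-th order statistic F_n jumps from i/n to (i+1)/n (0-based i).
    return max(
        max((i + 1) / n - cdf(x), cdf(x) - i / n)
        for i, x in enumerate(ordered)
    )

def uniform_cdf(t):
    """Distribution function of the uniform distribution on [0, 1]."""
    return min(1.0, max(0.0, t))

rng = random.Random(0)
distances = {}
for n in (100, 1000, 10000):
    sample = [rng.random() for _ in range(n)]
    distances[n] = kolmogorov_smirnov_distance(sample, uniform_cdf)
    print(n, round(distances[n], 4))
```

The printed distances should shrink as the sample size grows, in line with the theorem.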

References
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 433-438
- Chapter

14 - Relative Efficiency of Tests
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 192-214
- Chapter
Summary

The quality of sequences of tests can be judged from their power at alternatives that become closer and closer to the null hypothesis. This motivates the study of local asymptotic power functions. The relative efficiency of two sequences of tests is the quotient of the numbers of observations needed with the two tests to obtain the same level and power. We discuss several types of asymptotic relative efficiencies.
Asymptotic Power Functions
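Consider the problem of testing a null hypothesis $H_0: \theta \in \Theta_0$ versus the alternative $H_1: \theta \in \Theta_1$. The power function of a test that rejects the null hypothesis if a test statistic $T_n$ falls into a critical region $K_n$ is the function $\theta \mapsto \pi_n(\theta) = P_\theta(T_n \in K_n)$, which gives the probability of rejecting the null hypothesis. The test is of level $\alpha$ if its size $\sup\{\pi_n(\theta) : \theta \in \Theta_0\}$ does not exceed $\alpha$. A sequence of tests is called asymptotically of level $\alpha$ if
$$\limsup_{n \to \infty} \sup_{\theta \in \Theta_0} \pi_n(\theta) \le \alpha.$$
(An alternative definition is to drop the supremum and require only that $\limsup \pi_n(\theta) \le \alpha$ for every $\theta \in \Theta_0$.) A test with power function $\pi_n$ is better than a test with power function $\tilde\pi_n$ if both
$$\pi_n(\theta) \le \tilde\pi_n(\theta), \quad \theta \in \Theta_0, \qquad \text{and} \qquad \pi_n(\theta) \ge \tilde\pi_n(\theta), \quad \theta \in \Theta_1.$$
The aim of this chapter is to compare tests asymptotically. We consider sequences of tests with power functions $\pi_n$ and $\tilde\pi_n$ and wish to decide which of the sequences is best as $n \to \infty$. Typically, the tests corresponding to a sequence $\pi_1, \pi_2, \ldots$ are of the same type. For instance, they are all based on a certain U-statistic or rank statistic, and only the number of observations changes with $n$. Otherwise the comparison would have little relevance. A first idea is to consider limiting power functions of the form
$$\pi(\theta) = \lim_{n \to \infty} \pi_n(\theta).$$
If this limit exists for all $\theta$, and the same is true for the competing tests $\tilde\pi_n$, then the sequence $\pi_n$ is better than the sequence $\tilde\pi_n$ if the limiting power function $\pi$ is better than the limiting power function $\tilde\pi$. It turns out that this approach is too naive. The limiting power functions typically exist, but they are trivial and identical for all reasonable sequences of tests.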
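The triviality of the pointwise limit can be seen in a simple case. The sketch below assumes a one-sided test for the mean of N(theta, 1) observations that rejects when the square root of n times the sample mean exceeds the normal critical value, an example chosen here for illustration; its power function has a closed form. The function names are hypothetical.

```python
import math

Z_ALPHA = 1.645  # upper 5% point of the standard normal distribution

def normal_cdf(x):
    """Standard normal distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def power(theta, n):
    """Power of the one-sided test at a fixed alternative theta."""
    return 1.0 - normal_cdf(Z_ALPHA - math.sqrt(n) * theta)

def local_power(h, n):
    """Power at the shrinking alternative theta = h / sqrt(n)."""
    return power(h / math.sqrt(n), n)

for n in (10, 100, 10000):
    print(n, round(power(-0.5, n), 4), round(power(0.0, n), 4),
          round(power(0.5, n), 4), round(local_power(1.0, n), 4))
```

At fixed parameter values the power tends to 0 below the null value and to 1 above it, which carries no information for comparing tests, whereas along alternatives shrinking at rate one over the square root of n the power stays at a nontrivial value.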

15 - Efficiency of Tests
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 215-226
- Chapter
Summary

It is shown that, given converging experiments, every limiting power function is the power function of a test in the limit experiment. Thus, uniformly most powerful tests in the limit experiment give absolute upper bounds for the power of a sequence of tests. In normal experiments such uniformly most powerful tests exist for linear hypotheses of codimension one. The one-sample location problem and the two-sample problem are discussed in detail, and appropriately designed (signed) rank tests are shown to be asymptotically optimal.
Asymptotic Representation Theorem
A randomized test (or test function) $\phi$ in an experiment $(\mathcal{X}, \mathcal{A}, P_h : h \in H)$ is a measurable map $\phi: \mathcal{X} \to [0, 1]$ on the sample space. The interpretation is that if $x$ is observed, then a null hypothesis is rejected with probability $\phi(x)$. The power function of a test $\phi$ is the function
$$h \mapsto \pi(h) = E_h \phi(X).$$
This gives the probabilities that the null hypothesis is rejected. A test is of level $\alpha$ for testing a null hypothesis $H_0$ if its size $\sup\{\pi(h) : h \in H_0\}$ does not exceed $\alpha$. The quality of a test can be judged from its power function, and classical testing theory is aimed at finding, among the tests of level $\alpha$, a test with high power at every alternative.
The asymptotic quality of a sequence of tests may be judged from the limit of the sequence of local power functions. If the tests are defined in experiments that converge to a limit experiment, then a pointwise limit of power functions is necessarily a power function in the limit experiment. This follows from the following theorem, which specializes the asymptotic representation theorem, Theorem 9.3, to the testing problem. Applied to the special case of the local experiments of a differentiable parametric model as considered in Chapter 7, which converge to the Gaussian experiment $(N(h, I_\theta^{-1}) : h \in \mathbb{R}^k)$, the theorem is the parallel for testing of Theorem 7.10.

2348 results in Statistical theory and methods
