Search results for Statistical theory and methods

7 - Local Asymptotic Normality
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 92-107
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

A sequence of statistical models is “locally asymptotically normal” if, asymptotically, their likelihood ratio processes are similar to those for a normal location parameter. Technically, this is if the likelihood ratio processes admit a certain quadratic expansion. An important example in which this arises is repeated sampling from a smooth parametric model. Local asymptotic normality implies convergence of the models to a Gaussian model after a rescaling of the parameter.
Introduction
Suppose we observe a sample X1,, Xn from a distribution on some measurable space (X, A) indexed by a parameter that ranges over an open subset e Then the full observation is a single observation from the produc of copies of, and the statistical model is completely described as the collection of probability measures on the sample space In the context of the present chapter we shall speak of a statistical experiment, rather than of a statistical model. In this chapter it is shown that many statistical experiments can be approximated by Gaussian experiments after a suitable reparametrization.
The reparametrization is centered around a fixed parameter which should be regarded as known. We define a local parameter, rewrite as and thus obtain an experiment with parameter In this chapter we show that, for large the experiments
are similar in statistical properties, whenever the original experiments are “smooth” in the parameter. The second experiment consists of observing a single observation from a normal distribution with mean and known covariance matrix (equal to the inverse of the Fisher information matrix). This is a simple experiment, which is easy to analyze, whence the approximation yields much information about the asymptotic properties of the original experiments. This information is extracted in several chapters to follow and concerns both asymptotic optimality theory and the behavior of statistical procedures such as the maximum likelihood estimator and the likelihood ratio test.
We have taken the local parameter set equal to which is not correct if the parameter set e is a true subset of IRk.

9 - Limits of Experiments
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 125-137
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

A sequence of experiments is defined to converge to a limit experiment if the sequence of likelihood ratio processes converges marginally in distribution to the likelihood ratio process of the limit experiment. A limit experiment serves as an approximation for the converging sequence of experiments. This generalizes the convergence of locally asymptotically normal sequences of experiments considered in Chapter 7. Several examples of nonnormallimit experiments are discussed.
Introduction
This chapter introduces a notion of convergence of statistical models or “experiments” to a limit experiment. In this notion a sequence of models, rather than just a sequence of estimators or tests, converges to a limit. The limit experiment serves two purposes. First, it provides an absolute standard for what can be achieved asymptotically by a sequence of tests or estimators, in the form of a “lower bound“: No sequence of statistical procedures can be asymptotically better than the “best” procedure in the limit experiment. For instance, the best limiting power function is the best power function in the limit experiment; a best sequence of estimators converges to a best estimator in the limit experiment. Statements of this type are true irrespective of the precise meaning of “best.” A second purpose of a limit experiment is to explain the asymptotic behaviour of sequences of statistical procedures. For instance, the asymptotic normality or (in)efficiency of maximum likelihood estimators.
Many sequences of experiments converge to normal limit experiments. In particular, the local experiments in a given locally asymptotically normal sequence of experiments, as considered in Chapter 7, converge to a normal location experiment. The asymptotic representation theorem given in the present chapter is therefore a generalization of Theorem 7.10 (for the LAN case) to the general situation. The importance of the general concept is illustrated by several examples of non-Gaussian limit experiments.
In the present context it is customary to speak: of “experiment” rather than model, although these terms are interchangeable. Formally an experiment is a measurable space the sample space, equipped with a collection of probability measures. The set of probability measures serves as a statistical model for the observation, written as X. In this chapter the parameter is denoted by (and not because the results are typically applied to “local” parameters (such as).

Notation
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp xv-xvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

4 - Moment Estimators
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 35-40
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The method of moments determines estimators by comparing sample and theoretical moments. Moment estimators are useful for their simplicity, although not always optimal. Maximum likelihood estimators for full exponentialfamilies are moment estimators, and their asymptotic normality can be proved by treating them as such.
Method of Moments
Let X1, … , X n be a sample from a distribution Po that depends on a parameter ranging over some set. The method of moments consists of estimating by the solution of a system of equations
for given functions Thus the parameter is chosen such that the sample moments (on the left side) match the theoretical moments. If the parameter is k-dimensional one usually tries to match k moments in this manner. The choices lead to the method of moments in its simplest form.
Moment estimators are not necessarily the best estimators, but under reasonable conditions they have convergence rate and are asymptotically normal. This is a consequence of the delta method. Write the given functions in the vector notation and let be the vector-valued expectation Then the moment estimator solves the system of equations
For existence of the moment estimator, it is necessary that the vectorbe in the range of the function If is one-to-one, then the moment estimator is uniquely determined as
If is asymptotically normal and is differentiable, then the right side is asymptotically normal by the delta method.
The derivative of at is the inverse of the derivative of eat Because the function is often not explicit, it is convenient to ascertain its differentiability from the differentiability of This is possible by the inverse function theorem. According to this theorem a map that is (continuously) differentiable throughout an open set with nonsingular derivatives is locally one-to-one, is of full rank, and has a differentiable inverse. Thus we obtain the following theorem.

12 - U -Statistics
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 161-172
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

One-sample U -statistics can be regarded as generalizations of means. They are sums of dependent variables, but we show them to be asymptotically normal by the projection method. Certain interesting test statistics, such as the Wilcoxon statistics and Kendall's r-statistic, are one-sample U -statistics. The Wilcoxon statistic for testing a difference in location between two samples is an example of a two-sample U-statistic. The Cramer-von Mises statistic is an example of a degenerate U-statistic.
One-Sample V-Statistics
Let X I, … , Xn be a random sample from an unknown distribution. Given a known function h, consider estimation of the “parameter“
In order to simplify the formulas, it is assumed throughout this section that the function h is permutation symmetric in its r arguments. (A given h could always be replaced by a symmetric one.) The statistic h(XI , •.. , Xr) is an unbiased estimator for (), but it is unnatural, as it uses only the first r observations. A U-statistic with kernel h remedies this; it is defined as
where the sum is taken over the set of all unordered subsets f3 of r different integers chosen from ﹛I, … , n﹜. Because the observations are i.i.d., U is an unbiased estimator for () also. Moreover, U is permutation symmetric in Xl, … Xn, and has smaller variance than h(Xb … , Xr). In fact, if X (1) , •.• , X(n) denote the values Xl, … , Xn stripped from their order (the order statistics in the case of real-valued variables), then
Because a conditional expectation is a projection, and projecting decreases second moments, the variance of the U -statistic U is smaller than the variance of the naive estimator h(XJ, … , Xr).
In this section it is shown that the sequence ./ii(U–0) is asymptotically nonnal under the condition that Eh2(X\, … , Xr ) < 00.
12.1 Example. A U-statistic of degree r = 1 is a mean n-I1::7=\h(Xi ). The asserted asymptotic nonnality is then just the central limit theorem.

5 - M–and Z-Estimators
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 41-84
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter gives an introduction to the consistency and asymptotic normality of M -estimators and Z-estimators. Maximum likelihood estimators are treated as a special case.
Introduction
Suppose that we are interested in a parameter (or “functional“) attached to the distribution ofobservations, …,A popular method for finding an estimator is to maximize a criterion function of the type
Here are known functions. An estimator maximizing over is called an M -estimator. In this chapter we investigate the asymptotic behavior of sequences of M -estimators.
Often the maximizing value is sought by setting a derivative (or the set of partial derivatives in the multidimensional case) equal to zero. Therefore, the name M -estimator is also used for estimators satisfying systems of equations of the type
Here are known vector-valued maps. For instance, if is k-dimensional, thentypically has k coordinate functions and (5.2) is shorthand for the system of equations
Even though in many examples is the partial derivative of some function this is irrelevant for the following. Equations, such as (5.2), defining an estimator are called estimating equations and need not correspond to a maximization problem. In the latter case it is probably better to call the corresponding estimators Z-estimators (for zero), but the use of the name M -estimator is widespread.
Sometimes the maximum of the criterion function Mn is not taken or the estimating equation does not have an exact solution. Then it is natural to use as estimator a value that almost maximizes the criterion function or is a near zero. This yields approximate M-estimators or Z-estimators. Estimators that are sufficiently close to being a point of maximum or a zero often have the same asymptotic behavior.
An operator notation for taking expectations simplifies the formulas in this chapter. We write P for the marginal law of the observations which we assume to be identically distributed. Furthermore, we write for the expectation and abbreviate the average. Thus Pn is the empirical distribution: the (random) discrete distribution that puts mass at every of the observations.

2 - Stochastic Convergence
A. W. van der Vaart, Vrije Universiteit, Amsterdam
Book:

Asymptotic Statistics

Published online:

05 June 2012

Print publication:

13 October 1998, pp 5-24
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This chapter provides a review of basic modes of convergence of sequences of stochastic vectors, in particular convergence in distribution and in probability.
Basic Theory
A random vector in is a vector X = (X1, … , Xk) of real random variables. t The distributionfunction of X is the map
A sequence of random vectors Xn is said to converge in distribution to a random vector X if
for every x at which the limit distribution function is continuous. Alternative names are weak convergence and convergence in law. As the last name suggests, the convergence only depends on the induced laws of the vectors and not on the probability spaces on which they are defined. Weak convergence is denoted by if X has distribution L, or a distribution with a standard code, such as N(O, 1), then also by
Let d (x, y) be a distance function on IRk that generates the usual topology. For instance, the Euclidean distance
A sequence of random variables Xn is said to converge in probability to X if for all
This is denoted by. In this notation convergence in probability is the same as
As we shall see, convergence in probability is stronger than convergence in distribution. An even stronger mode of convergence is almost-sure convergence. The sequence Xn is said to converge almost surely to X if
This is denoted by Xn X. Note that convergence in probability and convergence almost surely only make sense if each of Xn and X are defined on the same probability space. For convergence in distribution this is not necessary.

Preface
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp xvii-xx
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

This book describes regression methods for count data, where the response variable is a nonnegative integer. The methods are relevant for analysis of counts that arise in both social and natural sciences.
Despite their relatively recent origin, count data regression methods build on an impressive body of statistical research on univariate discrete distributions. Many of these methods have now found their way into major statistical packages, which has encouraged their application in a variety of contexts. Such widespread use has itself thrown up numerous interesting research issues and themes, which we explore in this book.
The objective of the book is threefold. First, we wish to provide a synthesis and integrative survey of the literature on count data regressions, covering both the statistical and econometric strands. The former has emphasized the framework of generalized linear models, exponential families of distributions, and generalized estimating equations; the latter has emphasized nonlinear regression and generalized method of moment frameworks. Yet between them there are numerous points of contact that can be fruitfully exploited. Our second objective is to make sophisticated methods of data analysis more accessible to practitioners with different interests and backgrounds. To this end we consider models and methods suitable for cross-section, time series, and longitudinal data. Detailed analyses of several data sets as well as shorter illustrations, implemented from a variety of viewpoints, are scattered throughout the book to put empirical flesh on theoretical or methodological discussion. We draw on examples from, and give references to, works in many applied areas.

12 - Flexible Methods for Counts
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 344-370
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
In this chapter we examine methods for modeling count data that are more flexible than those presented in previous chapters. The focus is on the cross-section case, although some of the methods given here have potential extension to time series, multivariate or longitudinal count data, and treatment of sample selection.
One type of flexible modeling is to specify low-order conditional moments of the dependent variable, rather than the entire distribution. This moment-based approach has already been considered extensively in previous chapters. Here we extend it by considering higher-order moments. The emphasis is on the more difficult question of the most efficient use of the moments, with estimators derived using results on optimal GMM.
The core of the chapter considers two basic types of flexible model. First, we consider a sequence of progressively more flexible parametric models, where the underlying parameters in the sequence are tightly specified, for example, equal to a specified function of a linear combination of regressors and parameters.
Second, we consider models in which part of the distribution or general functional form for the moment is tightly specified, but the remainder is flexibly modeled. For example, the conditional mean may be the exponential of the sum of a linear combination of all but one regressor and a flexible function of the remaining regressor. A second example, in which the conditional mean function is specified but the conditional variance is flexible, has already been considered in earlier chapters but is covered in further depth here.

References
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 379-398
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Subject Index
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 404-411
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

7 - Time Series Data
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 221-250
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
The previous chapters have focused on models for cross-section regression on a single count dependent variable. We now turn to models for more general types of data – univariate time series data in this chapter, multivariate cross-section data in Chapter 8, and longitudinal or panel data in Chapter 9.
Count data introduce complications of discreteness and heteroskedasticity. For cross-section data, this leads to moving from the linear model to the Poisson regression model. This model is often too restrictive for real data, which are typically overdispersed. With cross-section data, overdispersion is most frequently handled by leaving the conditional mean unchanged and rescaling the conditional variance. The same adjustment is made regardless of whether the underlying cause of overdispersion is unobserved heterogeneity in a Poisson point process or true contagion leading to dependence in the process.
For time series count data, one can again begin with the Poisson regression model. In this case, however, it is not clear how to proceed if dependence is present. For example, developing even a pure time series count model in which the count in period t, yt, depends only on the count in the previous period, yt−1, is not straightforward, and there are many possible ways to proceed. Even restricting attention to a fully parametric approach, one can specify distributions for yt either conditional on yt−1 or unconditional on yt−1. For count data this leads to quite different models, whereas for continuous data the assumption of joint normality leads to both conditional and marginal distributions that are also normal.

4 - Generalized Count Regression
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 96-138
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
This chapter deals with departures from the Poisson regression. One reason for the failure of the Poisson regression is unobserved heterogeneity, which contributes additional randomness. Mixture models obtained by averaging with respect to unobserved heterogeneity generally are not Poisson. A second reason is the failure of the Poisson process assumption and its replacement by a more general stochastic process.
Section 4.2 deals with the negative binomial model. One characterization of this is as a Poisson-gamma mixture. In Section 4.3 we examine the relation between waiting times and counts introduced in Chapter 1. Section 4.4 considers flexible functional forms which are alternatives to the Poisson. Sections 4.5 and 4.6 consider the case in which the range of observed counts is further restricted by either truncation or censoring. Section 4.7 considers an empirically important class of hurdle models that give a special treatment to zero counts. This class combines elements both of truncation and mixtures. Section 4.8 provides a detailed treatment of the finite mixture latent class model that is empirically implemented in Chapter 6. Section 4.9 gives an introduction to estimation by simulation. In the remainder of this section we summarize the motivation underlying the models analyzed in this chapter.
The leading motivation for considering parametric distributions other than the Poisson is that they have the potential to accommodate features of data that are inconsistent with the Poisson assumption.

Author Index
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 399-403
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

3 - Basic Count Regression
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 59-95
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
This chapter is intended to provide a self-contained treatment of basic cross-section count data regression analysis. It is analogous to a chapter in a standard statistics text that covers both homoskedastic and heteroskedastic linear regression models.
The most commonly used count models are Poisson and negative binomial. For readers interested only in these models it is sufficient to read sections 3.1 through 3.5, along with preparatory material in sections 1.2 and 2.2 in previous chapters.
Additional regression models for cross-section count data are given in the remainder of Chapter 3, most notably the ordered probit and logit models. These additional models generally ignore the count nature of the data. Still further models, such as the hurdle model, which do explicitly treat the data as count data, are given in Chapter 4. Some model diagnostic methods are presented in Chapter 3, but most are deferred to Chapter 5.
As indicated in Chapter 2, the properties of an estimator vary with the assumptions made on the dgp. By correct specification of the conditional mean or variance or density, we mean that the functional form and explanatory variables in the specified conditional mean or variance or density are those of the dgp.

10 - Measurement Errors
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 301-325
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
The well-known bivariate linear errors-in-variables regression model with additive measurement errors in both variables provides one benchmark for nonlinear errors-in-variables models. The standard textbook treatment of the errors-invariables case emphasizes the attenuation result, which says that the estimated least squares estimate of the slope parameter is downward-biased if both variables are subject to measurement error. The essential problem lies in the correlation between the observed explanatory variable and the measurement error. This leads to distorted inferences about the role of the covariate. Although this result does not always extend to general cases, such as a linear model with two or more covariates measured with error, it is usually of interest to consider whether a similar attenuation bias exists generally in nonlinear models (Carroll et al., 1995).
There are important similarities and differences between measurement errors in nonlinear and linear models. First, in nonlinear models it may be more natural to allow measurement errors to enter multiplicatively rather than additively. Second, models in which the measurement errors are confined to the count variable, rather than covariates, are of considerable interest. Third, the direction of measurement errors in count models is sometimes strongly suspected from a priori analysis, which permits stronger conclusions.
Given these motivations, this chapter considers estimation and inference in the presence of measurement errors in exposure time, errors due to underreporting and misclassification of events. Such errors are shown to have important consequences for model identification, specification, estimation, and testing.

List of Tables
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp xiv-xvi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

8 - Multivariate Data
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp 251-274
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Introduction
In this chapter we consider regression models for an m-dimensional vector of jointly distributed, and in general correlated, random variables y = (y1, y2, …, ym), a subset of which are event counts. A special case is if m = 2, y1 is a count, and y2 is either discrete or continuous. Multivariate data appear in three contexts in this book. The first is basic cross-section, which is the main subject of this chapter. The second is longitudinal data with repeated measures over time on the same variable, leading to special correlation structure handled in Chapter 9. The third is the context of multivariate cross-section data with endogeneity or feedback from yj to yk, dealt with in Chapter 10. There are other forms of multivariate data, such as multivariate time series analogs of Gaussian vector autoregressions, that we do not cover.
Multivariate linear Gaussian models are widely used, but multivariate nonlinear, non-Gaussian models are less common. Fully parametric approaches based on the joint distribution of non-Gaussian vector y, given a set of covariates x, are difficult to apply because analytically and computationally tractable expressions for such joint distributions are available for special cases only. Consequently, it is more convenient to analyze models that are of interest in specific situations.
Multivariate cross-section count models arise in several different settings. The first is that in which several related events are measured as counts and the joint distribution of several counts is required. These models are analogous to the seemingly unrelated regressions model.

List of Figures
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp xiii-xiii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Contents
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University
Book:

Regression Analysis of Count Data

Published online:

05 January 2013

Print publication:

28 September 1998, pp ix-xii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

2348 results in Statistical theory and methods

7 - Local Asymptotic Normality

Summary

9 - Limits of Experiments

Summary

Notation

4 - Moment Estimators

Summary

12 - U -Statistics

Summary

5 - M–and Z-Estimators

Summary

2 - Stochastic Convergence

Summary

Preface

Summary

12 - Flexible Methods for Counts

Summary

References

Subject Index

7 - Time Series Data

Summary

4 - Generalized Count Regression

Summary

Author Index

3 - Basic Count Regression

Summary

10 - Measurement Errors

Summary

List of Tables

8 - Multivariate Data

Summary

List of Figures

Contents

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

Save Search

2348 results in Statistical theory and methods

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary