Search results for Statistical theory and methods

7 - Alternative variance parameterizations
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 136-159
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Negative binomial regression has traditionally been used to model otherwise overdispersed count or Poisson data. It is now considered to be the general catch-all method used when Poisson data are found to be overdispersed, particularly when the source of overdispersion has not been identified. When we can identify that which gives rise to extra correlation, and hence overdispersion, the basic Poisson and negative binomial algorithms may themselves be further adjusted or enhanced to directly address the identified source of the extra correlation. For example, when overdispersion results from an excess of zero counts in the response, an appropriate strategy is to model the data using either a zero-inflated Poisson (ZIP) or zero-inflated negative binomial (ZINB). Employing a hurdle model may also result in a better fit. On the other hand, if the response is structured such that zero counts are not possible, such as in hospital length of stay data, a zero-truncated Poisson (ZTP) or zero-truncated negative binomial (ZTNB) model may be appropriate.
A variety of alternative models have been developed to address specific facts in the data that give rise to overdispersion. Models dealing with an excess or absence of zeros typically define a mixture that alters the distributional variance of the Poisson distribution. Other models are constructed to alter not the probability and log-likelihood distributions, but rather the Poisson and negative binomial variance functions. We discuss these types of models in this chapter.

Author Index
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 247-248
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

4 - Overdispersion
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 51-76
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Appendix E - Data sets
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 240-241
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

References
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 242-246
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Appendix C - Stata negative binominal – ML algorithm
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 237-238
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Appendix D - Negative binomial variance functions
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 239-239
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

10 - Negative binomial panel models
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 198-232
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

A basic assumption in the construction of models from likelihood theory is that observations in the model are independent. This is a reasonable assumption for perhaps the majority of studies. However, for longitudinal studies this assumption is not feasible; nor does it hold when data are clustered. For example, observations from a study on student drop-out can be clustered by the type of schools sampled. If the study is related to intervention strategies, schools in affluent suburban, middle-class suburban, middle-class urban, and below poverty level schools have more highly correlated strategies within the school type than between types or groups. Likewise, if we have study data taken on a group of individual patients over time (e.g., treatment results obtained once per month for a year), the data related to individuals in the various time periods are likely to be more highly correlated than are treatment results between patients. Any time the data can be grouped into clusters, or panels, of correlated groups, we must adjust the likelihood-based model (based on independent observations) to account for the extra-correlation.
We have previously employed robust variance estimators and bootstrapped standard errors when faced with overdispersed count data. Overdispersed Poisson models were adjusted by using different types of negative binomial models, or by extending the basic Poisson model by adjusting the variance or by designing a new log-likelihood function to account for the specific cause of the overdispersion.

6 - Negative binomial regression: modeling
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 99-135
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

9 - Negative binomial with censoring, truncation, and sample selection
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 179-197
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

There are many times when certain data elements are lost, discarded, ignored, or are otherwise excluded from analysis. Truncated and censored models have been developed to deal with these types of data. Both models take two forms, truncation or censoring from below, and truncation or censoring from above. Count model forms take their basic logic from truncated and censored continuous response data, in particular from Tobit (Amemiya, 1984) and censored normal regression (Goldberger, 1983) respectively.
Count sample selection models also deal with data situations in which the distribution is confounded by an external condition. We shall address sample selection models at the end of the chapter.
The traditional parameterization used for truncated and censored count data can be called the econometric parameterization. This is the form of model discussed in standard econometric texts and is the form found in current econometric software implementations. I distinguish this from what I term a survival parameterization, the form of which is derived from standard survival models. This parameterization only relates to censored Poisson and censored negative binomial models. I shall first address the more traditional econometric parameterization. In addition, I shall not use subscripts for this chapter; they are understood as presented in the earlier chapters.
Censored and truncated models – econometric parameterization
Censored and truncated count models are related, with only a relatively minor algorithmic difference between the two. The essential difference relates to how response values beyond a user-defined cut point are handled.

1 - Overview of count response models
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 8-18
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Introduction
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 1-7
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The negative binomial is traditionally derived from a Poisson–gamma mixture model. However, the negative binomial may also be thought of as a member of the single parameter exponential family of distributions. This family of distributions admits a characterization known as generalized linear models (GLMs), which summarizes each member of the family. Most importantly, the characterization is applicable to the negative binomial. Such interpretation allows statisticians to apply to the negative binomial model the various goodness-of-fit tests and residual analyses that have been developed for GLMs.
Poisson regression is the standard method used to model count response data. However, the Poisson distribution assumes the equality of its mean and variance – a property that is rarely found in real data. Data that have greater variance than the mean are termed Poisson overdispersed, but are more commonly designated as simply overdispersed. Negative binomial regression is a standard method used to model overdispersed Poisson data.
When the negative binomial is used to model overdispersed Poisson count data, the distribution can be thought of as an extension to the Poisson model. Certainly, when the negative binomial is derived as a Poisson–gamma mixture, thinking of it in this way makes perfect sense. The original derivation of the negative binomial regression model stems from this manner of understanding it, and has continued to characterize the model to the present time.
As mentioned above, the negative binomial has recently been thought of as having an origin other than as a Poisson–gamma mixture.

8 - Problems with zero counts
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 160-178
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

I have indicated that extended negative binomial models are generally developed to solve either a distributional or variance problem arising in the base NB-2 model. Changes to the negative binomial variance function were considered in the last chapter. In this chapter, we address the difficulties that arise when there are either no possible zeros in the data, or when there are an excessive number.
Zero-truncated negative binomial
Often we are asked to model count data that structurally exclude zero counts. Hospital length of stay data are an excellent example of count data that cannot have a zero count. When a patient first enters the hospital, the count begins. Upon registration the length of stay is given as 1. There can be no 0 days – unless we are describing patients who do not enter the hospital, and this is a different model where there may be two generating processes. This type of model will be discussed later.
The Poisson and negative binomial distributions both include zeros. When data structurally exclude zero counts, then the underlying probability distribution must preclude this outcome to properly model the data. This is not to say that Poisson and negative binomial models are not commonly used to model such data, the point is that they should not. The Poisson and negative binomial probability functions, and their respective log-likelihood functions, need to be amended to exclude zeros, and at the same time provide for all probabilities in the distribution to sum to one.

3 - Poisson regression
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp 39-50
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Poisson regression is the standard or base count response regression model. We have seen in previous discussion that other count models deal with data that violate the assumptions carried by the Poisson model. Since the model does play such a central role in count response modeling, we begin with an examination of its derivation and structure, as well as how it can be parametermized to model rates. The concept of overdispersion is introduced in this chapter, together with two tests that have been used to assess its existence and strength.
Derivation of the Poisson model
A primary assumption is that of equidispersion, or the equality of the mean and variance functions. When the value of the variance exceeds that of the mean, we have what is termed overdispersion. Negative binomial regression is a standard way to deal with certain types of Poisson overdispersion; we shall find that there are a variety of negative binomial based models, each of which address the manner in which overdispersion has arisen in the data. However, to fully appreciate the negative binomial model and its variations, it is important to have a basic understanding of the derivation of the Poisson as well as an understanding of the logic of its interpretation.
Maximum likelihood models, as well as the canonical form members of generalized linear models, are ultimately based on an estimating equation derived from a probability distribution.

Frontmatter
Joseph M. Hilbe, Arizona State University
Book:

Negative Binomial Regression

Published online:

05 June 2012

Print publication:

23 August 2007, pp i-iv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

2 - Properties and derivations
Ronald W. Butler, Colorado State University
Book:

Saddlepoint Approximations with Applications

Published online:

25 February 2010

Print publication:

16 August 2007, pp 38-74
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Chapter 1 introduced expressions which define the various saddlepoint approximations along with enough supplementary information to allow the reader to begin making computations. This chapter develops some elementary properties of the approximations which leads to further understanding of the methods. Heuristic derivations for many of the approximations are presented.
Simple properties of the approximations
Some important properties possessed by saddlepoint density/mass functions and CDFs are developed below. Unless noted otherwise, the distributions involved throughout are assumed to have MGFs that are convergent on open neighborhoods of 0.
The first few properties concern a linear transformation of the random variable X to Y = σX + μ with σ ≠ 0. When X is discrete with integer support, then Y has support on a subset of the σ-lattice {μ,μ ± σ, μ ± 2σ, …}. The resulting variable Y has a saddlepoint mass and CDF approximation that has not been defined and there are a couple of ways in which to proceed. The more intriguing approach would be based on the inversion theory of the probability masses, however, the difficulty of this approach places it beyond the scope of this text. A more expedient and simpler alternative approach is taken here which adopts the following convention and which leads to the same approximations.
Lattice convention. The saddlepoint mass function and CDF approximation for lattice variable Y, with support in {μ, μ ± σ,μ ± 2σ, …} for σ > 0, are specified in terms of their equivalents based on X = (Y − μ) /σ with support on the integer lattice.

8 - Probabilities with r*-type approximations
Ronald W. Butler, Colorado State University
Book:

Saddlepoint Approximations with Applications

Published online:

25 February 2010

Print publication:

16 August 2007, pp 259-284
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Approximations to continuous univariate CDFs of MLEs in curved exponential and transformation families have been derived in Barndorff-Nielsen (1986, 1990, 1991) and are often referred to as r * approximations. These approximations, along with their equivalent approximations of the Lugannani and Rice/Skovgaard form, are presented in the next two chapters. Section 8.2 considers the conditional CDF for the MLE of a scalar parameter given appropriate ancillaries. The more complex situation that encompasses a vector nuisance parameter is the subject of chapter 9.
Other approaches to this distribution theory, aimed more toward p-value computation, are also presented in section 8.5. Fraser and Reid (1993, 1995, 2001) and Fraser et al. (1999a) have suggested an approach based on geometrical considerations of the inference problem. In this approach, explicit ancillary expressions are not needed which helps to simplify the computational effort. Along these same lines, Skovgaard (1996) also offers methods forCDF approximation that are quite simple computationally. Specification of ancillaries is again not necessary and these methods are direct approximations to the procedures suggested by Barndorff-Nielsen above.
Expressions for these approximate CDFs involve partial derivatives of the likelihood with respect the parameter but also with respect to the MLE and other quantities holding the approximate ancillary fixed. The latter partial derivatives are called sample space derivatives and can be difficult to compute. An introduction to these derivatives is given in the next section and approximations to such derivatives, as suggested in Skovgaard (1996), are presented in appropriate sections.

12 - Ratios and roots of estimating equations
Ronald W. Butler, Colorado State University
Book:

Saddlepoint Approximations with Applications

Published online:

25 February 2010

Print publication:

16 August 2007, pp 374-429
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

The ratio R = U/V of two random variables U and V, perhaps dependent, admits to saddlepoint approximation through the joint MGF of (U, V). If V > 0 with probability one, then the Lugannani and Rice approximation may be easily applied to approximate the associated CDF. Saddlepoint density approximation based on the joint MGF uses the Geary (1944) representation for its density. This approach was first noted in Daniels (1954, –9) and is discussed in section 12.1 below.
The ratio R is the root of the estimating equation U − RV = 0 and the distribution theory for ratios can be generalized to consider distributions for roots of general estimating equations. The results of section 12.1 are subsumed into the more general discussion of section 12.2 that provides approximate distributions for roots of general estimating equations. Saddlepoint approximations for these roots began in the robustness literature where M-estimates are the roots of certain estimating equations and the interestwas in determining their distributions when sample sizes are small. Hampel (1973), Field and Hampel (1982), and Field (1982) were instrumental in developing this general approach.
Saddlepoint approximation for a vector of ratios, such as for example (R1, R2, R3) = {U1/V, U2/V, U3/V}, is presented in section 12.3 and generalizes the results of Geary (1944). An important class of such examples to be considered includes vector ratios of quadratic forms in normal variables. A particularly prominent example in times series which is treated in detail concerns approximation to the joint distribution for the sequence of lag correlations comprising the serial autocorrelation function.

3 - Multivariate densities
Ronald W. Butler, Colorado State University
Book:

Saddlepoint Approximations with Applications

Published online:

25 February 2010

Print publication:

16 August 2007, pp 75-106
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Index
Ronald W. Butler, Colorado State University
Book:

Saddlepoint Approximations with Applications

Published online:

25 February 2010

Print publication:

16 August 2007, pp 560-564
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

2348 results in Statistical theory and methods

7 - Alternative variance parameterizations

Summary

Author Index

4 - Overdispersion

Appendix E - Data sets

References

Appendix C - Stata negative binominal – ML algorithm

Appendix D - Negative binomial variance functions

10 - Negative binomial panel models

Summary

6 - Negative binomial regression: modeling

9 - Negative binomial with censoring, truncation, and sample selection

Summary

1 - Overview of count response models

Introduction

Summary

8 - Problems with zero counts

Summary

3 - Poisson regression

Summary

Frontmatter

2 - Properties and derivations

Summary

8 - Probabilities with r*-type approximations

Summary

12 - Ratios and roots of estimating equations

Summary

3 - Multivariate densities

Index

Statistical theory and methods

Refine search

Refine search

Actions for selected content:

Save Search

2348 results in Statistical theory and methods

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary