Search results for Knowledge Management, Databases and Data Mining

11 - Visual Basic Editing and Code Development
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp 291-333
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Many effective decision support systems rely not only on the ability of a manager to present information, analysis, and meaningful dynamics (for example, through graphics), but also on enabling users to realize the intended use of those elements by themselves (without the developer holding their hand). This is often going to mean providing sufficient documentation that might go beyond cell labeling and embedded comments. It may mean coming up with some kind of a customized user-driven help or wizard component as part of the DSS that makes use of not only automated numerical and graphical demos, but also other objects, such as images and .wav files, which could be incorporated into the workbook. This is often going to mean a level of automation that stretches the limits of the kind of work that can happen at the spreadsheet interface alone. In fact, it may be impossible to achieve by using only the top layer of an Excel workbook. Let's see how macros and the Visual Basic (VB) Editor might provide us with some new options in this regard.
The Visual Basic Editor
Let's take a deeper look into one of the first macros I introduced. Opening the Chp8_LobosInventory workbook provides us with an opportunity. To see the code associated with this macro, select the Developer tab on the main menu bar and then select Visual Basic (which will open the general VB Editor screen) or click Macros (see Figure 11.1), and from the associated dialog box select the specific name of the program code you are interested in viewing (in this case, generically called Macro1) and then Edit.

4 - Structuring Problems and Option Visualization
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp 63-105
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Decision modeling describes the use of data and logic to clarify the specific nature of a situation for which assistance in the decision-making process may be needed. The hope is that in clarifying such details, the development of meaningful suggestions and solutions may be easier to create. Most management problems for which decisions are sought can be represented by three standard elements: objectives, decision variables, and cons-traints.
Objectives
Maximize profit
Provide earliest entry into market
Minimize employee discomfort and turnover
Decision variables
Determine what price to use
Determine the length of time tests should be run on a new product or service
Determine the responsibilities to assign to each worker
Constraints
Can't charge below cost
Must test enough to meet minimum safety regulations
Ensure responsibilities are shared by two workers at most
All of these elements can be visualized graphically, often to the benefit of analysis and general insights. Our initial discussion will be limited to objectives and decision variables; we'll discuss constraints further on in this chapter. In most business scenarios, managers are faced with making a set of decisions that impact a final outcome (objective). This tends to make the decision process more complex, and sometimes the rationale for making specific decisions is difficult to describe.

13 - Guided and User-Friendly Interfaces
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp 359-380
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

As you've probably guessed by now, decisions can become increasingly complex as we increase the number of variables and constraints to maintain reality and practicality in our decision-making process. Similarly, the ability to concisely provide visualizations of what is possible and what is ideal (and, conversely, what isn't) becomes increasingly challenging. Given this complexity and the perceived need in industry to nevertheless pursue means of assisting people in decision making, the concept of the dashboard has come into being and continues to gain popularity.
A dashboard, from a general decision-making perspective, is basically a computer interface that allows individual users to simultaneously view various depictions (that is, presented structures) of data and information, as well as various subsets of data (that is, content) relevant to a particular task and user context. For example, Figure 13.1 shows four dashboards that I’ve personally put into use for research and consulting purposes in the recent past.
Two of these are highly oriented toward geographic (specifically, logistics) tasks; the other two are designed with project management tasks in mind. You’ll notice that each of these consists of multiple frames and multiple control- and form-based interfaces. Some make use of parameterization forms more so than others. Some make use of graphs and charts predominantly, whereas others make rich use of tables with key indices summarized. All of them were designed as applications that could function through the use of Excel alone, and are highly mobile from a distributional perspective.

Associated Links
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp ix-x
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

9 - Simulation Search, Optimization, and Reporting
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp 245-268
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

A natural extension of a discussion of simulation, given our existing understanding of optimization, is how the two methods can be used together. The basic question behind simulation optimization is:
What decision (if any) tends to provide relatively superior results regardless of the uncertainty associated with the real-world problems they are designed to resolve?
Simulation provides the means by which to incorporate uncertainty into the evaluation of a specific decision or a predetermined handful of such decisions; however, this question implies a much greater scope. It suggests a formal search for the best decision across a vast range of possible alternative decisions. For simulated variants, the term best takes into account not just the average or expected value of parameters describing the setting (as would be common in discrete optimization), but also the potentially extreme performance of outliers, be that good or bad. For system simulations, the best would necessarily need to further relate to performance as the result of a sequence of events where the interplay of initial guiding decisions, complicated by uncertainty, might be extremely difficult to assess without sufficient simulation runs. The follow-up question then is:
How can we integrate the techniques associated with simulation and optimization into a single solid mechanism for meaningful decision support?

6 - The Analytics of Optimization
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp 140-176
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Excel gives us Solver, a great tool that helps us determine what specific decisions (values of our decision variables) should be used to obtain our objectives that are subject to the issues constraining us. Generally, Solver can be accessed under the Data tab in the Analysis section (Figure 6.1). If you do not find Solver in your Excel Data tab, it means that either Solver was not selected for installation at the time your copy of Excel was installed, or it is currently not activated. To activate Solver, click Options>Add-Ins. Select Excel Add-Ins in the Manage drop-down menu and then click Go. The Add-Ins dialog box opens, which enables you to choose Solver Add-In (Figure 6.2).
Optimization with Solver
The general structure of Solver fits perfectly with the description in Chapter 4 of the three key elements of decision structuring: objectives, decision variables, and constraints (Figure 6.3). Solver is designed to provide the best solutions possible, based on the information we provide. It has its limits (it breaks down with extremely complex or large problems), but it does a nice job for smaller problems that still present challenges to decision makers.

Contents
Elliot Bendoly, Emory University, Atlanta
Book:

Excel Basics to Blackbelt

Published online:

05 August 2013

Print publication:

29 July 2013, pp v-viii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

8 - Multivariate Data
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 304-340
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

INTRODUCTION
In this chapter we consider regression models for an m-dimensional vector of jointly distributed and, in general, correlated random variables y = (y1, y2, …, ym), a subset of which are event counts. One special case of interest is that of m seemingly unrelated count regressions denoted as y∣x = (y1|x1, y2|x2, …, ym|xm), where x = (x1, …, xm) are observed exogenous covariates and the counts are conditionally correlated. In econometric terminology this model is a multivariate reduced-form model in which multivariate dependence is not causal. Most of this chapter deals with such reduced-form dependence. Causal dependence, such as y1 depending explicitly on y2, is covered elsewhere, most notably in Chapter 10.
Depending on the multivariate model, ignoring multivariate dependence may or may not affect the consistency of the univariate model estimator. In either case, joint modeling of y1, …, ym leads to improved efficiency of estimation and the ability to make inferences about the dependence structure. A joint model can also support probability statements about the conditional distribution of a subset of variables, say y1, given realization of another subset, say y2.
Multivariate nonlinear, non-Gaussian models are used much less often than multivariate linear Gaussian models, and there is no model with the universality of the linear Gaussian model. Fully parametric approaches based on the joint distribution of non-Gaussian vector y, given a set of covariates x, are difficult to apply because analytically and computationally tractable expressions for such joint distributions are available for special cases only.

Preface
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp xxi-xxiv
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Since Regression Analysis of Count Data was published in 1998, significant new research has contributed to the range and scope of count data models. This growth is reflected in many new journal articles, fuller coverage in textbooks, and wide interest in and availability of software for handling count data models. These developments (to which we have also contributed) have motivated us to revise and expand the first edition. Like the first edition, this volume reflects an orientation toward practical data analysis.
The revisions in this edition have affected all chapters. First, we have corrected the typographical and other errors in the first edition, improved the graphics throughout, and where appropriate we have provided a cleaner and simpler exposition. Second, we have revised and relocated material that seemed better placed in a different location, mostly within the same chapter though occasionally in a different chapter. For example, material in Chapter 4 (generalized count models), Chapter 8 (multivariate counts), and Chapter 13 (measurement errors) has been pruned and rearranged so the more mainstream topics appear earlier and the more marginal topics have disappeared altogether. For similar reasons bootstrap inference has moved to Chapter 2 from Chapter 5. Our goal here has been to improve quality of synthesis and accessibility of material to the reader. Third, the final few chapters have been reordered. Chapter 10 (endogeneity and selection) has moved up from Chapter 11.

Dedication
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp vii-viii
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Subject Index
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 553-566
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

11 - Flexible Methods for Counts
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 413-448
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

C - Software
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 509-510
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

Most regression packages support estimation of cross-section Poisson and negative binomial regression models. Packages that claim to have a significant component for count models most likely cover cross-section truncated, zero-inflated, and hurdle models, and they may cover the standard panel count models. Packages that instead only include counts within a GLM module do not cover truncated, zero-inflated, and hurdle models and only cover panel counts to the extent that they have GEE and generalized linear mixed model modules. They do not cover the Poisson fixed effects model.
LIMDEP 9.0 covers a wide range of cross-section count models, including censored, truncated, hurdle, zero-inflated, latent heterogeneity, and latent class, as well as panel fixed and random effects Poisson and negative binomial.
Stata 12.0 covers a similar range of models. Additionally it covers GLM and GEE estimators, some mixed models, and a wide range of mixed models with user-written add-on GLLAMM.
SAS 9.3 covers a similar range of models to Stata through procedures COUNT, TRCOUNT (a recent addition that covers zero-truncated and panel), GENMOD, GEE, and NLMIXED. Procedure COPULA covers copula-based models.
EViews 7.2 covers cross-section Poisson, NB, and GLM.
TSP 5.1 covers cross-section Poisson and NB. It also provides a relatively simple syntax for ML and GMM estimation of user-provided objective functions, including panel count models.
SPSS 20 covers cross-section Poisson and NB and has procedures GENLIN, GEE, and GLMM for GLMs and their multi-equation extensions.

7 - Time Series Data
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 263-303
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

INTRODUCTION
The previous chapters have focused on models for cross-section regression on a single count dependent variable. We now turn to models for more general types of data – univariate time series data in this chapter, multivariate cross-section data in Chapter 8, and longitudinal or panel data in Chapter 9.
Count data introduce complications of discreteness and heteroskedasticity. For cross-section data, this leads to moving from the linear model to the Poisson regression model. However, this model is often too restrictive when confronted with real data, which are typically overdispersed. With cross-section data, overdispersion is most frequently handled by leaving the conditional mean unchanged and rescaling the conditional variance. The same adjustment is made regardless of whether the underlying cause of overdispersion is unobserved heterogeneity in a Poisson point process or true contagion leading to dependence in the process.
For time series count data, one can again begin with the Poisson regression model. In this case, however, it is not clear how to proceed if dependence is present. For example, developing even a pure time series count model where the count in period t, yt, depends only on the count in the previous period, yt−1, is not straightforward, and there are many possible ways to proceed. Even restricting attention to a fully parametric approach, one can specify distributions for yt either conditional on yt−1 or unconditional on yt−1. For count data this leads to quite different models, whereas for continuous data the assumption of joint normality leads to both conditional and marginal distributions that are also normal.

Author Index
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 543-552
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

Miscellaneous Endmatter
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 567-567
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

1 - Introduction
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 1-20
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

God made the integers, all the rest is the work of man.
– Kronecker
This book is concerned with models of event counts. An event count refers to the number of times an event occurs, for example, the number of airline accidents or earthquakes. It is the realization of a nonnegative integer-valued random variable. A univariate statistical model of event counts usually specifies a probability distribution of the number of occurrences of the event known up to some parameters. Estimation and inference in such models are concerned with the unknown parameters, given the probability distribution and the count data. Such a specification involves no other variables, and the number of events is assumed to be independently identically distributed (iid). Much early theoretical and applied work on event counts was carried out in the univariate framework. The main focus of this book, however, is on regression analysis of event counts.
The statistical analysis of counts within the framework of discrete parametric distributions for univariate iid random variables has a long and rich history (Johnson, Kemp, and Kotz, 2005). The Poisson distribution was derived as a limiting case of the binomial by Poisson (1837). Early applications include the classic study of Bortkiewicz (1898) of the annual number of deaths in the Prussian army from being kicked by mules. A standard generalization of the Poisson is the negative binomial distribution. It was derived by Greenwood and Yule (1920), as a consequence of apparent contagion due to unobserved heterogeneity, and by Eggenberger and Polya (1923) as a result of true contagion.

Frontmatter
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp i-vi
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

References
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 511-542
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation

5 - Model Evaluation and Testing
A. Colin Cameron, University of California, Davis, Pravin K. Trivedi, Indiana University, Bloomington
Book:

Regression Analysis of Count Data

Published online:

05 July 2014

Print publication:

27 May 2013, pp 177-224
- Chapter
- - Get access
    
    Check if you have access via personal or institutional login
    
    Log in Register
- Export citation
Summary

INTRODUCTION
It is desirable to analyze count data using a cycle of model specification, estimation, testing, and evaluation. This cycle can go from specific to general models – for example, it can begin with Poisson and then test for the negative binomial – or one can use a general-to-specific approach, beginning with the negative binomial and then testing the restrictions imposed by Poisson. In terms of inclusion of regressors in a given count model, either approach might be taken; for the choice of the count data model itself, other than simple choices such as Poisson or negative binomial, the former approach is most often useful. For example, if the negative binomial model is inadequate, there is a very wide range of models that might be considered, rendering a general-to-specific approach difficult to implement.
The preceding two chapters have presented the specification and estimation components of this cycle for cross-section count data. In this chapter we focus on the testing and evaluation aspects of this cycle. This includes residual analysis, goodness-of-fit measures, and model specification tests, in addition to classical statistical inference.
Residual analysis, based on a range of definitions of the residual for heteroskedastic data such as counts, is presented in section 5.2. A range of measures of goodness of fit, including pseudo R-squareds and a chi-square goodness-of-fit statistic, is presented in section 5.3. Discrimination among nonnested models is the subject of section 5.4.

Knowledge Management, Databases and Data Mining

Refine search

Refine search

Actions for selected content:

1835 results in Knowledge Management, Databases and Data Mining

11 - Visual Basic Editing and Code Development

Summary

4 - Structuring Problems and Option Visualization

Summary

13 - Guided and User-Friendly Interfaces

Summary

Associated Links

9 - Simulation Search, Optimization, and Reporting

Summary

6 - The Analytics of Optimization

Summary

Contents

8 - Multivariate Data

Summary

Preface

Summary

Dedication

Subject Index

11 - Flexible Methods for Counts

C - Software

Summary

7 - Time Series Data

Summary

Author Index

Miscellaneous Endmatter

1 - Introduction

Summary

Frontmatter

References

5 - Model Evaluation and Testing

Summary

Knowledge Management, Databases and Data Mining

Refine search

Refine search

Actions for selected content:

Save Search

1835 results in Knowledge Management, Databases and Data Mining

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary

Summary