Search results for Statistics and Probability

7 - About Data and Data Collection
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 226-243
- Chapter
Summary

In statistics, we are often interested in some characteristics of a population. Maybe we are interested in the mean of some measurable characteristic, or maybe we are interested in the proportion of the population that have some property. In all but the simplest cases, the population is so large that it is impossible, or at least impractical, to take the measurement on every item in the population. We therefore have to settle on taking a sample and measuring those units selected for this sample.

Contents
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp vii-xii
- Chapter

16 - Time Series Methods
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 618-644
- Chapter
Summary

Forecasting is an important problem that spans many fields, including business and industry, government, economics, environmental sciences, medicine, social science, politics, and finance. Forecasting problems are often classified as short term, medium term, and long term. Short-term forecasting problems involve predicting events only a few time periods (days, weeks, and months) into the future. Medium-term forecasts extend from 1 to 2 years into the future, and long-term forecasting problems can extend beyond that by many years.

13 - Hypothesis Tests for Categorical Data
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 459-492
- Chapter
Summary

Often we look at the relationships between categorical variables, such as which hospital a patient is admitted to, or whether a person has diabetes, pre-diabetes, or no diabetes at all. These variables can be nominal (like the hospital) or ordinal (like diabetes, pre-diabetes, or no diabetes). In many cases we want to know something about how these variables are related.

1 - Introduction
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 1-30
- Chapter
Summary

In the 1990s and before, most of the world’s information was stored on paper and other analog media, such as film. However, with the proliferation of personal computers and the internet, by 2000 one-quarter of the world’s information was stored digitally. Since that time, the amount of digital data has exploded, roughly doubling every couple of years, so that now more than 98% of all stored information is digital.

2 - Data Visualization
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 31-53
- Chapter
Summary

Graphical plots are the means by which data are most easily visualized and understood. Indeed, there is no better tool for finding patterns in data than the human eye applied to appropriate displays of relevant data, particularly patterns that are ill-specified or unknown.

19 - Cross-Validation and Estimates of Prediction Error
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 751-772
- Chapter
Summary

Overfitting refers to the use of a model with more parameters than can be justified by the data. Models that are overfit are often poor at predicting the outcome of new observations, that is, observations that were not used in the construction of the model. The next example illustrates this concept.

9 - Point Estimation
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 273-301
- Chapter
Summary

We began the last chapter by reviewing the terms population, parameter, sample, and statistic. Parameters are numerical characteristics of a population that we would like to know, but since the population is nearly always too large to make a measurement on every unit, we often rely on a sample from the population.

17 - Estimating the Standard Error: Analytic Approximations, the Jackknife, and the Bootstrap
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 645-683
- Chapter
Summary

In Chapter 9 we discussed point estimation for a parameter or a vector of parameters. In Chapters 10 and 11, on confidence intervals and hypothesis testing, we needed the idea of the standard error of an estimator.

12 - Hypothesis Tests for Two or More Populations
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 386-458
- Chapter
Summary

A common problem in statistics is to compare groups. Does a new drug work better at reducing the time of hospitalization from COVID? Which pop-up ad generates a higher click-rate? Which type of metal – aluminum, brass, or stainless steel – will produce the most reliable product? Usually, the question involves either the mean response or the proportion of responses.

10 - Confidence Intervals
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 302-347
- Chapter
Summary

The problem of statistical inference can be described as follows. There is a population and we would like to know certain aspects of the units that make up the population. For example, we might want to know what proportion have a certain property, or what the mean value (of some measure) of all units in the population is. The population is too large to sample in its entirety, so we rely on information from a sample taken from the population.

18 - Generalized Linear Models and Regression Trees
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 684-750
- Chapter
Summary

In Chapter 14 we studied multiple regression and polynomial regression and how these techniques can be used to determine the relationship between an outcome $y$ and several predictor variables $x_{1}, x_{2}, \dots, x_{p}$.

5 - Discrete Distributions
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 132-169
- Chapter
Summary

In Chapter 3 we learned about the fundamental ideas of probability, and in Chapter 4 we generalized the notion of probability from working with sets to working with random variables and distributions. In many ways, random variables and their associated distributions can simplify probability calculations and, appropriately applied, are useful models for real-world phenomena.

4 - Healthy research: Study designs for public health
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp 93-120
- Chapter
Summary

In this chapter, we look at the analytic studies that are our main tools for identifying the causes of disease and evaluating health interventions. Unlike descriptive epidemiology, analytic studies involve planned comparisons between people with and without disease, or between people with and without exposures thought to cause (or prevent) disease. They try to answer the questions, ‘Why do some people develop disease?’ and ‘How strong is the association between exposure and outcome?’. This group of studies includes the intervention, cohort and case–control studies that you met briefly in Chapter 1. Together, descriptive and analytic epidemiology provide information for all stages of health planning, from the identification of problems and their causes to the design, funding and implementation of public health solutions and the evaluation of whether these solutions really work and are cost-effective in practice.

Acknowledgement of Country
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp ii-ii
- Chapter

Acknowledgements
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp xvi-xviii
- Chapter

THE SPECTRAL APPROACH TO LINEAR RATIONAL EXPECTATIONS MODELS
Majid M. Al-Sadoon
Journal:

Econometric Theory, First View

Published online by Cambridge University Press:

12 November 2024, pp. 1-57
- Article
- Open access
This paper considers linear rational expectations models in the frequency domain. The paper characterizes existence and uniqueness of solutions to particular as well as generic systems. The set of all solutions to a given system is shown to be a finite-dimensional affine space in the frequency domain. It is demonstrated that solutions can be discontinuous with respect to the parameters of the models in the context of nonuniqueness, invalidating mainstream frequentist and Bayesian methods. The ill-posedness of the problem motivates regularized solutions with theoretically guaranteed uniqueness, continuity, and even differentiability properties.

Appendix 5 - Calculating life expectancy from a life table
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp 389-390
- Chapter

Appendix 3 - Calculating risk and lifetime risk from routine data
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp 385-386
- Chapter

12 - Public health surveillance: Collecting data for public health action
- By Martyn Kirk, Penelope Webb, Chris Bain
Penelope Webb, QIMR Berghofer Medical Research Institute, Chris Bain, Andrew Page, Western Sydney University
Book:

Essential Epidemiology

Published online:

27 September 2024

Print publication:

12 November 2024, pp 271-291
- Chapter
Summary

In the previous chapters we have considered the ‘nuts and bolts’ of epidemiology. In this and the next few chapters we look at how epidemiology is used in practice to improve public health. We start with ‘surveillance’ because without timely information on emerging and changing health problems, public health action can be paralysed or, at best, inefficient. In this chapter we discuss the design and use of surveillance systems that enable health officials to detect new risks and diseases such as mpox promptly, track known diseases and health problems, and generate data needed for effective health planning and resource allocation.

52319 results in Statistics and Probability
