Search results for Statistics and Probability

A new weighted means of failure rate and associated quantile versions
Part of
- Distribution theory - Probability
- Survival analysis and censored data
Subarna Bhattacharjee, S M Sunoj, Sabana Anwar
Journal:

Probability in the Engineering and Informational Sciences / Volume 39 / Issue 1 / January 2025

Published online by Cambridge University Press:

15 November 2024, pp. 64-82
- Article
  - Open access
In this paper, we define weighted failure rates and their means from the standpoint of an application. We begin by emphasizing that forming a series system of n independent components having weighted failure rates, with the weight functions summing to unity, is the same as forming a mixture of n distributions. We derive some parametric and non-parametric characterization results. We discuss the form-invariance property of the baseline failure rate for a specific choice of weight function. Some bounds on means of aging functions are obtained. We also establish that the weighted increasing failure rate average (IFRA) class, unlike the IFRA class, is not closed under the formation of coherent systems. An interesting application of the present work is that the quantile version of means of failure rate is obtained as a special case of weighted means of failure rate.

Precise large deviations of the net loss process in a non-standard two-dimensional risk model
Part of
- Applications
- Limit theorems
Qingwu Gao, Zimai Dong, Xijun Liu, Junni Yan
Journal:

Probability in the Engineering and Informational Sciences / Volume 39 / Issue 1 / January 2025

Published online by Cambridge University Press:

15 November 2024, pp. 44-63
- Article
  - Open access
This paper investigates the precise large deviations of the net loss process in a two-dimensional risk model with consistently varying tails and dependence structures, and gives some asymptotic formulas that hold uniformly for all x varying in t-intervals. The study is among the first efforts to analyze potential risk via large deviation results for the net loss process of the two-dimensional risk model, and offers a novel way to assess operation risk in the long run by fully accounting for the premium income of the insurance company.

An extended class of multivariate counting processes and its main properties
Part of
- Special processes
- Applications
Ji Hwan Cha, Sophie Mercier
Journal:

Probability in the Engineering and Informational Sciences / Volume 39 / Issue 1 / January 2025

Published online by Cambridge University Press:

15 November 2024, pp. 83-100
- Article
  - Open access
In this paper, a new multivariate counting process model (called Multivariate Poisson Generalized Gamma Process) is developed and its main properties are studied. Some basic stochastic properties of the number of events in the new multivariate counting process are initially derived. It is shown that this new multivariate counting process model includes the multivariate generalized Pólya process as a special case. The dependence structure of the multivariate counting process model is discussed. Some results on multivariate stochastic comparisons are also obtained.

Aggregate: fast, accurate, and flexible approximation of compound probability distributions
Stephen Mildenhall
Journal:

Annals of Actuarial Science / Volume 19 / Issue 2 / July 2025

Published online by Cambridge University Press:

15 November 2024, pp. 193-232
- Article
  - Open access
Aggregate implements an efficient fast Fourier transform (FFT)-based algorithm to approximate compound probability distributions. Leveraging FFT-based methods offers advantages over recursion and simulation-based approaches, providing speed and accuracy to otherwise time-consuming calculations. Combining user-friendly features and an expressive domain-specific language called DecL, Aggregate enables practitioners and nonprogrammers to work with complex distributions effortlessly. The software verifies the accuracy of its FFT-based numerical approximations by comparing their first three moments to those calculated analytically from the specified frequency and severity. This moment-based validation, combined with carefully chosen default parameters, allows users without in-depth knowledge of the underlying algorithm to be confident in the results. Aggregate supports a wide range of frequency and severity distributions, policy limits and deductibles, and reinsurance structures and has applications in pricing, reserving, risk management, teaching, and research. It is written in Python.
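The FFT idea and the moment check described in this abstract can be sketched in a few lines. The code below is not Aggregate's API or its algorithm as implemented; it is a minimal illustration using only the Python standard library, with a hand-written radix-2 FFT, a compound Poisson frequency, and a hypothetical severity that is uniform on the integers 1 to 10. The function names `fft` and `compound_poisson_pmf`, the grid size, and all input values are assumptions made for the example.

```python
import cmath
import math

def fft(values, inverse=False):
    """Recursive radix-2 FFT; len(values) must be a power of two.

    The inverse transform is returned unscaled (the caller divides by n).
    """
    n = len(values)
    if n == 1:
        return [complex(values[0])]
    even = fft(values[0::2], inverse)
    odd = fft(values[1::2], inverse)
    sign = 1.0 if inverse else -1.0
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(sign * 2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out

def compound_poisson_pmf(lam, severity_pmf):
    """Approximate the pmf of S = X_1 + ... + X_N with N ~ Poisson(lam).

    severity_pmf[k] = P(X = k) on the grid 0..n-1, zero-padded so that the
    aggregate mass fits on the grid without noticeable wrap-around.
    """
    n = len(severity_pmf)
    phi = fft(severity_pmf)                            # transform of the severity
    transformed = [cmath.exp(lam * (z - 1.0)) for z in phi]  # Poisson pgf applied pointwise
    return [z.real / n for z in fft(transformed, inverse=True)]

# Hypothetical inputs: severity uniform on {1, ..., 10}, expected claim count 3.
n = 1024
severity = [0.0] * n
for k in range(1, 11):
    severity[k] = 0.1
lam = 3.0
agg = compound_poisson_pmf(lam, severity)

# Moments recovered from the numerical approximation.
fft_mean = sum(k * p for k, p in enumerate(agg))
fft_var = sum(k * k * p for k, p in enumerate(agg)) - fft_mean ** 2

# Analytic compound Poisson moments: E[S] = lam * E[X], Var[S] = lam * E[X^2].
exact_mean = lam * sum(k * p for k, p in enumerate(severity))
exact_var = lam * sum(k * k * p for k, p in enumerate(severity))

print(f"mean: fft={fft_mean:.6f} exact={exact_mean:.6f}")
print(f"var:  fft={fft_var:.6f} exact={exact_var:.6f}")
```

Only the first two moments are compared here; the abstract states that the software compares the first three. A large gap between the numerical and analytic moments would suggest the grid is too short or too coarse for the chosen frequency and severity.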

Ethics in Econometrics

A Guide to Research Practice
Philip Hans Franses
Published online:

14 November 2024

Print publication:

28 November 2024
- Book
Applied econometrics uses the tools of theoretical econometrics and real-world data to develop predictive models and assess economic theories. Due to the complex nature of such analysis, various assumptions are often not understood by those people who rely on it. The danger of this is that economic policies can be assessed favourably to suit a particular political agenda and forecasts can be generated to match the needs of a particular customer. Ethics in Econometrics argues that econometricians need to be aware of potential ethical pitfalls when carrying out their analysis and that they need to be encouraged to avoid them. Using a range of empirical examples and detailed discussions of real cases, this book provides a guide for research practices in econometrics, illustrating why it is imperative that econometricians act ethically in terms of the way they conduct their analysis and treat their data.

Decentralized insurance: On the popularity of tontines and peer-to-peer (P2P) insurance schemes
Michel Denuit, Jan Dhaene, Runhuan Feng, Peter Hieber, Christian Y. Robert
Journal:

Annals of Actuarial Science / Volume 18 / Issue 2 / July 2024

Published online by Cambridge University Press:

14 November 2024, pp. 237-241
- Article

Reviews
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp ii-ii
- Chapter

4 - Random Variables
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 82-131
- Chapter
Summary

In Chapter 3 we learned how to do basic probability calculations and even put them to use solving some fairly complicated probability problems. In this chapter and the next two, we generalize how we do probability calculations, where we will transition from working with sets and events to working with random variables.

3 - Basic Probability
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 54-81
- Chapter
Summary

To do statistics you must first be able to “speak probability.” In this chapter we are going to concentrate on the basic ideas of probability. In probability, the mechanism that generates outcomes is assumed known and the problems focus on calculating the chance of observing particular types or sets of outcomes. Classical problems include flipping “fair” coins (where fair means that on one flip of the coin the chance it comes up heads is equal to the chance it comes up tails) and “fair” dice (where fair now means the chance of landing on any side of the die is equal to that of landing on any other side).
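The "fair" coin and die problems mentioned in this summary reduce to counting equally likely outcomes. The following is a small sketch of that counting, not an example taken from the book; the helper name `probability` and the two events chosen (exactly two heads in three flips, two dice summing to seven) are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import product

def probability(outcomes, event):
    """Chance of `event` when every outcome in `outcomes` is equally likely."""
    outcomes = list(outcomes)
    favourable = sum(1 for outcome in outcomes if event(outcome))
    return Fraction(favourable, len(outcomes))

# Three flips of a fair coin: 2**3 = 8 equally likely sequences.
three_flips = product("HT", repeat=3)
p_two_heads = probability(three_flips, lambda flips: flips.count("H") == 2)

# Two fair six-sided dice: 6 * 6 = 36 equally likely ordered pairs.
two_dice = product(range(1, 7), repeat=2)
p_sum_seven = probability(two_dice, lambda roll: sum(roll) == 7)

print(p_two_heads)   # 3/8
print(p_sum_seven)   # 1/6
```

Exact fractions are used instead of floats so the results can be read directly as "favourable outcomes over total outcomes".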

Dedication
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp v-vi
- Chapter

6 - Continuous Distributions
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 170-225
- Chapter
Summary

In Chapter 5 we learned about a number of discrete distributions. In this chapter we focus on continuous distributions, which are useful as models of various real-world events. By the end of this chapter you will know nine continuous and eight discrete distributions. There are many more continuous distributions, but these nine will suffice for our purposes. These continuous distributions are useful for modeling various types of processes and phenomena that are encountered in the real world.

8 - Sampling Distributions
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 244-272
- Chapter
Summary

Sampling joke: “If you don’t believe in random sampling, the next time you have a blood test, tell the doctor to take it all.” At the beginning of Chapter 7 we introduced the ideas of population vs. sample and parameter vs. statistic. We build on this in the current chapter. The key concept in this chapter is that if we were to take different samples from a distribution and compute some statistic, such as the sample mean, then we would get different results.
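The key concept in this summary, that a statistic varies from sample to sample, can be seen in a short simulation. This is an illustrative sketch only, not material from the book: the population (rolls of a fair six-sided die), the sample size of 25, the 2000 repetitions, and the function name `sample_means` are all assumptions.

```python
import random
import statistics

def sample_means(draw, sample_size, n_samples, seed=0):
    """Draw `n_samples` independent samples and return each sample's mean.

    `draw(rng)` returns one observation from the population.
    """
    rng = random.Random(seed)
    return [
        statistics.fmean([draw(rng) for _ in range(sample_size)])
        for _ in range(n_samples)
    ]

# Hypothetical population: rolls of a fair die (mean 3.5, variance 35/12).
means = sample_means(lambda rng: rng.randint(1, 6), sample_size=25, n_samples=2000)

centre = statistics.fmean(means)
spread = statistics.stdev(means)
theoretical_spread = (35 / 12 / 25) ** 0.5  # sigma / sqrt(n)

print(f"distinct sample means observed: {len(set(means))}")
print(f"mean of sample means: {centre:.3f} (population mean 3.5)")
print(f"sd of sample means:   {spread:.3f} (theory {theoretical_spread:.3f})")
```

The sample means differ from one sample to the next, yet they cluster around the population mean with a spread close to the population standard deviation divided by the square root of the sample size.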

11 - Hypothesis Testing
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 348-385
- Chapter
Summary

The last two chapters have covered the basic concepts of estimation. In Chapter 9 we studied the problem of giving a single number to estimate a parameter. In Chapter 10 we looked at ways to give an interval that we believe will include the true parameter. In many applications, we want to ask some very specific questions about the parameter(s).

20 - Large-Scale Hypothesis Testing
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 773-804
- Chapter
Summary

We begin this chapter with a review of hypothesis testing from Chapter 12. A hypothesis is a statement about one or more parameters of a model. The null hypothesis is usually a specific statement that encapsulates “no effect.” For example, if we apply one of the two treatments, A or B, to volunteers we may be interested in testing whether the population mean outcomes are equal.
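As a concrete illustration of testing a "no effect" null hypothesis for two treatments, the sketch below uses a permutation test on the difference in sample means. This technique is swapped in for the example and is not claimed to be the method used in the chapter; the function name `permutation_p_value` and the outcome values for treatments A and B are hypothetical.

```python
import random
import statistics

def permutation_p_value(a, b, n_permutations=5000, seed=0):
    """Two-sided permutation p-value for H0: both groups share one distribution.

    Group labels are reshuffled at random, and the p-value is the share of
    reshuffles whose absolute mean difference is at least the observed one.
    """
    rng = random.Random(seed)
    observed = abs(statistics.fmean(a) - statistics.fmean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = abs(
            statistics.fmean(pooled[:len(a)]) - statistics.fmean(pooled[len(a):])
        )
        if diff >= observed:
            hits += 1
    # The +1 terms count the observed labelling itself as one permutation.
    return (hits + 1) / (n_permutations + 1)

# Hypothetical outcomes for volunteers given treatment A or treatment B.
treatment_a = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
treatment_b = [10.0, 11.0, 12.0, 13.0, 14.0, 15.0]

p_separated = permutation_p_value(treatment_a, treatment_b)
p_identical = permutation_p_value(treatment_a, treatment_a)

print(f"clearly separated groups: p = {p_separated:.4f}")
print(f"identical groups:         p = {p_identical:.4f}")
```

A small p-value is evidence against the null hypothesis of equal population means; when the two groups are identical the observed difference is zero and every reshuffle is at least as extreme, so the p-value is 1.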

Preface
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp xiii-xvi
- Chapter

15 - Bayesian Methods
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 574-617
- Chapter
Summary

Up to this point we have been talking about what are often called frequentist methods, because a statistical method is based on properties of its long-run relative frequency. With this approach, the probability of an event is defined as the proportion of times the event occurs in the long run. Parameters, that is values that characterize a distribution, such as the mean and variance of a normal distribution, are considered fixed but unknown.

References
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 805-808
- Chapter

Index
Steven E. Rigdon, Saint Louis University, Missouri, Ronald D. Fricker, Jr, Virginia Polytechnic Institute and State University, Douglas C. Montgomery, Arizona State University
Book:

Introduction to Probability and Statistics for Data Science

Published online:

13 December 2024

Print publication:

14 November 2024, pp 809-812
- Chapter

Distill knowledge of additive tree models into generalized linear models: a new learning approach for non-smooth generalized additive models
Arthur Maillart, Christian Robert
Journal:

Annals of Actuarial Science / Volume 18 / Issue 3 / November 2024

Published online by Cambridge University Press:

14 November 2024, pp. 692-711
- Article
Generalized additive models (GAMs) are a leading model class for interpretable machine learning. GAMs were originally defined with smooth shape functions of the predictor variables and trained using smoothing splines. Recently, tree-based GAMs where shape functions are gradient-boosted ensembles of bagged trees were proposed, leaving the door open for the estimation of a broader class of shape functions (e.g. Explainable Boosting Machine (EBM)). In this paper, we introduce a competing three-step GAM learning approach where we combine (i) the knowledge of the way to split the covariates space brought by an additive tree model (ATM), (ii) an ensemble of predictive linear scores derived from generalized linear models (GLMs) using a binning strategy based on the ATM, and (iii) a final GLM to have a prediction model that ensures auto-calibration. Numerical experiments illustrate the competitive performances of our approach on several datasets compared to GAM with splines, EBM, or GLM with binarsity penalization. A case study in trade credit insurance is also provided.

One-year and ultimate correlations in dependent claims run-off triangles
Łukasz Delong, Marcin Szatkowski
Journal:

Annals of Actuarial Science / Volume 19 / Issue 1 / March 2025

Published online by Cambridge University Press:

14 November 2024, pp. 159-192
- Article
  - Open access
We investigate bottom-up risk aggregation applied by insurance companies facing reserve risk from multiple lines of business. Since risk capitals should be calculated in different time horizons and calendar years, depending on the regulatory or reporting regime (Solvency II vs IFRS 17), we study correlations of ultimate losses and correlations of one-year losses in future calendar years in lines of business. We consider a multivariate version of a Hertig’s lognormal model and we derive analytical formulas for the ultimate correlation and the one-year correlations in future calendar years. Our main conclusion is that the correlation coefficients that should be used in a bottom-up aggregation formula depend on the time horizon and the future calendar year where the risk emerges. We investigate analytically and numerically properties of the ultimate and the one-year correlations, their possible values observed in practice, and the impact of misspecified correlations on the diversified risk capital.

52621 results in Statistics and Probability
