This chapter covers regression and classification, where the goal is to estimate a quantity of interest (the response) from observed features. In regression, the response is a numerical variable. In classification, it belongs to a finite set of predetermined classes. We begin with a comprehensive description of linear regression and discuss how to leverage it to perform causal inference. Then, we explain under what conditions linear models tend to overfit or to generalize robustly to held-out data. Motivated by the threat of overfitting, we introduce regularization and ridge regression, and discuss sparse regression, where the goal is to fit a linear model that depends on only a small subset of the available features. Then, we introduce two popular linear models for binary and multiclass classification: logistic and softmax regression. At this point, we turn our attention to nonlinear models. First, we present regression and classification trees and explain how to combine them via bagging, random forests, and boosting. Second, we explain how to train neural networks to perform regression and classification. Finally, we discuss how to evaluate classification models.
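Since regularization is central to the discussion above, a minimal sketch of ridge regression may help fix ideas; the simulated data and the use of glmnet below are illustrative assumptions, not code from the chapter.

```r
library(glmnet)

set.seed(1)
n <- 100; p <- 50
X <- matrix(rnorm(n * p), n, p)
y <- X[, 1] - 2 * X[, 2] + rnorm(n)      # response depends on only two of the features

cv_fit <- cv.glmnet(X, y, alpha = 0)     # alpha = 0: ridge (L2) penalty; alpha = 1 would give the lasso used in sparse regression
coef(cv_fit, s = "lambda.min")[1:6, ]    # intercept plus the first few shrunken coefficients
```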
How do I conduct a mixed effects logistic regression of a linguistic variable? This chapter will illustrate the procedures for performing statistical modelling using mixed effects logistic regression with the lme4 package in R. It will review the steps for conducting the analysis, for finding the best model for the feature under study, and what to do with that model once you have found it.
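As a rough sketch of the kind of model the chapter walks through, the following glmer call fits a mixed effects logistic regression with a random intercept for speaker; the data frame d and its columns (variant, style, age, speaker) are hypothetical placeholders, not the chapter's own data.

```r
library(lme4)

# Binary linguistic variant modelled with fixed effects for style and age
# and a random intercept for each speaker (hypothetical data frame `d`)
m <- glmer(variant ~ style + age + (1 | speaker),
           data = d, family = binomial)
summary(m)   # fixed effects, random-effect variance, Wald z tests
```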
A rich and important area of application for linear algebra is machine learning. In machine learning, one aims to achieve an optimized or learned understanding of various kinds of real-world phenomena from collected or observed data, without genuine comprehension of the mechanisms by which such phenomena function. These mechanisms are often impossible or impractical to grasp anyway. In this chapter, we present several introductory and fundamental problems in supervised machine learning, including linear regression, data classification, and logistic regression, together with the associated mathematical and computational methods.
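To make the linear-algebra view of linear regression concrete, here is a small sketch that solves the normal equations directly; the simulated design matrix and coefficients are assumptions for illustration only.

```r
set.seed(2)
X <- cbind(1, matrix(rnorm(200), 100, 2))          # 100 x 3 design matrix with an intercept column
beta_true <- c(1, 2, -3)
y <- X %*% beta_true + rnorm(100)

beta_hat <- solve(crossprod(X), crossprod(X, y))   # solve the normal equations X'X b = X'y
drop(beta_hat)                                     # estimates close to beta_true
```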
In this chapter, new computational models will focus on whether environmental health texts are suitable for parents rather than the general public. Logistic regression models will identify linguistic features that are important contributors to the prediction of the suitability of environmental health materials for parents and caregivers of young children, who are more likely to be affected by environmental health risks such as water pollution, excessive sun exposure, and radiation in natural and indoor environments.
This chapter describes how to characterize data and the distribution of data. We will also describe how the shape of the normal distribution enables hypothesis testing. In the section on regression, we look at how two variables, or two ways of measuring data, are related to each other. We will use simple linear regression as an introduction to multiple regression, the technique used in the development of a number of traditional readability measures. A more sophisticated form of regression, logistic regression, is also discussed; it will be applied in the case studies of Chapters 4 to 6.
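The progression from simple to multiple to logistic regression described above might look like the following in R; the simulated data frame texts and its columns are invented for illustration and are not the readability features used in the chapter.

```r
set.seed(5)
texts <- data.frame(
  avg_sentence_length = rnorm(80, mean = 18, sd = 4),
  pct_rare_words      = runif(80, 0, 0.3)
)
texts$readability <- 10 + 0.8 * texts$avg_sentence_length +
                     25 * texts$pct_rare_words + rnorm(80, sd = 5)
texts$is_easy     <- as.integer(texts$readability < median(texts$readability))

fit_simple   <- lm(readability ~ avg_sentence_length, data = texts)                   # simple linear regression
fit_multiple <- lm(readability ~ avg_sentence_length + pct_rare_words, data = texts)  # multiple regression
fit_logit    <- glm(is_easy ~ avg_sentence_length + pct_rare_words,
                    data = texts, family = binomial)                                  # logistic regression
summary(fit_logit)
```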
Item calibration is an essential issue in modern item response theory based psychological or educational testing. Owing to the popularity of computerized adaptive testing, methods to efficiently calibrate new items have become more important than they were when paper-and-pencil test administration was the norm. Many calibration processes have been proposed and discussed from both theoretical and practical perspectives. Among them, online calibration may be one of the most cost-effective. In this paper, under a variable-length computerized adaptive testing scenario, we integrate the methods of adaptive design, sequential estimation, and measurement error models to solve online item calibration problems. The proposed sequential estimate of item parameters is shown to be strongly consistent and asymptotically normally distributed with a prechosen accuracy. Numerical results show that the proposed method is very promising in terms of both estimation accuracy and efficiency. The results of using calibrated items to estimate latent trait levels are also reported.
A logistic regression model is suggested for estimating the relation between a set of manifest predictors and a latent trait assumed to be measured by a set of k dichotomous items. Usually the estimated subject parameters of latent trait models are biased, especially for short tests. Therefore, the relation between a latent trait and a set of predictors should not be estimated with a regression model in which the estimated subject parameters are used as a dependent variable. Direct estimation of the relation between the latent trait and one or more independent variables is suggested instead. Estimation methods and test statistics for the Rasch model are discussed and the model is illustrated with simulated and empirical data.
In this paper, robustness properties of the maximum likelihood estimator (MLE) and several robust estimators for the logistic regression model with binary responses are analysed. It is found that the MLE and the classical Rao score test can be misleading in the presence of model misspecification, which in the context of logistic regression means either misclassification errors in the responses or extreme data points in the design space. A general framework for robust estimation and testing is presented, along with a robust estimator and a robust testing procedure. They are shown to be less influenced by model misspecification than their classical counterparts, and are finally applied to the analysis of binary data from a study on breastfeeding.
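As a hedged illustration of the contrast between the MLE and a robust fit for binary responses, one option in R is robustbase::glmrob; this is simply one available robust estimator, not necessarily the estimator proposed in the paper, and the breastfeeding data frame bf and its columns are hypothetical.

```r
library(robustbase)

# Classical MLE fit versus a robust fit for the same binary-response model
# (hypothetical data frame `bf` with a binary outcome and two covariates)
fit_mle    <- glm(breastfed ~ smoking + age, data = bf, family = binomial)
fit_robust <- glmrob(breastfed ~ smoking + age, data = bf, family = binomial)

cbind(MLE = coef(fit_mle), Robust = coef(fit_robust))   # compare coefficient estimates
```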
Latent transition models increasingly include covariates that predict prevalence of latent classes at a given time or transition rates among classes over time. In many situations, the covariate of interest may be latent. This paper describes an approach for handling both manifest and latent covariates in a latent transition model. A Bayesian approach via Markov chain Monte Carlo (MCMC) is employed in order to achieve more robust estimates. A case example illustrating the model is provided using data on academic beliefs and achievement in a low-income sample of adolescents in the United States.
Our study aimed to develop and validate a nomogram to assess talaromycosis risk in hospitalized HIV-positive patients. Prediction models were built using data from a multicentre retrospective cohort study in China. On the basis of the inclusion and exclusion criteria, we collected data from 1564 hospitalized HIV-positive patients in four hospitals from 2010 to 2019. Inpatients were randomly assigned to the training or validation group at a 7:3 ratio. To identify the potential risk factors for talaromycosis in HIV-infected patients, univariate and multivariate logistic regression analyses were conducted. Through multivariate logistic regression, we determined ten variables that were independent risk factors for talaromycosis in HIV-infected individuals. A nomogram was developed following the findings of the multivariate logistic regression analysis. For user convenience, a web-based nomogram calculator was also created. The nomogram demonstrated excellent discrimination in both the training and validation groups [area under the ROC curve (AUC) = 0.883 vs. 0.889] and good calibration. The results of the clinical impact curve (CIC) analysis and decision curve analysis (DCA) confirmed the clinical utility of the model. Clinicians will benefit from this simple, practical, and quantitative strategy to predict talaromycosis risk in HIV-infected patients and can implement appropriate interventions accordingly.
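A compressed sketch of this pipeline, multivariate logistic regression followed by a check of discrimination via the AUC, might look as follows; the data frame hiv and the predictor names are placeholders, not the study's actual variables.

```r
library(pROC)

# Multivariate logistic regression on hypothetical predictors (data frame `hiv`)
fit  <- glm(talaromycosis ~ cd4_count + fever + anaemia,
            data = hiv, family = binomial)
pred <- predict(fit, type = "response")     # predicted probabilities

roc_obj <- roc(hiv$talaromycosis, pred)     # ROC curve
auc(roc_obj)                                # area under the ROC curve
```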
This chapter examines the conceptualization and measurement of contact phenomena in the context of bilingualism across various languages. The goal of the chapter is to account for various phonetic contact phenomena in sociolinguistic analysis, as well as providing context for elaborating on quantitative methodologies in sociophonetic contact linguistics. More specifically, the chapter provides a detailed account of global phenomena in modern natural speech contexts, as well as an up-to-date examination of quantitative methods in the field of sociolinguistics. The first section provides a background of theoretical concepts important to the understanding of sociophonetic contact in the formation of sound systems. The following sections focus on several key social factors that play a major part in the sociolinguistic approach to bilingual phonetics and phonology, including language dominance and age of acquisition at the segmental and the suprasegmental levels, as well as topics of language attitudes and perception, and typical quantitative methods used in sociolinguistics.
Taking a simplified approach to statistics, this textbook teaches students the skills required to conduct and understand quantitative research. It provides basic mathematical instruction without compromising on analytical rigor, covering the essentials of research design; descriptive statistics; data visualization; and statistical tests including t-tests, chi-squares, ANOVAs, Wilcoxon tests, OLS regression, and logistic regression. Step-by-step instructions with screenshots are used to help students master the use of the freely accessible software R Commander. Ancillary resources include a solutions manual and figure files for instructors, and datasets and further guidance on using STATA and SPSS for students. Packed with examples and drawing on real-world data, this is an invaluable textbook for both undergraduate and graduate students in public administration and political science.
Access to waste management services is crucial for urban sustainability, impacting public health, environmental well-being, and overall quality of life. This study employs logistic regression analysis on survey data collected from 1,032 household heads residing in Nouakchott, the capital of Mauritania. The survey investigated key household factors that determine access to waste management services. The findings reveal a significant interplay among waste service provision, the presence of cisterns, housing type and size, and access to electricity. There is a socioeconomic disparity in service access, with poorer housing formats such as shacks receiving substandard services. In contrast, areas with robust electrification report better service access, although inconsistencies remain amid power outages. The research highlights the challenges faced by the Nouakchott municipality, particularly rapid growth and inadequate infrastructure, which hinder waste management efficiency. Overall, the results not only illuminate Nouakchott’s unique challenges in service provision but also propose actionable recommendations for a sustainable urban future. These recommendations aim to inform and guide targeted policies for improving living conditions and environmental sustainability in urban Mauritania.
Many of the preceding chapters involved optimization formulations: linear least squares, Procrustes, low-rank approximation, multidimensional scaling. All of these have analytical solutions, like the pseudoinverse for minimum-norm least squares problems and the truncated singular value decomposition for low-rank approximation. But often we need iterative optimization algorithms, for example when no closed-form minimizer exists, or when the analytical solution requires too much computation and/or memory (e.g., the singular value decomposition for large problems). To solve an optimization problem via an iterative method, we start with some initial guess and then the algorithm produces a sequence of iterates that hopefully converges to a minimizer. This chapter describes the basics of gradient-based iterative optimization algorithms, including preconditioned gradient descent (PGD) for the linear LS problem. PGD uses a fixed step size, whereas preconditioned steepest descent uses a line search to determine the step size. The chapter then considers gradient descent and accelerated versions for general smooth convex functions. It applies gradient descent to the machine learning application of binary classification via logistic regression. Finally, it summarizes stochastic gradient descent.
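As a small illustration of fixed-step gradient descent for the linear LS problem, the following sketch uses step size 1/L, with L the largest eigenvalue of A'A; the simulated A and b are assumptions, and this is a plain (unpreconditioned) variant rather than the chapter's PGD.

```r
# Minimize (1/2) ||A x - b||^2 by gradient descent with a fixed step size 1/L
set.seed(3)
A <- matrix(rnorm(200), 50, 4); b <- rnorm(50)
L <- max(eigen(crossprod(A), only.values = TRUE)$values)   # Lipschitz constant of the gradient

x <- rep(0, ncol(A))
for (k in 1:500) {
  g <- crossprod(A, A %*% x - b)   # gradient A'(Ax - b)
  x <- x - g / L                   # fixed step size 1/L
}

# compare with the closed-form (normal equations) solution
max(abs(x - solve(crossprod(A), crossprod(A, b))))
```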
It was January 28, 1986. While the world was watching, just 73 seconds after take-off, the Challenger Space Shuttle exploded, killing all seven astronauts on board. The crew included the teacher Christa McAuliffe, who would have lectured schoolchildren from space. An important factor that contributed to the disaster was the extremely low temperature at launch. “Extreme” here means “well below temperatures experienced at previous launches”. In this chapter, we give a short overview of the errors that contributed to the explosion. These errors range from purely managerial errors to technical as well as statistical errors. Our discussion includes a statistical analysis of the malfunctioning of so-called rubber O-rings as a function of temperature at launch. As a prime example of efficient risk communication, we also recall the press conference at which the physics Nobel Prize winner, Richard Feynman, made his famous “piece-of-rubber-in-ice-water” presentation. This exposed the cause of the accident in all clarity.
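Analyses of the O-ring data of the kind alluded to above are often framed as a binomial logistic regression of distress counts on launch temperature; the sketch below assumes a hypothetical data frame orings with per-launch temperature temp (in °F) and the number of distressed rings damaged out of 6, and is not the chapter's own analysis.

```r
# Binomial logistic regression: O-ring distress count (out of 6) as a function of launch temperature
# (hypothetical data frame `orings` with columns `temp` and `damaged`)
fit <- glm(cbind(damaged, 6 - damaged) ~ temp, data = orings, family = binomial)
summary(fit)

# predicted probability of O-ring distress at a temperature far below previous launches, e.g. 31 °F
predict(fit, newdata = data.frame(temp = 31), type = "response")
```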
The germination percentage (GP) is commonly employed to estimate the viability of a seed population. Statistical methods such as analysis of variance (ANOVA) and logistic regression are frequently used to analyse GP data. While ANOVA has a long history of usage, logistic regression is considered more suitable for GP data due to its binomial nature. However, both methods have inherent issues that require attention. In this study, we address previously unexplored challenges associated with these methods and propose the utilization of a likelihood ratio test as a solution. We demonstrate the advantages of employing the likelihood ratio test for GP data analysis through simulations and real data analysis.
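A minimal sketch of a likelihood ratio test for germination counts, fitted as nested binomial GLMs, might look as follows; the data frame germ and its columns (germinated, n_seeds, treatment) are hypothetical, and this is not necessarily the exact test construction proposed in the study.

```r
# Null model (no treatment effect) versus a model with a treatment factor,
# both fitted as binomial GLMs on germinated counts out of n_seeds per dish
fit0 <- glm(cbind(germinated, n_seeds - germinated) ~ 1,
            data = germ, family = binomial)
fit1 <- glm(cbind(germinated, n_seeds - germinated) ~ treatment,
            data = germ, family = binomial)

anova(fit0, fit1, test = "Chisq")   # likelihood ratio (deviance) test of the treatment effect
```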
Alternating Dat-Nom/Nom-Dat verbs in Icelandic are notorious for instantiating two diametrically opposed argument structures: the Dat-Nom and the Nom-Dat construction. We conduct a systematic study of the relevant verbs to uncover the factors steering the alternation. This involves a comparison of 15 verbs: five alternating verbs and, as controls, five Nom-Dat verbs and five non-alternating Dat-Nom verbs. Our findings show that, when both arguments are full NPs, alternating verbs instantiate the Nom-Dat construction 54% of the time and the Dat-Nom construction 46% of the time on average for four of the five verbs. However, in configurations with a nominative pronoun, the Nom-Dat construction takes precedence over the Dat-Nom construction. Also, for the double-NP configuration, a logistic regression analysis identifies indefiniteness and length as two key predictors, apart from nominative case marking. We demonstrate that the latter systematically correlates with discourse prominence, which we show, upon closer inspection, correlates with topicality.
Chapter 3 demonstrates how the mathematics of turning Ordinary Least Squares (OLS) regression inside out can be generalized to Generalized Linear Models (GLM) including logistic, Poisson, negative binomial, random intercept, and fixed effects models.
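For orientation, minimal R calls corresponding to the GLM variants named above might look as follows; the data frame df and its columns are hypothetical, and these generic one-liners are not the chapter's derivations.

```r
# Hypothetical data frame `df` with a binary outcome, a count outcome, covariates, and a grouping factor
fit_logit   <- glm(y_binary ~ x1 + x2, data = df, family = binomial)   # logistic regression
fit_poisson <- glm(y_count  ~ x1 + x2, data = df, family = poisson)    # Poisson regression
fit_negbin  <- MASS::glm.nb(y_count ~ x1 + x2, data = df)              # negative binomial regression
fit_ranint  <- lme4::glmer(y_binary ~ x1 + (1 | group),
                           data = df, family = binomial)               # random intercept model
```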
As mentioned in the previous chapter, the perceptron does not perform smooth updates during training, which may slow down learning, or cause it to miss good solutions entirely in real-world situations. In this chapter, we discuss logistic regression, a machine learning algorithm that elegantly addresses this problem. We also extend vanilla logistic regression, which was designed for binary classification, to handle multiclass classification. Through logistic regression, we introduce the concept of a cost function (i.e., the function we aim to minimize during training) and gradient descent, the algorithm that implements this minimization procedure.
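To make the cost-function-plus-gradient-descent idea concrete, here is a small sketch that trains binary logistic regression by gradient descent on the average cross-entropy; the simulated data, learning rate, and iteration count are arbitrary choices for illustration, not the chapter's code.

```r
sigmoid <- function(z) 1 / (1 + exp(-z))

set.seed(4)
n <- 200
X <- cbind(1, rnorm(n), rnorm(n))               # design matrix with an intercept column
y <- rbinom(n, 1, sigmoid(X %*% c(-1, 2, -2)))  # simulated binary labels

w  <- rep(0, ncol(X))
lr <- 0.5
for (i in 1:2000) {
  p    <- sigmoid(X %*% w)                      # predicted probabilities
  grad <- crossprod(X, p - y) / n               # gradient of the average cross-entropy cost
  w    <- w - lr * grad                         # gradient descent update
}

# compare with the maximum likelihood fit from glm()
cbind(gradient_descent = drop(w),
      glm              = coef(glm(y ~ X - 1, family = binomial)))
```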