This Element introduces the basics of Bayesian regression modeling using modern computational tools. It assumes only that the reader has taken a basic statistics course and has seen Bayesian inference at the introductory level of Gill and Bao (2024). Some matrix algebra knowledge is assumed, but the authors walk carefully through the necessary structures at the start of the Element. By the end of the process, readers will fully understand how Bayesian regression models are developed and estimated, including linear and nonlinear versions. The sections cover theoretical principles and real-world applications in order to provide motivation and intuition. Because Bayesian methods are intricately tied to software, code in R and Python is provided throughout.
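As a flavor of the kind of code such an Element pairs with the theory, here is a minimal R sketch of a Bayesian linear regression fit with the rstanarm package; the dataset, formula, and priors are illustrative assumptions, not taken from the Element itself.

    # Minimal Bayesian linear regression sketch (illustrative, not the Element's own code)
    library(rstanarm)

    # Built-in example data: fuel economy modeled as a function of weight and horsepower
    fit <- stan_glm(
      mpg ~ wt + hp,
      data   = mtcars,
      family = gaussian(),
      prior  = normal(0, 2.5),   # weakly informative prior on the coefficients
      chains = 4, iter = 2000, seed = 123
    )

    summary(fit)             # posterior summaries for coefficients and sigma
    posterior_interval(fit)  # credible intervals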
The accumulation of empirical evidence that has been collected in multiple contexts, places, and times requires a more comprehensive understanding of empirical research than is typically required for interpreting the findings from individual studies. We advance a novel conceptual framework in which causal mechanisms are central to characterizing social phenomena that transcend context, place, or time. We distinguish various concepts of external validity, all of which characterize the relationship between the effects produced by mechanisms in different settings. Approaches to evidence accumulation require careful consideration of cross-study features, including theoretical considerations that link constituent studies and measurement considerations about how phenomena are quantified. Our main theoretical contribution is developing uniting principles that constitute the qualitative and quantitative assumptions that form the basis for a quantitative relationship between constituent studies. We then apply our framework to three approaches to studying general social phenomena: meta-analysis, replication, and extrapolation.
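To make the meta-analysis application concrete, a minimal R sketch of a random-effects meta-analysis with the metafor package is shown below; the effect sizes and standard errors are invented for illustration and are not drawn from any constituent studies discussed in the Element.

    # Illustrative random-effects meta-analysis (hypothetical effect sizes)
    library(metafor)

    dat <- data.frame(
      study = paste("Study", 1:5),
      yi    = c(0.20, 0.35, 0.10, 0.28, 0.15),  # estimated effects (hypothetical)
      sei   = c(0.08, 0.12, 0.07, 0.10, 0.09)   # standard errors (hypothetical)
    )

    res <- rma(yi = yi, sei = sei, data = dat, method = "REML")
    summary(res)   # pooled effect and heterogeneity statistics
    forest(res)    # forest plot of study-level and pooled estimates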
In this Element, the authors introduce Bayesian probability and inference for social science students and practitioners starting from the absolute beginning and walk readers steadily through the Element. No previous knowledge is required other than that in a basic statistics course. At the end of the process, readers will understand the core tenets of Bayesian theory and practice in a way that enables them to specify, implement, and understand models using practical social science data. Chapters will cover theoretical principles and real-world applications that provide motivation and intuition. Because Bayesian methods are intricately tied to software, code in both R and Python is provided throughout.
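As a hint of what starting from the absolute beginning looks like in code, here is a small, self-contained R sketch of a conjugate Beta-Binomial update; the prior parameters and data are illustrative assumptions, not examples taken from the Element.

    # Conjugate Beta-Binomial updating: prior Beta(a, b), data = k successes in n trials
    a <- 1; b <- 1        # uniform prior on the success probability (assumption)
    k <- 7; n <- 10       # hypothetical data: 7 successes out of 10 trials

    a_post <- a + k       # posterior shape parameters
    b_post <- b + n - k

    a_post / (a_post + b_post)              # posterior mean
    qbeta(c(0.025, 0.975), a_post, b_post)  # 95% credible interval

    # Plot prior (dashed) and posterior densities
    curve(dbeta(x, a, b), from = 0, to = 1, ylab = "density", lty = 2)
    curve(dbeta(x, a_post, b_post), add = TRUE)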
In this Element, which continues our discussion in Foundations, the authors provide an accessible and practical guide for the analysis and interpretation of Regression Discontinuity (RD) designs that encourages the use of a common set of practices and facilitates the accumulation of RD-based empirical evidence. The focus is on extensions to the canonical sharp RD setup discussed in Foundations. The discussion covers (i) the local randomization framework for RD analysis, (ii) the fuzzy RD design, where compliance with treatment is imperfect, (iii) RD designs with discrete scores, and (iv) multi-dimensional RD designs.
The goal of this Element is to provide a detailed introduction to adaptive inventories, an approach to making surveys adjust dynamically to respondents' answers. This method can help survey researchers measure important latent traits or attitudes accurately while minimizing the number of questions respondents must answer. The Element provides both a theoretical overview of the method and a suite of tools and tricks for integrating it into the normal survey process. It also provides practical advice and direction on how to calibrate, evaluate, and field adaptive batteries, using example batteries that measure a variety of latent traits of interest to survey researchers across the social sciences.
Quantitative social scientists use survival analysis to understand the forces that determine the duration of events. This Element provides a guide to new techniques and models in survival analysis, particularly in three areas: non-proportional covariate effects, competing risks, and multi-state models. It also revisits models for repeated events. The Element promotes multi-state models as a unified framework for survival analysis and highlights the role of general transition probabilities as key quantities of interest that complement traditional hazard analysis. These quantities focus on the long-term probabilities that units will occupy particular states conditional on their current state, and they are central in the design and implementation of policy interventions.
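For readers who want a concrete starting point, the sketch below fits a standard Cox model and checks the proportional-hazards assumption with the survival package in R; the lung dataset and the chosen covariates are illustrative, not the Element's own examples.

    # Cox proportional hazards model and a proportional-hazards check (illustrative)
    library(survival)

    fit <- coxph(Surv(time, status) ~ age + sex, data = lung)
    summary(fit)   # hazard ratios and confidence intervals

    # Test the proportional hazards assumption (relevant to non-proportional effects)
    cox.zph(fit)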
In discrete choice models the relationships between the independent variables and the choice probabilities are nonlinear, depending on both the value of the particular independent variable being interpreted and the values of the other independent variables. Thus, interpreting the magnitude of the effects (the “substantive effects”) of the independent variables on choice behavior requires the use of additional interpretative techniques. Three common techniques for interpretation are described here: first differences, marginal effects and elasticities, and odds ratios. Concepts related to these techniques are also discussed, as well as methods to account for estimation uncertainty. Interpretation of binary logits, ordered logits, multinomial and conditional logits, and mixed discrete choice models such as mixed multinomial logits and random effects logits for panel data is covered in detail. The techniques discussed here are general and can be applied to other models with discrete dependent variables that are not specifically described here.
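To illustrate what a first difference and an odds ratio mean in practice, here is a small R sketch for a binary logit; the data and covariate values are hypothetical and chosen only to show the mechanics.

    # First difference and odds ratios from a binary logit (illustrative data: mtcars)
    fit <- glm(am ~ wt + hp, data = mtcars, family = binomial())

    # Predicted probability at a low vs. high value of wt, holding hp at its mean
    lo <- data.frame(wt = 2.0, hp = mean(mtcars$hp))
    hi <- data.frame(wt = 4.0, hp = mean(mtcars$hp))

    p_lo <- predict(fit, newdata = lo, type = "response")
    p_hi <- predict(fit, newdata = hi, type = "response")

    p_hi - p_lo      # first difference in the choice probability

    exp(coef(fit))   # odds ratios for each covariate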
Text contains a wealth of information about a wide variety of sociocultural constructs. Automated prediction methods can infer these quantities (sentiment analysis is probably the most well-known application). However, there is virtually no limit to the kinds of things we can predict from text: power, trust, and misogyny are all signaled in language. These algorithms easily scale to corpus sizes infeasible for manual analysis. Prediction algorithms have become steadily more powerful, especially with the advent of neural network methods. However, applying these techniques usually requires profound programming knowledge and machine learning expertise. As a result, many social scientists do not apply them. This Element provides the working social scientist with an overview of the most common methods for text classification, an intuition of their applicability, and Python code to execute them. It covers both the ethical foundations of such work and the emerging potential of neural network methods.
Data are not only ubiquitous in society, but are increasingly complex both in size and dimensionality. Dimension reduction offers researchers and scholars the ability to make such complex, high dimensional data spaces simpler and more manageable. This Element offers readers a suite of modern unsupervised dimension reduction techniques along with hundreds of lines of R code, to efficiently represent the original high dimensional data space in a simplified, lower dimensional subspace. Launching from the earliest dimension reduction technique, principal components analysis, and using real social science data, I introduce and walk readers through application of the following techniques: locally linear embedding, t-distributed stochastic neighbor embedding (t-SNE), uniform manifold approximation and projection, self-organizing maps, and deep autoencoders. The result is a well-stocked toolbox of unsupervised algorithms for tackling the complexities of high dimensional data so common in modern society. All code is publicly accessible on GitHub.
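As a sense of the workflow, the sketch below runs the launching-point technique, principal components analysis, in base R; the example data are illustrative rather than the social science data used in the Element.

    # Principal components analysis as a baseline dimension reduction (illustrative)
    X <- scale(USArrests)   # standardize before PCA

    pca <- prcomp(X)
    summary(pca)            # variance explained by each component

    # Two-dimensional representation of the original four-dimensional space
    plot(pca$x[, 1:2], xlab = "PC1", ylab = "PC2")
    text(pca$x[, 1:2], labels = rownames(USArrests), cex = 0.6)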
This Element discusses how shiny, an R package, can help instructors teach quantitative methods more effectively by way of interactive web apps. The interactivity increases instructors' effectiveness by making students more active participants in the learning process, allowing them to engage with otherwise complex material in an accessible, dynamic way. The Element offers four detailed apps that cover two fundamental linear regression topics: estimation methods (least squares, maximum likelihood) and the classic linear regression assumptions. It includes a summary of what the apps can be used to demonstrate, detailed descriptions of the apps' full capabilities, vignettes from actual class use, and example activities. Two other apps pertain to a more advanced topic (LASSO), with similar supporting material. For instructors interested in modifying the apps, the Element also documents the main apps' general code structure, highlights some of the more likely modifications, and goes through what functions need to be amended.
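To give a flavor of the kind of interactivity involved, here is a minimal, hypothetical shiny app, not one of the Element's own apps, that lets students vary the sample size and see how a least squares fit changes.

    # Minimal shiny app: resample data and refit a least squares line (illustrative)
    library(shiny)

    ui <- fluidPage(
      sliderInput("n", "Sample size", min = 10, max = 500, value = 50),
      plotOutput("fitplot")
    )

    server <- function(input, output) {
      output$fitplot <- renderPlot({
        x <- rnorm(input$n)
        y <- 1 + 2 * x + rnorm(input$n)   # true line: intercept 1, slope 2
        plot(x, y)
        abline(lm(y ~ x), lwd = 2)        # fitted least squares line
      })
    }

    shinyApp(ui, server)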
In the age of data-driven problem-solving, applying sophisticated computational tools for explaining substantive phenomena is a valuable skill. Yet, application of methods assumes an understanding of the data, structure, and patterns that influence the broader research program. This Element offers researchers and teachers an introduction to clustering, which is a prominent class of unsupervised machine learning for exploring and understanding latent, non-random structure in data. A suite of widely used clustering techniques is covered in this Element, in addition to R code and real data to facilitate interaction with the concepts. Upon setting the stage for clustering, the following algorithms are detailed: agglomerative hierarchical clustering, k-means clustering, Gaussian mixture models, and, at a higher level, fuzzy C-means clustering, DBSCAN, and partitioning around medoids (k-medoids) clustering.
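For orientation, the sketch below runs two of the covered algorithms, k-means and agglomerative hierarchical clustering, on a standard built-in dataset; the data and the choice of k are illustrative assumptions rather than the Element's own application.

    # k-means and agglomerative hierarchical clustering (illustrative)
    X <- scale(iris[, 1:4])   # numeric features only, standardized

    # k-means with k = 3 (assumed number of clusters)
    km <- kmeans(X, centers = 3, nstart = 25)
    table(km$cluster, iris$Species)   # compare clusters to known labels

    # Agglomerative hierarchical clustering with Ward linkage
    hc <- hclust(dist(X), method = "ward.D2")
    plot(hc, labels = FALSE)
    cutree(hc, k = 3)                 # cut the dendrogram into 3 groups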
Text is everywhere, and it is a fantastic resource for social scientists. However, because it is so abundant, and because language is so variable, it is often difficult to extract the information we want. There is a whole subfield of AI concerned with text analysis (natural language processing). Many of the basic analysis methods developed are now readily available as Python implementations. This Element will teach you when to use which method, the mathematical background of how it works, and the Python code to implement it.
We elaborate a general workflow of weighting-based survey inference, decomposing it into two main tasks. The first is the estimation of population targets from one or more sources of auxiliary information. The second is the construction of weights that calibrate the survey sample to the population targets. We emphasize that these tasks are predicated on models of the measurement, sampling, and nonresponse process whose assumptions cannot be fully tested. After describing this workflow in abstract terms, we then describe in detail how it can be applied to the analysis of historical and contemporary opinion polls. We also discuss extensions of the basic workflow, particularly inference for causal quantities and multilevel regression and poststratification.
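As one concrete instance of the weight-construction step, here is a small R sketch that rakes a sample to assumed population margins with the survey package; the sample, variable names, and population counts are all hypothetical.

    # Raking a survey sample to population margins (hypothetical data and margins)
    library(survey)

    # Hypothetical respondent data with two demographic factors
    samp <- data.frame(
      sex    = factor(sample(c("F", "M"), 500, replace = TRUE)),
      agegrp = factor(sample(c("18-34", "35-64", "65+"), 500, replace = TRUE))
    )

    des <- svydesign(ids = ~1, data = samp, weights = ~1)

    # Population targets (counts) for each margin (assumed, not real)
    pop_sex <- data.frame(sex = c("F", "M"), Freq = c(520, 480))
    pop_age <- data.frame(agegrp = c("18-34", "35-64", "65+"), Freq = c(300, 450, 250))

    raked <- rake(des, sample.margins = list(~sex, ~agegrp),
                  population.margins = list(pop_sex, pop_age))

    summary(weights(raked))   # distribution of the calibrated weights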
Images play a crucial role in shaping and reflecting political life. Digitization has vastly increased the presence of such images in daily life, creating valuable new research opportunities for social scientists. We show how recent innovations in computer vision methods can substantially lower the costs of using images as data. We introduce readers to the deep learning algorithms commonly used for object recognition, facial recognition, and visual sentiment analysis. We then provide guidance and specific instructions for scholars interested in using these methods in their own research.
Building on the Cambridge Element Agent Based Models of Social Life: Fundamentals (Cambridge, 2020), we move on to the next level. We do this by building agent-based models of polarization and ethnocentrism. In the process, we develop: stochastic models, which add a crucial element of uncertainty to human interaction; models of human interactions structured by social networks; and 'evolutionary' models in which agents using more effective decision rules are more likely to survive and prosper than others. The aim is to leave readers with an effective toolkit for building, running, and analyzing agent-based models of social interaction.
Social interactions are rich, complex, and dynamic. One way to understand these is to model interactions that fascinate us. Some of the more realistic and powerful models are computer simulations. Simple, elegant, and powerful tools are available in user-friendly free software to help you design, build, and run your own models of the social interactions that intrigue you, and to do this on the most basic laptop computer. Focusing on a well-known model of housing segregation, this Element is about how to unleash that power, setting out the fundamentals of what is now known as 'agent based modeling'.
In this Element and its accompanying second Element, A Practical Introduction to Regression Discontinuity Designs: Extensions, Matias Cattaneo, Nicolás Idrobo, and Rocío Titiunik provide an accessible and practical guide for the analysis and interpretation of regression discontinuity (RD) designs that encourages the use of a common set of practices and facilitates the accumulation of RD-based empirical evidence. In this Element, the authors discuss the foundations of the canonical Sharp RD design, which has the following features: (i) the score is continuously distributed and has only one dimension, (ii) there is only one cutoff, and (iii) compliance with the treatment assignment is perfect. In the second Element, the authors discuss practical and conceptual extensions to this basic RD setup.
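For a sense of the accompanying software, the sketch below estimates a sharp RD effect with the rdrobust R package, maintained by the same authors; the score, outcome, and cutoff here are simulated for illustration, not drawn from the Element's empirical application.

    # Sharp RD estimation with rdrobust (simulated score and outcome)
    library(rdrobust)

    set.seed(1)
    x <- runif(1000, -1, 1)                                  # running variable (score), cutoff at 0
    y <- 0.5 * x + 0.3 * (x >= 0) + rnorm(1000, sd = 0.2)    # treatment effect of 0.3 at the cutoff

    out <- rdrobust(y = y, x = x, c = 0)   # local polynomial RD estimate
    summary(out)

    rdplot(y = y, x = x, c = 0)            # standard RD plot with binned means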
The rise of the internet and mobile telecommunications has created the possibility of using large datasets to understand behavior at unprecedented levels of temporal and geographic resolution. Online social networks attract the most users, though users of these new technologies provide their data through multiple sources, e.g. call detail records, blog posts, web forums, and content aggregation sites. These data allow scholars to adjudicate between competing theories as well as develop new ones, much as the microscope facilitated the development of the germ theory of disease. Of those networks, Twitter presents an ideal combination of size, international reach, and data accessibility that makes it the preferred platform in academic studies. Acquiring, cleaning, and analyzing these data, however, require new tools and processes. This Element introduces these methods to social scientists and provides scripts and examples for downloading, processing, and analyzing Twitter data.
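As a hedged sketch of what downloading and processing might look like in R, one commonly used entry point has been the rtweet package; the query below is hypothetical, valid API credentials are required, and Twitter's access rules have changed over time, so the Element's own scripts may differ.

    # Downloading and summarizing tweets with rtweet (illustrative; requires API access)
    library(rtweet)

    # Search recent tweets matching a hypothetical query, excluding retweets
    tw <- search_tweets("climate policy", n = 500, include_rts = FALSE)

    # Simple processing: number of tweets per day
    table(as.Date(tw$created_at))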