Hostname: page-component-89b8bd64d-dvtzq Total loading time: 0 Render date: 2026-05-08T19:25:33.381Z Has data issue: false hasContentIssue false

Comparing Bayesian Variable Selection to Lasso Approaches for Applications in Psychology

Published online by Cambridge University Press:  01 January 2025

Sierra A. Bainter*
Affiliation:
University of Miami
Thomas G. McCauley
Affiliation:
University of California San Diego
Mahmoud M. Fahmy
Affiliation:
University of Miami
Zachary T. Goodman
Affiliation:
University of Miami
Lauren B. Kupis
Affiliation:
University of California Los Angeles
J. Sunil Rao
Affiliation:
University of Miami
*
Correspondence should be made to Sierra A. Bainter, Department of Psychology, University of Miami, 5665 Ponce de Leon Blvd, Coral Gables, FL 33146, USA. Email: sbainter@miami.edu
Rights & Permissions [Opens in a new window]

Abstract

In the current paper, we review existing tools for solving variable selection problems in psychology. Modern regularization methods such as lasso regression have recently been introduced in the field and are incorporated into popular methodologies, such as network analysis. However, several recognized limitations of lasso regularization may limit its suitability for psychological research. In this paper, we compare the properties of lasso approaches used for variable selection to Bayesian variable selection approaches. In particular we highlight advantages of stochastic search variable selection (SSVS), that make it well suited for variable selection applications in psychology. We demonstrate these advantages and contrast SSVS with lasso type penalization in an application to predict depression symptoms in a large sample and an accompanying simulation study. We investigate the effects of sample size, effect size, and patterns of correlation among predictors on rates of correct and false inclusion and bias in the estimates. SSVS as investigated here is reasonably computationally efficient and powerful to detect moderate effects in small sample sizes (or small effects in moderate sample sizes), while protecting against false inclusion and without over-penalizing true effects. We recommend SSVS as a flexible framework that is well-suited for the field, discuss limitations, and suggest directions for future development.

Information

Type
Original Research
Creative Commons
Creative Common License - CCCreative Common License - BY
This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Copyright
Copyright © 2023 The Author(s)
Figure 0

Table 1 Self-report and selected brain connectivity measures and coefficients—depression example.

Figure 1

Figure 1. Pattern of Marginal Inclusion Probabilities from SSVS for motivating example data.

Figure 2

Table 2 Simulation design.

Figure 3

Table 3 Rates of inclusion by variable selection method, effect size, sample size, correlations among predictors, and pattern of true effects.

Figure 4

Figure 2. Average coefficient estimates by method, effect size, and sample size for conditions with uncorrelated predictors. Note. True parameter values are shown by the dashed horizontal line. Mean estimates are shown as symbols by each method with 95% intervals of observed estimates. SSVS average coefficient estimates are inclusive of both zero and non-zero values.

Figure 5

Figure 3. Average coefficient estimates by method, effect size, and sample size for conditions with correlated predictors, mixed true effects. Note. True parameter values are shown by the dashed horizontal line. Mean estimates are shown as symbols by each method with 95% intervals of observed estimates. SSVS average coefficient estimates are inclusive of both zero and non-zero values.

Figure 6

Figure 4. Average coefficient estimates by method, effect size, and sample size for conditions with correlated predictors, clustered true effects. Note.Y-axis scaling is equivalent to previous Figs. 2 and 3, however note that true effect size decreases as the correlation among true effects increases. True parameter values are shown by the dashed horizontal line. Mean estimates are shown as symbols by each method with 95% intervals of observed estimates. SSVS average coefficient estimates are inclusive of both zero and non-zero values.

Supplementary material: File

Bainter et al. supplementary material

Bainter et al. supplementary material
Download Bainter et al. supplementary material(File)
File 95.6 KB