Hostname: page-component-6766d58669-l4t7p Total loading time: 0 Render date: 2026-05-21T04:45:55.053Z Has data issue: false hasContentIssue false

Predicting food insecurity in a pediatric population using the electronic health record

Published online by Cambridge University Press:  28 October 2024

Joseph Rigdon
Affiliation:
Department of Biostatistics and Data Science, Wake Forest School of Medicine, Winston-Salem, USA
Kimberly Montez
Affiliation:
Department of Pediatrics, Wake Forest School of Medicine, Winston-Salem, USA Maya Angelou Center for Health Equity, Wake Forest School of Medicine, Winston-Salem, USA
Deepak Palakshappa
Affiliation:
Department of Pediatrics, Wake Forest School of Medicine, Winston-Salem, USA Maya Angelou Center for Health Equity, Wake Forest School of Medicine, Winston-Salem, USA Department of Epidemiology and Prevention, Wake Forest School of Medicine, Winston-Salem, USA Center for Healthcare Innovation, Wake Forest School of Medicine, Winston-Salem, USA Department of Internal Medicine, Wake Forest School of Medicine, Winston-Salem, USA
Callie Brown
Affiliation:
Department of Pediatrics, Wake Forest School of Medicine, Winston-Salem, USA Department of Epidemiology and Prevention, Wake Forest School of Medicine, Winston-Salem, USA
Stephen M. Downs
Affiliation:
Department of Pediatrics, Wake Forest School of Medicine, Winston-Salem, USA Center for Biomedical Informatics, Wake Forest School of Medicine, Winston-Salem, USA
Laurie W. Albertini
Affiliation:
Department of Pediatrics, Wake Forest School of Medicine, Winston-Salem, USA
Alysha Taxter*
Affiliation:
Department of Pediatric Rheumatology, Nationwide Children’s Hospital, Columbus, USA Department of Clinical Informatics, Nationwide Children’s Hospital, Columbus, USA
*
Corresponding author: A. Taxter; Email: alysha.taxter@nationwidechildrens.org
Rights & Permissions [Opens in a new window]

Abstract

Introduction:

More than 5 million children in the United States experience food insecurity (FI), yet little guidance exists regarding screening for FI. A prediction model of FI could be useful for healthcare systems and practices working to identify and address children with FI. Our objective was to predict FI using demographic, geographic, medical, and historic unmet health-related social needs data available within most electronic health records.

Methods:

This was a retrospective longitudinal cohort study of children evaluated in an academic pediatric primary care clinic and screened at least once for FI between January 2017 and August 2021. American Community Survey Data provided additional insight into neighborhood-level information such as home ownership and poverty level. Household FI was screened using two validated questions. Various combinations of predictor variables and modeling approaches, including logistic regression, random forest, and gradient-boosted machine, were used to build and validate prediction models.

Results:

A total of 25,214 encounters from 8521 unique patients were included, with FI present in 3820 (15%) encounters. Logistic regression with a 12-month look-back using census block group neighborhood variables showed the best performance in the test set (C-statistic 0.70, positive predictive value 0.92), had superior C-statistics to both random forest (0.65, p < 0.01) and gradient boosted machine (0.68, p = 0.01), and showed the best calibration. Results were nearly unchanged when coding missing data as a category.

Conclusions:

Although our models could predict FI, further work is needed to develop a more robust prediction model for pediatric FI.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2024. Published by Cambridge University Press on behalf of Association for Clinical and Translational Science
Figure 0

Table 1. Demographic predictor variables in model training and test data sets

Figure 1

Table 2. Social determinants of health predictor variables in model training and test data sets

Figure 2

Table 3. ICD-10 predictor variables in model training and test data sets

Figure 3

Table 4. Results from 10-fold cross-validation on training set, by geographic level (census block vs. zip code) and look-back time (12, 18, and 24 months)

Figure 4

Figure 1. Discrimination statistics (C-statistics) for logistic regression, random forest, and gradient boosted models in test set to predict food insecurity. Legend: C-stat = C-statistic.

Figure 5

Figure 2. Calibration statistics for logistic regression, random forest, and gradient boosted model in test set to predict food insecurity. Legend: FI = food insecurity.

Figure 6

Table 5. Model performance in test set. Predictors chosen are those at the census block level and a 12-month look-back, the best-performing set in Table 3

Supplementary material: File

Rigdon et al. supplementary material

Rigdon et al. supplementary material
Download Rigdon et al. supplementary material(File)
File 35.8 KB