Hostname: page-component-89b8bd64d-5bvrz Total loading time: 0 Render date: 2026-05-12T06:42:05.700Z Has data issue: false hasContentIssue false

Validation of a province-wide commercial food store dataset in a heterogeneous predominantly rural food environment

Published online by Cambridge University Press:  16 April 2020

Nathan GA Taylor*
Affiliation:
Faculty of Health, School of Health Administration, Dalhousie University, 5850 College Street, PO Box 15000, Halifax, NS B3H 4R2, Canada
Jillian Stymest
Affiliation:
Faculty of Health, School of Health Administration, Dalhousie University, 5850 College Street, PO Box 15000, Halifax, NS B3H 4R2, Canada
Catherine L Mah
Affiliation:
Faculty of Health, School of Health Administration, Dalhousie University, 5850 College Street, PO Box 15000, Halifax, NS B3H 4R2, Canada Dalla Lana School of Public Health, University of Toronto, 155 College Street, Toronto, ON M5T 3M7, Canada
*
*Corresponding author: Email nathan.taylor@dal.ca
Rights & Permissions [Opens in a new window]

Abstract

Objective:

Commercially available business (CAB) datasets for food environments have been investigated for error in large urban contexts and some rural areas, but there is a relative dearth of literature that reports error across regions of variable rurality. The objective of the current study was to assess the validity of a CAB dataset using a government dataset at the provincial scale.

Design:

A ground-truthed dataset provided by the government of Newfoundland and Labrador (NL) was used to assess a popular commercial dataset. Concordance, sensitivity, positive-predictive value (PPV) and geocoding errors were calculated. Measures were stratified by store types and rurality to investigate any association between these variables and database accuracy.

Setting:

NL, Canada.

Participants:

The current analysis used store-level (ecological) data.

Results:

Of 1125 stores, there were 380 stores that existed in both datasets and were considered true-positive stores. The mean positional error between a ground-truthed and test point was 17·72 km. When compared with the provincial dataset of businesses, grocery stores had the greatest agreement, sensitivity = 0·64, PPV = 0·60 and concordance = 0·45. Gas stations had the least agreement, sensitivity = 0·26, PPV = 0·32 and concordance = 0·17. Only 4 % of commercial data points in rural areas matched every criterion examined.

Conclusions:

The commercial dataset exhibits a low level of agreement with the ground-truthed provincial data. Particularly retailers in rural areas or belonging to the gas station category suffered from misclassification and/or geocoding errors. Taken together, the commercial dataset is differentially representative of the ground-truthed reality based on store-type and rurality/urbanity.

Information

Type
Short Communication
Copyright
© The Authors 2020
Figure 0

Table 1 Classification and geospatial error of true-positive stores when comparing a provincial dataset with the DMTI enhanced points of interest in Newfoundland and Labrador, Canada (n 380)

Figure 1

Table 2 Sensitivity, positive predictive value and concordance between a government dataset and the DMTI enhanced points of interest in Newfoundland and Labrador, Canada (n 1124)

Figure 2

Table 3 True positives (TP), false positives (FP) and false negatives (FN) based on store type, ownership and population centre class when comparing a provincial dataset with the DMTI enhanced points of interest in Newfoundland and Labrador, Canada (n 1124)