Hostname: page-component-77f85d65b8-6c7dr Total loading time: 0 Render date: 2026-03-29T06:27:50.982Z Has data issue: false hasContentIssue false

Generating community measures of food purchasing activities using store-level electronic grocery transaction records: an ecological study in Montreal, Canada

Published online by Cambridge University Press:  23 August 2021

Hiroshi Mamiya*
Affiliation:
School of Population and Global Health, Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, 772 Sherbrooke St West, Montreal, QC H3A 1G1, Canada
Alexandra M Schmidt
Affiliation:
School of Population and Global Health, Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, 772 Sherbrooke St West, Montreal, QC H3A 1G1, Canada
Erica EM Moodie
Affiliation:
School of Population and Global Health, Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, 772 Sherbrooke St West, Montreal, QC H3A 1G1, Canada
Yu Ma
Affiliation:
Desautels Faculty of Management, McGill University, Montreal, QC, Canada
David L Buckeridge
Affiliation:
School of Population and Global Health, Department of Epidemiology, Biostatistics, and Occupational Health, McGill University, 772 Sherbrooke St West, Montreal, QC H3A 1G1, Canada
*
*Corresponding author: Email hiroshi.mamiya@mail.mcgill.ca
Rights & Permissions [Opens in a new window]

Abstract

Objective:

Geographic measurement of diets is generally not available at areas smaller than a national or provincial (state) scale, as existing nutrition surveys cannot achieve sample sizes needed for an acceptable statistical precision for small geographic units such as city subdivisions.

Design:

Using geocoded Nielsen grocery transaction data collected from supermarket, supercentre and pharmacy chains combined with a gravity model that transforms store-level sales into area-level purchasing, we developed small-area public health indicators of food purchasing for neighbourhood districts. We generated the area-level indicators measuring per-resident purchasing quantity for soda, diet soda, flavoured (sugar-added) yogurt and plain yogurt purchasing. We then provided an illustrative public health application of these indicators as covariates for an ecological spatial regression model to estimate spatially correlated small-area risk of type 2 diabetes mellitus (T2D) obtained from the public health administrative data.

Setting:

Greater Montreal, Canada in 2012.

Participants:

Neighbourhood districts (n 193).

Results:

The indicator of flavoured yogurt had a positive association with neighbourhood-level risk of T2D (1·08, 95 % credible interval (CI) 1·02, 1·14), while that of plain yogurt had a negative association (0·93, 95 % CI 0·89, 0·96). The indicator of soda had an inconclusive association, and that of diet soda was excluded due to collinearity with soda. The addition of the indicators also improved model fit of the T2D spatial regression (Watanabe–Akaike information criterion = 1765 with the indicators, 1772 without).

Conclusion:

Store-level grocery sales data can be used to reveal micro-scale geographic disparities and trends of food selections that would be masked by traditional survey-based estimation.

Information

Type
Research paper
Copyright
© The Author(s), 2021. Published by Cambridge University Press on behalf of The Nutrition Society
Figure 0

Fig. 1 Map of 193 neighbourhoods and two excluded neighbourhoods in the Census Metropolitan Area of Montreal, and an illustration of sampled and out-of-sample chain retail supermarkets, pharmacies and supercentres, 2012. Grey squares indicate sampled stores with sales data observed. Empty circles indicate out-of-sample stores, thus missing sales data. The location of points does not reflect exact location of stores, but randomly placed within the neighbourhood boundaries stores belong to. Two dotted areas are the federally registered First Nations communities without census data. These areas were excluded, leaving 193 neighbourhoods with white fill for this study. Black lines represent neighbourhood boundaries. Black fills in the inset map represent water. Census Metropolitan Area is defined as the aggregation of contiguous Canadian census subdivisions that are merged if extensive trips of residents occur. , Sampled store; , out-of-sample store; , Federally Registered First Nation Community (excluded area)

Figure 1

Fig. 2 Illustration of analytical approach to generate area-level indicator of purchasing from store-level sales data. Panel (a) illustrates hypothetical geographic location of sampled (squares with black fill) and out-of-sample stores (empty circles), the latter missing sales records. Panel (b) illustrates out-of-sample stores with predicted distribution of sales (dark fill in circles) as well as the sampled stores with observed sales. Panel (c) illustrates the neighbourhood-level store visit probabilities for a hypothetical store (star symbol) as generated by the Huff gravity model, which represent a probabilistic catchment zone of the store in the form of discrete (area-level) surface of store visit probabilities. The product of these area-level visit probabilities and population size in neighbourhood represents weights to split and allocate predictive distribution of sales (for out-of-sample stores) or observed sales (for sampled stores) into surrounding areas, with the quantity of store-level sales fixed. Note that there are as many probability maps as the number of other stores that are not displayed in this map. Solid lines represent boundaries of Montreal neighbourhood

Figure 2

Table 1 Characteristics of store-level transactions by food category for sampled stores in the Census Metropolitan Area of Montreal, 2012*

Figure 3

Table 2 Characteristics of neighbourhoods on which sampled chain stores were located (eighty-three areas) and all neighbourhoods (193 areas) in the Census Metropolitan Area of Montreal, 2011 Canadian National Household Survey

Figure 4

Fig. 3 Fitted and observed log sales of (a) soda, (b) diet soda, (c) flavoured yogurt and (d) plain yogurt in the sales model. Vertical solid line indicates 95 % posterior credible interval of fitted log sales. Dashed line is reference slope, where y = x

Figure 5

Fig. 4 Posterior summary of sales model for each food category. Models for each food category were ran separately. The values of the coefficients represent association with log store-level sales in servings. Store chain indicator represents a retail chain identifier for random effect; unlike a dummy variable in a fixed effect model, there is no baseline category to which the indicator of store chain is compared. Chains 1–5 are pharmacies, chains 6–11 are supermarkets and chains 12 and 13 are supercentres. Fixed effects representing neighbourhood-level predictor of sales were mean cantered and scaled at one standard deviation. The covariate ‘young’ represents the proportion of residents under 18 years old, and the covariate ‘family size’ represents the mean number of family members. The sales models for flavoured and plain yogurts did not include sales in chains 1–5 and 13, as pharmacies and one supercentre chain rarely sold yogurt. , Soda; , diet soda; , flavoured yogurt; , plain yogurt

Figure 6

Fig. 5 Posterior mean of neighbourhood indicator for (a) soda, (b) diet soda, (c) flavoured yogurt and (d) plain yogurt in the Census Metropolitan Area of Montreal, 2012. The greyscale key to the right of each map indicates posterior mean of purchasing quantity per resident in serving. Note that the quantities in the greyscale keys are not standardised across food categories, as the quantities were very different in scale

Figure 7

Fig. 6 Posterior standard deviation of neighbourhood indicator for (a) soda, (b) diet soda, (c) flavoured yogurt and (d) plain yogurt in the Census Metropolitan Area of Montreal, 2012. The greyscale keys to the right of each map indicate posterior standard deviation of purchasing quantity per resident in each neighbourhood. Note that the quantities in the greyscale keys are not standardised across food categories, as the quantities were very different in scale

Figure 8

Table 3 Posterior mean and 95 % credible interval of exponentiated coefficients and model fit of neighbourhood-level (n 193) diabetes risk model, Census Metropolitan Area of Montreal, 2012*

Supplementary material: File

Mamiya et al. supplementary material

Mamiya et al. supplementary material 1
Download Mamiya et al. supplementary material(File)
File 2 MB
Supplementary material: File

Mamiya et al. supplementary material

Mamiya et al. supplementary material 2
Download Mamiya et al. supplementary material(File)
File 31.7 KB
Supplementary material: File

Mamiya et al. supplementary material

Mamiya et al. supplementary material 3
Download Mamiya et al. supplementary material(File)
File 30.5 KB