Hostname: page-component-89b8bd64d-r6c6k Total loading time: 0 Render date: 2026-05-10T11:58:16.506Z Has data issue: false hasContentIssue false

Improved interpretation of studies comparing methods of dietary assessment: combining equivalence testing with the limits of agreement

Published online by Cambridge University Press:  16 February 2016

Marijka J. Batterham*
Affiliation:
National Institute for Applied Statistics Research Australia, University of Wollongong, Northfields Avenue, Wollongong, NSW 2522, Australia
Christel Van Loo
Affiliation:
Faculty of Social Sciences, University of Wollongong, Wollongong, NSW 2522, Australia
Karen E. Charlton
Affiliation:
Faculty of Science, Medicine and Health, University of Wollongong, Wollongong, NSW 2522, Australia
Dylan P. Cliff
Affiliation:
Faculty of Social Sciences, University of Wollongong, Wollongong, NSW 2522, Australia
Anthony D. Okely
Affiliation:
Faculty of Social Sciences, University of Wollongong, Wollongong, NSW 2522, Australia
*
* Corresponding author: M. J. Batterham, email marijka@uow.edu.au
Rights & Permissions [Opens in a new window]

Abstract

The aim of this study was to demonstrate the use of testing for equivalence in combination with the Bland and Altman method when assessing agreement between two dietary methods. A sample data set, with eighty subjects simulated from previously published studies, was used to compare a FFQ with three 24 h recalls (24HR) for assessing dietary I intake. The mean I intake using the FFQ was 126·51 (sd 54·06) µg and using the three 24HR was 124·23 (sd 48·62) µg. The bias was −2·28 (sd 43·93) µg with a 90 % CI 10·46, 5·89 µg. The limits of agreement (LOA) were −88·38, 83·82 µg. Four equivalence regions were compared. Using the conventional 10 % equivalence range, the methods are shown to be equivalent both by using the CI (−12·4, 12·4 µg) and the two one-sided tests approach (lower t=−2·99 (79 df), P=0·002; upper t=2·06 (79 df), P=0·021). However, we make a case that clinical decision making should be used to set the equivalence limits, and for nutrients where there are potential issues with deficiency or toxicity stricter criteria may be needed. If the equivalence region is lowered to ±5 µg, or ±10 µg, these methods are no longer equivalent, and if a wider limit of ±15 µg is accepted they are again equivalent. Using equivalence testing, acceptable agreement must be assessed a priori and justified; this makes the process of defining agreement more transparent and results easier to interpret than relying on the LOA alone.

Information

Type
Full Papers
Copyright
Copyright © The Authors 2016 
Figure 0

Table 1 Summary of a highly cited sample of the literature assessing agreement of a FFQ with a reference method using the Bland and Altman (BA) method

Figure 1

Fig. 1 Bland and Altman plots with superimposed equivalence intervals and the 90 % CI of the mean difference. (a) Equivalence ±5 µg I, (b) equivalence ±10 µg I, (c) equivalence ±10 % mean I 3×24HR and (d) equivalence ±15 µg I. 3×24HR, average of three 24 h recalls.

Figure 2

Fig. 2 CI plots. (a) CI plot using the mean difference between the 3×24 h recall (24HR) and FFQ and (b) CI plot using the mean I intake in the 3×24HR (124·23 µg). 3×24HR, average of three 24 h recalls.

Figure 3

Fig. 3 95 % CI of the upper and lower limits of agreement (LOA) for the mean bias in I intake (µg) between the 3×24HR and the FFQ.

Figure 4

Table 2 Summary statistics, paired t test, Bland and Altman (BA) limits of agreement (LOA) and equivalence tests for assessing agreement between the 3×24HR and the FFQ

Supplementary material: File

Batterham supplementary material

Appendix 1

Download Batterham supplementary material(File)
File 97.9 KB