Hostname: page-component-89b8bd64d-nlwjb Total loading time: 0 Render date: 2026-05-07T22:13:20.152Z Has data issue: false hasContentIssue false

Multiblock modelling to assess the overall risk factors for a composite outcome

Published online by Cambridge University Press:  14 April 2011

S. BOUGEARD*
Affiliation:
Department of Epidemiology, French Agency for Food, Environmental, and Occupational Health Safety (ANSES), Zoopole, Ploufragan, France
C. LUPO
Affiliation:
Department of Epidemiology, French Agency for Food, Environmental, and Occupational Health Safety (ANSES), Zoopole, Ploufragan, France
S. Le BOUQUIN
Affiliation:
Department of Epidemiology, French Agency for Food, Environmental, and Occupational Health Safety (ANSES), Zoopole, Ploufragan, France
C. CHAUVIN
Affiliation:
Department of Epidemiology, French Agency for Food, Environmental, and Occupational Health Safety (ANSES), Zoopole, Ploufragan, France
E. M. QANNARI
Affiliation:
Department of Chemometrics and Sensometrics, Nantes-Atlantic National College of Veterinary Medicine, Food Science and Engineering (ONIRIS), Nantes, France
*
*Author for correspondence: Dr S. Bougeard, Department of Epidemiology, French Agency for Food, Environmental, and Occupational Health Safety (ANSES), Zoopole, BP 53, 22440 Ploufragan, France. (Email: Stephanie.bougeard@anses.fr)
Rights & Permissions [Opens in a new window]

Summary

Research in epidemiology may be concerned with assessing risk factors for complex health issues described by several variables. Moreover, epidemiological data are usually organized in several blocks of variables, consisting of a block of variables to be explained and a large number of explanatory variables organized in meaningful blocks. Usual statistical procedures such as generalized linear models do not allow the explanation of a multivariate outcome, such as a complex disease described by several variables, with a single model. Moreover, it is not easy to take account of the organization of explanatory variables into blocks. Here we propose an innovative method in the multiblock modelling framework, called multiblock redundancy analysis, which is designed to handle most specificities of complex epidemiological data. Overall indices and graphical displays associated with different interpretation levels are proposed. The interest and relevance of multiblock redundancy analysis is illustrated using a dataset pertaining to veterinary epidemiology.

Information

Type
Original Papers
Copyright
Copyright © Cambridge University Press 2011
Figure 0

Fig. 1. Example of multiblock data structure and associated conceptual scheme of multiblock redundancy analysis, highlighting the relationships between the variable blocks (X1, X2, Y) and their associated components (t1(1), t2(1), u(1)) for the first dimension.

Figure 1

Table 1. Contribution of the explanatory variables to the explanation of the four variables of losses Y=[y1, y2, y3, y4], by means of significant incidence rate ratio (IRR) associated with their 95% tolerance interval

Figure 2

Fig. 2. Contribution of the main explanatory variables [variable importance for the projection (VIP)2 ⩾3%] to the explanation of the overall losses (Y), through VIP2 (expressed as percentages) associated with their 95% tolerance interval.

Figure 3

Fig. 3. Contribution of the four explanatory blocks (X1, …, X4) to the explanation of the overall losses (Y), through block importance in the prediction (BIP2; expressed as percentages) associated with their 95% tolerance interval (TI). * Significant association; n.s., non-significant.

Figure 4

Table 2. Contribution of the explanatory blocks (X1, …, X4) to the explanation of each variable of the losses Y=[y1, y2, y3, y4], by means of block importance in the prediction (BIP2 expressed as percentages) associated with their 95% tolerance interval