Hostname: page-component-89b8bd64d-j4x9h Total loading time: 0 Render date: 2026-05-13T06:37:47.089Z Has data issue: false hasContentIssue false

Data mining of tuberculosis patient data using multiple correspondence analysis

Published online by Cambridge University Press:  19 May 2009

T. W. RENNIE*
Affiliation:
North East London Tuberculosis Commissioning Unit, Newham Primary Care Trust, London, UK
W. ROBERTS
Affiliation:
North East London Tuberculosis Commissioning Unit, Newham Primary Care Trust, London, UK
*
*Author for correspondence: Dr T. W. Rennie, North East London Tuberculosis Commissioning Unit, Newham Primary Care Trust, Warehouse K, 2 Western Gateway, London E16 1DR, UK. (Email: timothy.rennie@pharmacy.ac.uk)
Rights & Permissions [Opens in a new window]

Summary

The aim of this study was to demonstrate the epidemiological use of multiple correspondence analysis (MCA), as applied to tuberculosis (TB) data from North East London. Data for TB notifications in North East London primary care trusts (PCTs) between the years 2002 and 2007 were used. TB notification data were entered for MCA allowing display of graphical data output (n=4947); MCA analyses were performed on the whole dataset, by PCT, and by year of notification. Graphical MCA output displayed variance of data categories; clustering of variable categories in MCA output signified association. Clustering patterns in MCA output demonstrated different associations by year of notification, within PCTs and between PCTs. MCA is a useful technique for displaying association of variable categories used in TB epidemiology. Results suggest that MCA could be a useful tool in informing commissioning of TB services.

Information

Type
Original Papers
Copyright
Copyright © Cambridge University Press 2009
Figure 0

Table 1. Demographic and clinical TB patient data, 2002–2006

Figure 1

Fig. 1. Multiple correspondence analysis graphical output of TB variable categories. (a) All variables, all years. (b) All variables, all years except ethnicity.

Figure 2

Fig. 2. Multiple correspondence analysis graphical output of TB variable categories by primary care trust (PCT). (a) PCT2; (b) PCT7.

Figure 3

Fig. 3. Multiple correspondence analysis graphical output of TB variable categories by year. (a) 2002; (b) 2007.