Geographical variation of sporadic Legionnaires' disease analysed in a grid model

M. RUDBECK; M. R. JEPSEN; I. B. SONNE; S. A. ULDUM; S. VISKUM; K. MØLBAK

doi:10.1017/S0950268809990185

Geographical variation of sporadic Legionnaires' disease analysed in a grid model

Published online by Cambridge University Press: 16 June 2009

M. RUDBECK ,

S. VISKUM and

M. RUDBECK*: Affiliation:
Department of Bacteriology, Mycology, and Parasitology, Statens Serum Institut, Copenhagen, Denmark Department of Occupational and Environmental Medicine, Aarhus University Hospital, Aalborg, Denmark
M. R. JEPSEN: Affiliation:
Department of Epidemiology, Statens Serum Institut, Copenhagen, Denmark
I. B. SONNE: Affiliation:
Department of Environment, Aalborg County, Denmark
S. A. ULDUM: Affiliation:
Department of Bacteriology, Mycology, and Parasitology, Statens Serum Institut, Copenhagen, Denmark
S. VISKUM: Affiliation:
Department of Occupational and Environmental Medicine, Aarhus University Hospital, Aalborg, Denmark
K. MØLBAK: Affiliation:
Department of Epidemiology, Statens Serum Institut, Copenhagen, Denmark
*: *Author for correspondence: Dr M. Rudbeck, Statens Serum Institut, Department of Bacteriology, Mycology, and Parasitology, Artillerivej 5, DK-2300 Copenhagen S, Denmark. (Email: rudbeck@dadlnet.dk)

Article contents

Summary
INTRODUCTION
METHODS
RESULTS
DISCUSSION
References

Rights & Permissions

Summary

The aim was to analyse variation in incidence of sporadic Legionnaires' disease in a geographical information system in three time periods (1990–2005) by the application of a grid model and to assess the model's validity by analysing variation according to grid position. Coordinates of the addresses at time of disease of 606 confirmed cases with Legionnaires' disease were obtained. The incidence was calculated in cells of 10×10 km in 25 different grids superimposed on a map of Denmark. A 95% and 99% threshold was applied to identify cells with excess incidence representing potential clusters. Four cells had excess incidence in all three time periods. The analysis in 25 different grid positions indicated a low risk of overlooking cells with excess incidence in a random grid. The coefficient of variation ranged from 0·08 to 0·11 independent of the threshold. By application of a random grid model we demonstrated that it was possible to detect small areas with excess incidence that were not detected in the present surveillance system.

Keywords

Epidemiology geographical information systems Legionella Legionnaire's disease surveillance

Information

Type: Original Papers
Information: Epidemiology & Infection , Volume 138 , Issue 1 , January 2010 , pp. 9 - 14

DOI: https://doi.org/10.1017/S0950268809990185 [Opens in a new window]
Copyright: Copyright © Cambridge University Press 2009

INTRODUCTION

The proportion of community-acquired pneumonia caused by infection with Legionella spp. has been estimated to be at least 2% in industrialized countries based on a study of hospitalized community-acquired pneumonia [Reference Bohte, van Furth and van den Broek1]. The true incidence of Legionnaires' disease (LD) is unknown because of limitations in both detection and reporting. In Denmark, the incidence of LD has been high and quite stable for the past 15 years with an overall incidence of about 20 cases per million [Reference Joseph2].

LD varies geographically, but internationally most of the described geographical differences are still thought to be due to differences in definitions, diagnostic methods, surveillance systems, or data presentation [Reference Bhopal3]. An indefinable part of the differences seems to be due to variability in host susceptibility, or environmental differences, including differences in Legionella strains and their varying virulence [Reference Joseph2, Reference Bhopal3]. To what extent the incidence may differ according to local sources of infection and virulence of the environmental strains is not well described. Some areas or cities report a higher incidence of LD than for the country in general, possibly with residential water systems as a source [Reference Lim4]. Domestic water systems are known to harbour Legionella and to be sources of LD, especially apartment blocks with complex hot-water systems [Reference Heudorf5].

LD has been a notifiable disease in Denmark since 1980, and throughout the years a high incidence of non-outbreak cases has been recorded. Until the present study, the surveillance system had primarily analysed the incidence in LD at the county level, where the underlying population varies between 200 000 and 650 000 inhabitants. The overall aim of the present study was to explore if the surveillance system could be improved by enhancing its capacity to detect local clusters of LD. Specifically, we used historical data from 1990 to 2005 to analyse variation in the incidence of sporadic LD in time and space in a geographical information system (GIS) by the application of a grid model and assessed the validity of this model by analysing variation according to position of the grids.

METHODS

Study population

The Danish case definition of LD includes: isolation of Legionella by culture, or a ⩾fourfold increase in antibody titre to ⩾1:128 in immunofluorescence antibody test (IFAT) to Legionella pneumophila serogroup (sg) 1, 3, or 6, or detection of L. pneumophila antigen in urine, or a combination of positive Legionella polymerase chain reaction (PCR) on respiratory secretions and a presumptive positive serological result [rise in titre to non-sg 1, 3 or 6 or positive reaction (titre ⩾1:256) for any serogroup] or a borderline positive urinary antigen test.

In the period from 1980 to May 2005, a total of 1964 cases of LD were notified to Statens Serum Institut, Copenhagen, which is the national institute for surveillance of communicable diseases in Denmark. Only sporadic confirmed cases of LD (n=673) were eligible for the present study, and the period 1980–1989 was excluded because of low numbers (n=38) (Table 1).

Table 1. Number of sporadic and confirmed cases of Legionnaires' disease (LD) in Denmark, 1980–2005

* Period 2000–May 2005.

Sporadic cases were defined as patients fulfilling the case definition who had not acquired the infection during travel or at a healthcare facility, or were associated with recognized outbreaks. In the study period, no outbreaks had been identified in Denmark. However, four cases were registered as part of two possible community clusters and three subjects were each registered separately as part of a possible community cluster without further information. The seven subjects were kept in the analyses. In total, we included 606 sporadic cases of LD (1990–2005) with a known address at the time of disease.

Application of the grid model

Every person with LD was registered according to their home address at time of diagnosis linking the surveillance database with the Danish civil registry system which contains historical and current addresses of Danish residents. Almost all addresses in Denmark have been geocoded, providing all postal addresses with x and y coordinates indicating their geographic position to an accuracy of <3 m. The address code is a unique code for any address and includes the municipality code, street code and house number [Reference Hansen, Poulstrup and Briggs6]. The geographical x and y coordinates of every home address were coded in GIS software (ArcView GIS 3.3, India).

The Danish Statistical Office (Danmarks Statistik) supplied the numbers of citizens in a 100×100 m grid for 2 years (1992, 2002). The number of citizens within each cell was used to calculate the incidence of LD in each cell for the time periods 1990–1994, 1995–1999, and 2000–2005.

A grid net composed of 10×10 km cells was superimposed on a map Denmark, and the population and number of cases for each cell were calculated. The calculation was iterated 25 times, shifting the grid net systematically in x and y directions, in 2-km steps, thus resulting in 25 grids. Denmark has a population of 5·4 million (2005). The mean population density of the ~620 populated cells was ~8600. The number of populated cells and mean population density depend slightly on the position of the grid.

We defined cells of interest, representing potential clusters or outbreaks, to be cells with an incidence higher than the upper limit of the 95% and 99% confidence intervals (CI). In each period, a Poisson distribution was used to identify such cells with an excess number of cases. We assumed that the number of cases in each cell had a natural variability that could be described by a Poisson model, and that the number of cases in a cell was independent of the numbers in neighbouring cells. The null hypothesis was that the true underlying rate that generated the cases observed in a given cell was the same as the underlying rate overall in the country [Reference Olsen, Martuzzi and Elliott7].

In the analysis of the sensitivity of the random grid, coefficient of variation was used as a measure of the replicability of the different grid measurements [Reference Dawson and Trapp8]. Coefficient of variation describes the variation in number of cells with excess incidence of LD within each grid for the three time periods. In the analysis of coefficients of variation and in the overall interpretation of the results, we excluded cells with one case only to avoid significance merely because of a low population density. We assumed that only cells with more than one case of LD were of interest in the identification of possible clusters. Stata version 9.2 (StataCorp., USA) was used in the statistical analysis.

The study was approved by the Danish Data Protection Agency.

RESULTS

Geographical variation

Every case of sporadic LD was included in the random grid (Fig. 1) to visually demonstrate the geographical variation in incidence.

Fig. 1. Incidence of sporadic Legionnaires' disease in a random grid. Denmark, three periods during 1990–2005. , >97·5 incidence; , >99·5 incidence. (A larger, colour version of this figure is available online.)

Three cells had excess incidence in two time periods, and four cells had excess incidence (95% CI) in all three time periods when cells with only one case were excluded.

Testing of the grid model

The grid net was shifted systematically by 2 km in x or y directions for every calculation of incidence in the cells (Fig. 2). The number of cells with excess incidence in the different grids varied from 22 to 33 cells in the three periods when the threshold of 95% CI was applied, and varied from 21 to 32 cells when the threshold of 99% CI was applied. Coefficient of variation in number of cells with excess incidence in the three periods was 0·107, 0·086, and 0·076 (95% CI). Coefficient of variation in number of cells with excess incidence in the three periods was similar when the 99% threshold was applied (0·114, 0·094, 0·077). Number of cells with excess incidence through all three periods varied from 1 to 5 with a mean of 3·2 and a coefficient of variation of 0·32 (95% CI). These cells were considered to represent areas with a probable persistently high incidence of LD.

Fig. 2. Testing of variations in offset of the grid model by the application of 25 grids. The figure shows grid position (changes in x and y coordinates in 2-km steps) and number of cells with incidence exceeding a 97·5% threshold.

The position of cells with excess incidence was roughly similar in the different grids.

DISCUSSION

A relatively simple grid model applied on routine surveillance data could visualize a limited number of geographical areas with a high incidence of LD. Repeated excess incidence over time was of special interest to reduce the risk of chance findings. Furthermore, the finding was relatively insensitive to the offset of the grid, as illustrated by a low coefficient of variation.

The use of GIS to improve exposure assessment and assess the spatial distribution of diseases is increasing [Reference Ward and Wartenberg9]. GIS as a tool has been applied in the general registration of infectious diseases [Reference Rolfhamre, Grabowska and Ekdahl10], but to our knowledge has not been described in use in a continuous surveillance model. Examples of previous GIS applications include spatial analysis of several infectious diseases, e.g. Campylobacter infections, which are partly environmentally acquired [Reference Ethelberg11], and several studies of animal diseases [Reference Nygard12–Reference Conraths, Staubach and Tackmann15]. LD is an infectious disease that is only environmentally acquired without human-to-human transmission. LD is therefore an obvious candidate for GIS-based surveillance with the aim of describing and monitoring spatial distribution, both continuously as a tool to detect clusters and also analytically in verified outbreaks. GIS has successfully been used in outbreaks to determine possible sources [Reference Nygard16].

Description of geographical variation of LD according to residence was described in Scotland from 1978 to 1986, but geographical variations have not otherwise been described historically for a lengthy period. These space–time clusters in Scotland in town areas were described according to residential postcode [Reference Bhopal, Diggle and Rowlingson17, Reference Bhopal18]. These studies were before GIS was available. Recently, these findings were reanalysed by the use of GIS in an effort to develop point-pattern modelling regarding the spatial epidemiology of LD [Reference Dunn19]. GIS has been used in France in an ecological study to describe sporadic LD and exposure to an industrial plume of smoke in postcode areas [Reference Che20]. Postcode areas are not always the most relevant areas in geographical analysis of exposure; smaller or more specific units might be more suitable for some analysis.

The application of a simple grid model with cells of 10×10 km enabled us to identify areas with a persistently high incidence of LD over time. There was only limited variation in number of cells with excess incidence when the cells were shifted from one position of the grid to another with coefficients of variation from 0·07 to 0·11 in the different time periods. In interpretation of these observations, it is necessary to be aware of regional variations in both the diagnostic methods and the use of these methods [Reference Den Boer and Yzerman21]. Denmark is a small country with national recommendations for the use of diagnostic methods, and Statens Serum Institut is the only national institute for surveillance and follow-up of any suspected LD cases, which diminishes the risks of differences in the surveillance registration. A former study showed no differences in use of diagnostic methods in a large county of 650 000 inhabitants [Reference Rudbeck and Hansen22]. By chance, we previously detected this high-incidence area in a specific municipality (Randers) of 62 000 inhabitants, and have since performed further studies in the area with comparison to other areas [Reference Rudbeck, Mølbak and Uldum23, Reference Rudbeck, Mølbak and Uldum24]. The same high-incidence area (Randers) was identified in the present study, and it is unlikely that this area represents an artefact. The high-incidence area was not detected by the present surveillance system where routine analyses are limited to the county level.

One issue of concern is based on the fact that the offset of the cells are randomly located, and it is possible to overlook some clusters due to intersections through a possible geographical cluster. To overcome intersection problems a model with variable grids can be used. Although this concept is scientifically sound, the trade-off relates to the fact that large computer calculation capacities are demanded for such iterations. In the present study, the sensitivity analyses with variations of the offsets of the grids suggest acceptable coefficient of variation. This indicates that in some circumstances, depending on purpose, a random grid may be acceptable for description. We used a grid size of 10×10 km based on previous practical experience [Reference Rudbeck and Hansen22]. The relevant size of the cells in the grid will vary, because it depends on both numbers of LD and the population density in the areas. Further studies of interaction between cell sizes, numbers of individuals with disease, and population density are needed. It is also important to address the relevant threshold of excess incidence for detection of clusters.

Only cells with more than one case of LD are relevant in detecting clusters. In the analysis of grid offset we increased the limit to more than one case per cell for cluster definition. Our distribution of cases of LD and population density makes it unnecessary to use other calculations to overcome noise due to low numbers.

As previously mentioned, the persistent high incidence in two neighbouring cells, covering Randers, confirms a previous study [Reference Rudbeck and Hansen22]. This demonstrates to some extent the validity of the model because cells with a known high incidence were detected. Additionally, the present study included two registered community clusters of two cases each. The community cluster in 1999 was detected in our analyses, whereas the community cluster in 2000 was not detected. The cluster in 2000 was in the capital, Copenhagen, within an area with a very high population density. High-density population areas may need a grid net with smaller cells than we have used for the whole country. The cases of LD have been coded and mapped only according to residence; other addresses, e.g. workplace, could be important for mapping in GIS. The sources of a historical cluster can be difficult if not impossible to determine compared to more recent clusters. In future surveillance, it will be possible to register important individual information, and to look for further causal relations or sources. The surveillance may be further improved by including microbiological information such as serogroups and subgroups and other typing results for the causative agents, thereby increasing the possibility of detecting specific clusters, and increasing the possibility of detecting an environmental or domestic source. The intention with the model was to show that historical data in a grid model may be useful in a future surveillance model in GIS as a continuously updated background layer. To make the model practical for surveillance, we kept it simple. It was not intended to take into account any individual risk factors, or to make any formal cluster calculations [Reference Olsen, Martuzzi and Elliott7, Reference Elliott and Wartenberg25].

We have described the geographical variation of sporadic LD in a simple geographical grid model in GIS. By the application of a random grid model and the use of historical data, we have demonstrated that the surveillance of LD may be improved to enhance the ability to detect and specify small areas that need further attention; high-incidence areas not discovered by the present surveillance system of large geographical areas. Surveillance for LD and capacity for cluster detection may be improved by routine application of this tool.

ACKNOWLEDGEMENTS

We thank Den Obelske Familiefond, Aalborg, and Aarhus University Hospital, Aalborg for financial support.

NOTE

Supplementary material accompanies this paper on the Journal's website (http://journals.cambridge.org/hyg).

DECLARATION OF INTEREST

None.

References

REFERENCES

1. Bohte, R, van Furth, R, van den Broek, PJ. Aetiology of community-acquired pneumonia: a prospective study among adults requiring admission to hospital. Thorax 1995; 50: 543–547.CrossRef Google Scholar PubMed

2. Joseph, CA. Legionnaires' disease in Europe 2000–2002. Epidemiology and Infection 2004; 132: 417–424.Google Scholar

3. Bhopal, RS. Geographical variation of Legionnaires' disease: a critique and guide to future research. International Journal of Epidemiology 1993; 22: 1127–1136.CrossRef Google Scholar PubMed

4. Lim, WS, et al. Community-acquired Legionnaires' disease in Nottingham – too many cases? Epidemiology and Infection 2003; 131: 1097–1103.CrossRef Google Scholar PubMed

5. Heudorf, U, et al. Legionellas in domestic warm water – effects on the health of residents [in German]. Gesundheitswesen 2001; 63: 326–334.Google Scholar

6. Hansen, HL, Poulstrup, A. Address geocoding for small area environmentla health studies in Denmark. In: Briggs, DJ, ed. GIS for Emergency Preparedness and Health Risk Reduction, pp. 227–240. The Netherlands: Kluwer Academic Publishers, 2002.Google Scholar

7. Olsen, SF, Martuzzi, M, Elliott, P. Cluster analysis and disease mapping – why, when, and how? A step by step guide. British Medical Journal 1996; 313: 863–866.Google Scholar

8. Dawson, B, Trapp, RG. Basic and Clinical Biostatistics. New York, USA: Lange Medical Books/McGraw-Hill, 2001.Google Scholar

9. Ward, MH, Wartenberg, D. Invited commentary: on the road to improved exposure assessment using geographic information systems. American Journal of Epidemiology 2006; 164: 208–211.CrossRef Google Scholar PubMed

10. Rolfhamre, P, Grabowska, K, Ekdahl, K. Implementing a public web based GIS service for feedback of surveillance data on communicable diseases in Sweden. BMC Infectious Diseases 2004; 4: 17.Google Scholar

11. Ethelberg, S, et al. Spatial distribution and registry-based case-control analysis of Campylobacter infections in Denmark, 1991–2001. American Journal of Epidemiology 2005; 162: 1008–1015.Google Scholar

12. Nygard, K, et al. Association between environmental risk factors and campylobacter infections in Sweden. Epidemiology and Infection 2004; 132: 317–325.Google Scholar

13. Rinaldi, L, et al. Giardias and Cryptosporidium in water buffaloes (Babulus bubalis). Parasitology Research 2007; 100: 1113–1118.CrossRef Google Scholar PubMed

14. Rinaldi, L, et al. Canine faecal contamination and parasitic risk in the city of Naples (southern Italy). BMC Veterinary Research 2006; 2: 29.CrossRef Google Scholar PubMed

15. Conraths, FJ, Staubach, C, Tackmann, K. Statistics and sample design in epidemiological studies of Echinococcus mulitlocularis in fox populations. Acta Tropica 2003; 85: 183–189.Google Scholar

16. Nygard, K, et al. An outbreak of legionnaires disease caused by long-distance spread from an industrial air scrubber in Sarpsborg, Norway. Clinical Infectious Dieases 2008; 46: 61–69.CrossRef Google Scholar PubMed

17. Bhopal, RS, Diggle, P, Rowlingson, B. Pinpointing clusters of apparently sporadic cases of Legionnaires' disease. British Medical Journal 1992; 304: 1022–1027.Google Scholar

18. Bhopal, RS, et al. Proximity of the home to a cooling tower and risk of non-outbreak Legionnaires' disease. British Medical Journal 1991; 302: 378–383.CrossRef Google Scholar PubMed

19. Dunn, CE, et al. Advancing insights into methods for studying environment-health relationships: a multidisciplinary approach to understanding Legionnaires' disease. Health & Place 2006; 13: 677–690.CrossRef Google Scholar PubMed

20. Che, D, et al. Sporadic cases of community acquired legionnaires' disease: an ecological study to identify new sources of contamination. Journal of Epidemiology and Community Health 2003; 57: 466–469.Google Scholar

21. Den Boer, JW, Yzerman, EP. Diagnosis of Legionella infection in Legionnaires' disease. European Journal of Clinical Microbiology and Infectious Diseases 2004; 23: 871–878.Google Scholar

22. Rudbeck, MG, Hansen, HL. Legionella infections in Randers [in Danish]. Ugeskrift for Laeger 2004; 166: 159–162.Google Scholar PubMed

23. Rudbeck, M, Mølbak, K, Uldum, S. High prevalence to Legionella spp. in Danish blood donors. A study in areas with high and average incidence of Legionnaires' disease. Epidemiology and Infection 2008; 136: 257–262.CrossRef Google Scholar PubMed

24. Rudbeck, M, Mølbak, K, Uldum, SA. Dynamics of Legionella antibody levels during 1 year in a healthy population. Published online: 23 December 2008. Epidemiology and Infection. doi:10.1017/S0950268808001684Google Scholar

25. Elliott, P, Wartenberg, D. Spatial epidemiology: current approaches and future challenges. Environmental Health Perspectives 2004; 112: 998–1006.Google Scholar

Table 1. Number of sporadic and confirmed cases of Legionnaires' disease (LD) in Denmark, 1980–2005

Rudbeck supplementary material

Figure.jpg

Image 3.5 MB

Article contents

Geographical variation of sporadic Legionnaires' disease analysed in a grid model

Summary

Keywords

Information

INTRODUCTION

METHODS

Study population

Application of the grid model

RESULTS

Geographical variation

Testing of the grid model

DISCUSSION

ACKNOWLEDGEMENTS

NOTE

DECLARATION OF INTEREST

References

REFERENCES

Rudbeck supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests