Hostname: page-component-6766d58669-kn6lq Total loading time: 0 Render date: 2026-05-19T22:42:51.146Z Has data issue: false hasContentIssue false

Hyak mortality monitoring system: innovative sampling and estimation methods – proof of concept by simulation

Published online by Cambridge University Press:  05 February 2018

S. J. Clark*
Affiliation:
Department of Sociology, The Ohio State University, Columbus, Ohio, USA MRC/Wits Rural Public Health and Health Transitions Research Unit (Agincourt), Faculty of Health Sciences, University of the Witwatersrand, School of Public Health, Johannesburg, South Africa INDEPTH Network, Accra, Ghana ALPHA Network, London, UK
J. Wakefield
Affiliation:
Department of Statistics, University of Washington Seattle, Washington, USA Department of Biostatistics, University of Washington, Seattle, Washington, USA
T. McCormick
Affiliation:
Department of Statistics, University of Washington Seattle, Washington, USA Department of Sociology, University of Washington, Seattle, Washington, USA
M. Ross
Affiliation:
Department of Biostatistics and Epidemiology, University of Pennsylvania, Philadelphia, Pennsylvania, USA
*
*Address for correspondence: S. J. Clark, Department of Sociology, The Ohio State University, Ohio, USA. (Email: work@samclark.net)
Rights & Permissions [Opens in a new window]

Abstract

Traditionally health statistics are derived from civil and/or vital registration. Civil registration in low- to middle-income countries varies from partial coverage to essentially nothing at all. Consequently the state of the art for public health information in low- to middle-income countries is efforts to combine or triangulate data from different sources to produce a more complete picture across both time and space – data amalgamation. Data sources amenable to this approach include sample surveys, sample registration systems, health and demographic surveillance systems, administrative records, census records, health facility records and others. We propose a new statistical framework for gathering health and population data – Hyak – that leverages the benefits of sampling and longitudinal, prospective surveillance to create a cheap, accurate, sustainable monitoring platform. Hyak has three fundamental components:

  • Data amalgamation: A sampling and surveillance component that organizes two or more data collection systems to work together: (1) data from HDSS with frequent, intense, linked, prospective follow-up and (2) data from sample surveys conducted in large areas surrounding the Health and Demographic Surveillance System (HDSS) sites using informed sampling so as to capture as many events as possible;

  • Cause of death: Verbal autopsy to characterize the distribution of deaths by cause at the population level; and

  • Socioeconomic status (SES): Measurement of SES in order to characterize poverty and wealth.

We conduct a simulation study of the informed sampling component of Hyak based on the Agincourt HDSS site in South Africa. Compared with traditional cluster sampling, Hyak's informed sampling captures more deaths, and when combined with an estimation model that includes spatial smoothing, produces estimates of both mortality counts and mortality rates that have lower variance and small bias.

Information

Type
Statistical Methods
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited
Copyright
Copyright © The Author(s) 2018
Figure 0

Fig. 1. The 20 villages of the Agincourt region with Voronoi tesselations defining neighborhood structure. Gray lines indicate neighboring villages.

Figure 1

Fig. 2. The simulated spatial random effects for the Agincourt region.

Figure 2

Fig. 3. The predicted probabilities of dying for the Agincourt region: (a) young girls, (b) young boys, (c) older girls, (d) older boys.

Figure 3

Table 1. Deaths, bias, variance, MSE for cluster sampling, stratified sampling, Hyak, and optimum sampling for n = 5200

Figure 4

Fig. 4. The distributions of the estimated probability of dying from models I, III, and IV under the Hyak sampling strategy for n = 5200.

Figure 5

Fig. 5. The average village- and strata-specific estimates for the (unobserved) population counts of death plotted against the true values across each of the four models under the Hyak sampling scheme for n = 5200. Plotting symbols indicate village numbers, and colors indicate model number with key in the upper-left plot. The spatial model IV (purple) symbols are in general closest to the y = x line of equality.

Supplementary material: PDF

Clark et al. supplementary material

Appendix

Download Clark et al. supplementary material(PDF)
PDF 370 KB