Hostname: page-component-89b8bd64d-b5k59 Total loading time: 0 Render date: 2026-05-10T08:48:00.046Z Has data issue: false hasContentIssue false

Uncertainty quantification in breast cancer risk prediction models using self-reported family health history

Published online by Cambridge University Press:  20 January 2017

Lance T. Pflieger
Affiliation:
Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA
Clinton C. Mason
Affiliation:
Department of Pediatrics, University of Utah, Salt Lake City, UT, USA
Julio C. Facelli*
Affiliation:
Department of Biomedical Informatics, University of Utah, Salt Lake City, UT, USA
*
*Address for correspondence: J. C. Facelli, Ph.D., Department of Biomedical Informatics, University of Utah, 421 Wakara Way, Suite 140, Salt Lake City, UT 84112, USA. (Email: julio.facelli@utah.edu)
Rights & Permissions [Opens in a new window]

Abstract

Introduction. Family health history (FHx) is an important factor in breast and ovarian cancer risk assessment. As such, multiple risk prediction models rely strongly on FHx data when identifying a patient’s risk. These models were developed using verified information and when translated into a clinical setting assume that a patient’s FHx is accurate and complete. However, FHx information collected in a typical clinical setting is known to be imprecise and it is not well understood how this uncertainty may affect predictions in clinical settings. Methods. Using Monte Carlo simulations and existing measurements of uncertainty of self-reported FHx, we show how uncertainty in FHx information can alter risk classification when used in typical clinical settings. Results. We found that various models ranged from 52% to 64% for correct tier-level classification of pedigrees under a set of contrived uncertain conditions, but that significant misclassification are not negligible. Conclusions. Our work implies that (i) uncertainty quantification needs to be considered when transferring tools from a controlled research environment to a more uncertain environment (i.e, a health clinic) and (ii) better FHx collection methods are needed to reduce uncertainty in breast cancer risk prediction in clinical settings.

Information

Type
Translational Research, Design and Analysis
Creative Commons
Creative Common License - CCCreative Common License - BYCreative Common License - NCCreative Common License - ND
This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits noncommercial re-use, distribution, and reproduction in any medium, provided the original work is unaltered and is properly cited. The written permission of Cambridge University Press must be obtained for commercial re-use or in order to create a derivative work.
Copyright
© The Association for Clinical and Translational Science 2017
Figure 0

Fig. 1 Sensitivity in lifetime risk estimates of various models to uncertainty in an example pedigree. A hypothetical situation where a proband is assessing her lifetime risk for breast cancer (BR) based on her family knowledge. The proband is marked by the triangle and lifetime risk (risk of developing cancer from age 20 to age 80) is assessed by each model. The proband has a mother with BR with age at onset of 53 years and is uncertain of the cancer status of the 60-year-old aunt. The table on the right shows how each tool evaluates the proband’s lifetime risk under the scenario of the 60-year-old maternal aunt being unaffected for BR or ovarian cancer (OV) in the first column, followed by proband’s lifetime risk for BR under the perturbation that the aunt had affected status for BR or OV at various onset ages. IBIS, International Breast Intervention Study; BOADICEA, Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm.

Figure 1

Table 1 American Cancer Society risk classification strata for breast screening

Figure 2

Fig. 2 Distribution of lifetime breast cancer risk for initial simulated pedigrees and risk estimates following uncertainty perturbation calculated by the Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm (BOADICEA) risk prediction model. Distribution of lifetime breast cancer risk for initial simulated pedigrees utilized in assessing the BOADICEA model. Initial risk categories are noted by color (green=low; yellow=moderate; and red=high). Each risk category contains 50 pedigrees.

Figure 3

Table 2 Average number of affected individuals in the simulated pedigrees by model and degree of relatedness/cancer type

Figure 4

Fig. 3 Effect of uncertainty on initial versus final risk classification for each model. Bars show percentage of pedigree classification from the initial risk strata to the final risk strata (L, low; M, moderate; H, high) determined after adding uncertainty to the initial pedigree according to the average case of uncertainty scenario. Colored bars represent no change in classification, gray indicates a change in classification. Upper-bound and lower-bound bars show best-case and worst-case of uncertainty scenarios. IBIS, International Breast Intervention Study; BOADICEA, Breast and Ovarian Analysis of Disease Incidence and Carrier Estimation Algorithm.

Figure 5

Table 3 Frequency of pedigree replicas that remained in the same risk-tier classification following perturbation with uncertainty

Supplementary material: File

Pflieger supplementary material

Tables S1-S2 and Figure S1

Download Pflieger supplementary material(File)
File 486.2 KB