Hostname: page-component-5db58dd55d-f6s65 Total loading time: 0 Render date: 2026-06-02T03:33:30.346Z Has data issue: false hasContentIssue false

Assessing and improving research readiness in PCORnet®

Published online by Cambridge University Press:  17 December 2025

Keith Marsolo*
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA Department of Population Health Sciences, Duke University School of Medicine, Durham, NC, USA
Laura Goettinger Qualls
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Darcy Louzao
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Thomas A. Phillips
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA
Adrian F. Hernandez
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA Department of Medicine, Duke University School of Medicine, Durham, NC, USA
Lesley Curtis
Affiliation:
Duke Clinical Research Institute, Duke University School of Medicine, Durham, NC, USA Department of Population Health Sciences, Duke University School of Medicine, Durham, NC, USA
*
Corresponding author: K. Marsolo; Email: keith.marsolo@duke.edu
Rights & Permissions [Opens in a new window]

Abstract

Purpose:

We describe the steps taken to assess and improve the research readiness of data within PCORnet®, specifically focusing on the results of the PCORnet data curation process between Cycle 7 (October 2019) and Cycle 16 (October 2024).

Material and methods:

We describe the process for extending the PCORnet® CDM and for creating data checks.

Results:

We highlight growth in the number of records available across PCORnet between data curation Cycles 7 and 16 (e.g., diagnoses increasing from ∼3.7B to ∼6.9B and laboratory results from ∼7.7B to ∼15.1B among legacy DataMarts), present the current list of data checks and describe performance of the network. We highlight examples of data checks with relatively stable performance (e.g., future dates), those where performance has improved (e.g., RxNorm mapping), and others performance is more variable (e.g., persistence of records).

Conclusion:

Studies are a crucial source of information on the design of new data checks. The attention of PCORnet partners is focused primarily on those metrics that are generally modifiable. A transparent data curation process is an essential component of PCORnet, allowing network partners to learn from one another, while also informing the decisions of study investigators on which sites to include in their projects. The quality issues that exist within PCORnet stem from the way that data are captured within healthcare generally. We have been able to make to make great strides on improving data quality and research readiness. Many of the techniques piloted within PCORnet will be broadly applicable to other efforts.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Association for Clinical and Translational Science
Figure 0

Table 1. Release date of each version of the PCORnet® CDM and the tables (data domains) that were added during each release. Minor releases (e.g., X.1) involve changes to existing tables and do not include the addition of new domains

Figure 1

Table 2. Growth of records across the network over time. Results are shown for the PCORnet legacy DataMarts (EHR-based DataMarts that participated during cycles 7 through 16) and for all EHR-based DataMarts that participated in cycle 16 refresh 2

Figure 2

Figure 1. Growth in foundational data quality checks over time. Each bar represents the start of a new data curation cycle. Listed are the dates the cycle started, corresponding version of the PCORnet® CDM and number of data checks and measures. Data checks are broad rules such as “Values must conform to CDM specifications.” Measures are the number of PCORnet® CDM tables and/or fields affected by the checks.

Figure 3

Table 3. Overview of data checks by cycle and network performance (overall and compared to prior refreshes). To improve readability, we report the results of alternating cycles between cycles 7 and 15

Figure 4

Table 4. Evolution of PCORnet data checks over time. Shown are the data checks that were active as of data curation cycle 16, the type of check (investigative or required), and % of legacy DataMarts (n = 51) with exceptions. One data partner was not approved for the second refresh of cycle 7, and denominators for some data checks are lower if some partners do not have the applicable data

Figure 5

Figure 2. Improvement in data check performance over time. The spike in DC2.08 is due to volumes being affected by the COVID-19 pandemic. The expectation is that all EHR-based DataMarts should be able to eventually pass these checks.

Figure 6

Figure 3. Data checks with relatively stable performance, where the DataMarts with exceptions tend to also be stable. Each line presents the percentage of DataMarts with an exception during a given data curation cycle.

Figure 7

Figure 4. Example of DC4.01, a data check where network performance is stable, but the DataMarts with exceptions tends to change over time. Each row represents a DataMart, and each column a refresh. If a DataMart had an exception for the check, the cell is shaded orange. No exception is shaded green. A gray cell indicates that a DataMart did not submit new data for that refresh or was not approved.

Supplementary material: File

Marsolo et al. supplementary material 1

Marsolo et al. supplementary material
Download Marsolo et al. supplementary material 1(File)
File 930.3 KB
Supplementary material: File

Marsolo et al. supplementary material 2

Marsolo et al. supplementary material
Download Marsolo et al. supplementary material 2(File)
File 34.8 KB