Hostname: page-component-77f85d65b8-pztms Total loading time: 0 Render date: 2026-04-20T05:08:51.574Z Has data issue: false hasContentIssue false

From theory to practice – assessing translation of physical fitness research in the emergency department through machine learning and natural language processing

Published online by Cambridge University Press:  21 May 2025

Kristin Morrow
Affiliation:
School of Engineering, University of Virginia, VA, USA
Debajyoti Datta
Affiliation:
School of Engineering, University of Virginia, VA, USA
Lindsey Spiegelman
Affiliation:
Department of Emergency Medicine, University of California, Irvine, CA, USA
Roy Almog
Affiliation:
Department of Emergency Medicine, University of California, Irvine, CA, USA
Kai Zheng
Affiliation:
Department of Informatics, University of California, Irvine, CA, USA
Don Brown
Affiliation:
School of Data Science, University of Virginia, VA, USA
Dan Michael Cooper*
Affiliation:
UCI Institute for Precision Health, University of California, Irvine, CA, USA
*
Corresponding author: Dan M. Cooper; Email: dcooper@hs.uci.edu
Rights & Permissions [Opens in a new window]

Abstract

Background:

A critical challenge for biomedical investigators is the delay between research and its adoption, yet there are few tools that use bibliometrics and artificial intelligence to address this translational gap. We built a tool to quantify translation of clinical investigation using novel approaches to identify themes in published clinical trials from PubMed and their appearance in the natural language elements of the electronic health record (EHR).

Methods:

As a use case, we selected the translation of known health effects of exercise for heart disease, as found in published clinical trials, with the appearance of these themes in the EHR of heart disease patients seen in an emergency department (ED). We present a self-supervised framework that quantifies semantic similarity of themes within the EHR.

Results:

We found that 12.7% of the clinical trial abstracts dataset recommended aerobic exercise or strength training. Of the ED treatment plans, 19.2% related to heart disease. Of these, the treatment plans that included heart disease identified aerobic exercise or strength training only 0.34% of the time. Treatment plans from the overall ED dataset mentioned aerobic exercise or strength training less than 5% of the time.

Conclusions:

Having access to publicly available clinical research and associated EHR data, including clinician notes and after-visit summaries, provided a unique opportunity to assess the adoption of clinical research in medical practice. This approach can be used for a variety of clinical conditions, and if assessed over time could measure implementation effectiveness of quality improvement strategies and clinical guidelines.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press on behalf of Association for Clinical and Translational Science
Figure 0

Figure 1. Data pipeline. Legend: Clinical trials (CT), Gaussian mixture model (GMM).

Figure 1

Figure 2. Top 15 Topics in PubMed clinical trial data: the Y axis shows the words that make up each topic, while the X axis represents how prevalent that topic is to the overall corpus.

Figure 2

Table 1. Heart disease PubMed recommendation table: counts for each recommended intervention

Figure 3

Table 2. Heart disease ICD-10 codes found in the EHR data

Figure 4

Table 3. Similarity search on overall EHR dataset

Figure 5

Table 4. Manual search on heart disease EHR dataset