Hostname: page-component-77f85d65b8-5ngxj Total loading time: 0 Render date: 2026-04-19T18:09:20.111Z Has data issue: false hasContentIssue false

Mining news media for understanding public health concerns

Published online by Cambridge University Press:  23 October 2019

Maryam Zolnoori
Affiliation:
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Ming Huang
Affiliation:
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Christi A. Patten
Affiliation:
Center for Clinical and Translational Science, Community Engagement Program, Mayo Clinic, Rochester, MN, USA Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
Joyce E. Balls-Berry
Affiliation:
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA Mayo Clinic College of Medicine and Science, Rochester, MN, USA
Somaieh Goudarzvand
Affiliation:
School of Computing and Engineering, University of Missouri-Kansas, Kansas City, MO, USA
Tabetha A. Brockman
Affiliation:
Center for Clinical and Translational Science, Community Engagement Program, Mayo Clinic, Rochester, MN, USA Department of Psychiatry and Psychology, Mayo Clinic, Rochester, MN, USA
Elham Sagheb
Affiliation:
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
Lixia Yao*
Affiliation:
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA
*
Address for correspondence: L. Yao, PhD, Department of Health Sciences Research, Mayo Clinic, 200 First Street SW, Rochester, MN 55905, USA. Email: lixia.cn.yao@gmail.com
Rights & Permissions [Opens in a new window]

Abstract

Introduction:

News media play an important role in raising public awareness, framing public opinions, affecting policy formulation, and acknowledgment of public health issues. Traditional qualitative content analysis for news sentiments and focuses are time-consuming and may not efficiently convey sentiments nor the focuses of news media.

Methods:

We used descriptive statistics and state-of-art text mining to conduct sentiment analysis and topic modeling, to efficiently analyze over 3 million Reuters news articles during 2007–2017 for identifying their coverage, sentiments, and focuses for public health issues. Based on the top keywords from public health scientific journals, we identified 10 major public health issues (i.e., “air pollution,” “alcohol drinking,” “asthma,” “depression,” “diet,” “exercise,” “obesity,” “pregnancy,” “sexual behavior,” and “smoking”).

Results:

The news coverage for seven public health issues, “Smoking,” “Exercise,” “Alcohol drinking,” “Diet,” “Obesity,” “Depression,” and “Asthma” decreased over time. The news coverage for “Sexual behavior,” “Pregnancy,” and “Air pollution” fluctuated during 2007–2017. The sentiments of the news articles for three of the public health issues, “exercise,” “alcohol drinking,” and “diet” were predominately positive and associated such as “energy.” Sentiments for the remaining seven public health issues were mainly negative, linked to negative terms, e.g., diseases. The results of topic modeling reflected the media’s focus on public health issues.

Conclusions:

Text mining methods may address the limitations of traditional qualitative approaches. Using big data to understand public health needs is a novel approach that could help clinical and translational science awards programs focus on community-engaged research efforts to address community priorities.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright
© The Association for Clinical and Translational Science 2019
Figure 0

Fig. 1. A schematic view of methods for mining Reuters news. MeSH, Medical Subject Heading; UMLS, Unified Medical Language System.

Figure 1

Table 1. Frequencies of MeSH terms related to public health

Figure 2

Fig. 2. Normalized numbers of articles and Google Trends searches for the 10 public health issues over time. The numbers are normalized to the highest point on each subfigure. A value of 100 represents the peak popularity for the public health issue.

Figure 3

Fig. 3. Counts of news articles with positive, neutral, and negative sentiments toward 10 public health issues.

Figure 4

Fig. 4. Sentiment scores of news media toward 10 public health issues over 11 years (2007–2017).

Figure 5

Fig. 5. Word clouds of five meaningful topics identified in news articles related to the public health issues, “smoking” and “alcohol drinking.”

Supplementary material: File

Zolnoori et al. Supplementary Materials

Zolnoori et al. Supplementary Materials 1

Download Zolnoori et al. Supplementary Materials(File)
File 52.3 KB
Supplementary material: File

Zolnoori et al. Supplementary Materials

Zolnoori et al. Supplementary Materials 2

Download Zolnoori et al. Supplementary Materials(File)
File 33.4 KB
Supplementary material: File

Zolnoori et al. Supplementary Materials

Zolnoori et al. Supplementary Materials 3

Download Zolnoori et al. Supplementary Materials(File)
File 11.4 KB
Supplementary material: File

Zolnoori et al. Supplementary Materials

Zolnoori et al. Supplementary Materials 4

Download Zolnoori et al. Supplementary Materials(File)
File 12.4 KB