Hostname: page-component-89b8bd64d-7zcd7 Total loading time: 0 Render date: 2026-05-05T18:19:40.124Z Has data issue: false hasContentIssue false

The impact of biomedical innovation on US mortality, 1999–2019: evidence partly based on 286 million descriptors of 27 million PubMed articles

Published online by Cambridge University Press:  23 April 2025

Frank R. Lichtenberg*
Affiliation:
Columbia University, Graduate School of Business, New York, NY, USA, National Bureau of Economic Research, and CESifo
Kriste Krstovski
Affiliation:
Graduate School of Business, Columbia University, New York, NY, USA
*
Corresponding author: Frank R. Lichtenberg; Email: frl1@columbia.edu

Abstract

We investigate whether the diseases for which there was more biomedical innovation had larger 1999–2019 reductions in premature mortality. Biomedical innovation related to a disease is measured by the change in the mean vintage of descriptors of PubMed articles about the disease. We analyze data on 286 million descriptors of 27 million articles about over 800 diseases. Premature mortality from a disease is significantly inversely related to the lagged vintage of descriptors of articles about the disease. In the absence of biomedical innovation, age-adjusted mortality rates would not have declined. Some factors other than biomedical innovation (e.g., a decline in smoking and an increase in educational attainment) contributed to the decline in mortality. But other factors (e.g., a rise in obesity and the prevalence of chronic conditions) contributed to an increase in mortality. Biomedical innovation reduced the mortality of white people sooner than it reduced the mortality of black people.

Information

Type
Research Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press in association with Université catholique de Louvain
Figure 0

Figure 1. Fraction of 2023 MeSH descriptors first appearing in PubMed by year (1940, 1950, …, 2020): unweighted, and weighted by frequency in post-2015 PubMed articles.

Figure 1

Table 1. Top 25 (out of 1028) 3-digit ICD10 diseases, ranked by number of PubMed articles

Figure 2

Table 2. Estimates of ρk from equation (2), based on data on about 600 drugs: n_rxst = ρk n_descriptorss,tk + αs + δt + εst

Figure 3

Figure 2. Estimates of ρk from equation (2), based on data on about 600 drugs: n_rxst = ρk n_descriptorss,t−k + αs + δt + ɛst. Each estimate is from a separate regression. Disturbances were clustered within drugs. n_rxst = the estimated number (in millions) of US outpatient prescriptions for drug (chemical substance) s in year t (t = 1996, 1997, …, 2021). n_descriptorss,t−k = the number of times the descriptor of drug s occurred in PubMed in year t − k (k = 0, 1, 2, …, 20).

Figure 4

Table 3. Estimates of γj from equation (3): ln(aa_mort_ratedt) = γj ln(aa_inc_rated,t−j) + αd + δt + ɛdt

Figure 5

Table 4. Estimates of the effect of descriptor vintage on age-adjusted cancer mortality rate, not controlling and controlling for incidence

Figure 6

Table 5. Years of potential life lost before ages 85, 75, and 65 in 1999 and 2019

Figure 7

Table 6. Estimates of βk from equation (1): ln(mortalitydt) = βk vintage_measured,tk + αd + δt + ɛdt

Figure 8

Figure 3. Estimates of βk from equation (1): ln(mortalitydt) = βk vintage_measured,tk + αd + δt + εdt. (a) vint_mean ==> YPLL85. (b) post1990% ==> YPLL85. (c) vint_mean ==> YPLL75. (d) post1990% ==> YPLL75. (e) vint_mean ==> YPLL65. (f) post1990% ==> YPLL65. Each estimate is from a separate regression. Disturbances were clustered within diseases. Solid squares denote statistically significant estimates; the large solid squares denote the most significant estimates, and hollow squares denote statistically insignificant estimates.

Figure 9

Table 7. Estimates of βk from equation (1): ln(mortalitydt) = βk vintage_measured,tk + αd + δt + ɛdt, by race

Figure 10

Figure 4. Estimates of βk from equation (1): ln(mortalitydt) = βk vintage_measured,tk + αd + δt + ɛdt, by race.

Figure 11

Figure 5. Estimated 1999–2019 changes in age-adjusted mortality rates from all diseases in the presence and absence of biomedical innovation. (a) vint_meand,t−6 ==> ypll85dt. (b) post1990%d,t−6 ==> ypll85dt. (c) vint_meand,t−12 ==> ypll75. (d) post1990%d,t−4 ==> ypll75dt. (e) vint_meand,t−12 ==> ypll65dt. (f) post1990%d,t−14 ==> ypll65dt.

Figure 12

Figure 6. Estimated 1975–2019 change in age-adjusted cancer mortality rate in the presence and absence of changes in cancer incidence and biomedical innovation.

Supplementary material: File

Lichtenberg and Krstovski supplementary material

Lichtenberg and Krstovski supplementary material
Download Lichtenberg and Krstovski supplementary material(File)
File 767 KB