Hostname: page-component-5db58dd55d-xnzfm Total loading time: 0 Render date: 2026-06-01T23:25:49.483Z Has data issue: false hasContentIssue false

Evaluating Google Trends as a proxy for symptom incidence: insights from the winter COVID-19 infection study in England 2023/24

Published online by Cambridge University Press:  28 November 2025

Phoebe Asplin
Affiliation:
UK Health Security Agency, UK University of Warwick, UK
Martyn Fyles
Affiliation:
UK Health Security Agency, UK
Jack Kennedy
Affiliation:
UK Health Security Agency, UK
Thomas Ward
Affiliation:
UK Health Security Agency, UK
Jonathon Mellor*
Affiliation:
UK Health Security Agency, UK
*
Corresponding author: Jonathon Mellor; Email: jonathon.mellor@ukhsa.gov.uk
Rights & Permissions [Opens in a new window]

Abstract

Google Trends is used in research and surveillance as a proxy for community infection incidence. Signals are difficult to validate, as most surveillance biases towards severe outcomes and certain demographics.

Using Winter COVID-19 Infection Study (WCIS) data in England, symptom prevalence is estimated via generalized additive model with multilevel-regression and poststratification. Symptom duration was estimated using interval censored time delay modelling, converting prevalence to incidence. Google Trends and WCIS incidence and growth rates were compared using cross-correlation.

Google Trends and WCIS agreement varied by symptom and age group. The national maximum growth rate cross-correlation for sore throat was 0.81, with 90% prediction intervals of [0.69, 0.90]. Google Trends growth rates generally lagged the WCIS growth rates across symptoms (cough: −5.0 days [−8.0, 0.0], fever: −3.0 days [−6.0, 1.0], loss of smell: −9.0 days [−13, −3.0], shortness of breath: −12 days [−16, −5.0], and sore throat: −4.0 days [−5.0, −2.0]).

This work shows Google Trends and community symptom incidence can align, although substantial variation between symptoms and age groups exists, underscoring utility in predicting other surveillance indicators.

Information

Type
Original Paper
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2025. Published by Cambridge University Press
Figure 0

Table 1. Mean and 90% credible interval of the symptom duration estimate, in days, stratified by age group

Figure 1

Figure 1. Daily symptom incidence over time for Google Trends and WCIS symptoms. Median estimates and 90% prediction intervals are shown. WCIS estimates are stratified by age group and across all ages (national) and presented as a per capita rate per 1,000 population. Google Trends is a relative search volume.

Figure 2

Figure 2. Daily symptom incidence growth rate over time for Google Trends and WCIS symptoms. Median estimates and 90% prediction intervals are shown. WCIS estimates are stratified by age group and across all ages (national). Both Google and WCIS growth rate estimates are presented as a daily percentage change.

Figure 3

Figure 3. Cross-correlation at lags −17 to 17 between the Google Trends symptom growth rate and WCIS growth rates by age group stratification, with median estimate and 90% prediction intervals.

Figure 4

Table 2. Median and 90% prediction interval of the lag, in days, with the maximum cross-correlation between the growth rate of Google Trends and WCIS age stratifications

Figure 5

Table 3. Median and 90% prediction interval of the maximum cross-correlation between the growth rate of Google Trends and WCIS age stratifications

Supplementary material: File

Asplin et al. supplementary material

Asplin et al. supplementary material
Download Asplin et al. supplementary material(File)
File 2.9 MB