Introduction
Grief, as a multifaceted biopsychosocial response to bereavement, manifests as a dynamic process that includes emotional distress, cognitive restructuring, and functional adaptation. Existing studies confirm that 80%–90% of bereaved individuals gradually rebuild psychological adaptation through natural regulatory mechanisms, with emotional intensity and duration of emotion aligning with culturally accepted mourning practices (Lundorff, Bonanno, Johannsen, & O’Connor, Reference Lundorff, Bonanno, Johannsen and O’Connor2020; Nielsen, Carlsen, Neergaard, Bidstrup, & Guldin, Reference Nielsen, Carlsen, Neergaard, Bidstrup and Guldin2019). However, approximately 5%–10% of individuals develop pathological grief reactions, recently officially recognized as prolonged grief disorder (PGD) in the ICD-11 and DSM-5-TR, replacing non-standardized terms such as ‘complicated grief’ or ‘traumatic grief’ (Lundorff et al., Reference Lundorff, Bonanno, Johannsen and O’Connor2020; Lundorff, Holmgren, Zachariae, Farver-Vestergaard, & O’Connor, Reference Lundorff, Holmgren, Zachariae, Farver-Vestergaard and O’Connor2017; Nielsen et al., Reference Nielsen, Carlsen, Neergaard, Bidstrup and Guldin2019). Core diagnostic features include persistent (≥6–12 months) and intense grief reactions characterized by: (1) pathological yearning (≥1 symptom, e.g. persistent seeking of reminders of the deceased), (2) separation distress (≥1 symptom, e.g. identity confusion), and (3) functional impairment in social, occupational, or other domains, accompanied by additional comorbid symptoms (≥2 symptoms, e.g. emotional numbness or avoidance) (Prigerson, Reference Prigerson, Shear and Reynolds2022). Notably, PGD is pathologically distinct from depressive disorders and post-traumatic stress disorder (PTSD), with its central theme revolving around loss, contrasting with the mood dysregulation in depression or traumatic re-experiencing typical of PTSD (Djelantik, Smid, Mroz, Kleber, & Boelen, Reference Djelantik, Smid, Mroz, Kleber and Boelen2020).
The global COVID-19 pandemic has precipitated an unprecedented public health crisis, resulting in millions of deaths and reshaping psychopathological trajectories of mental health through multifaceted trauma exposure mechanisms (Sokouti, Shafiee-Kandjani, Sokouti, & Sokouti, Reference Sokouti, Shafiee-Kandjani, Sokouti and Sokouti2023). Beyond its acute mortality, the COVID-19 pandemic has generated substantial and uneven years lived with disability across global, regional, and local settings, highlighting profound health inequalities in its longer-term consequences. Such large-scale and disproportionate health losses provide an important population-level context for understanding the psychological sequelae of pandemic-related bereavement (Shan, Jin, Li, Yang, et al., Reference Shan, Jin, Li, Yang, Zeng, Xie and Collaborators2025; Shan, Jin, Li, Zeng, et al., Reference Shan, Jin, Li, Zeng, Xie and Yang2025). Initially, mass abnormal death events (e.g. in-hospital isolation deaths) and restricted funeral rituals disrupted the reparative function of traditional mourning practices. Additionally, social isolation policies diminished the buffering effects of support systems. Furthermore, pandemic-specific stressors increased the risk of comorbidities between pathological grief and health anxiety (Eisma, Boelen, & Lenferink, Reference Eisma, Boelen and Lenferink2020; Vedder et al., Reference Vedder, Boerner, Stokes, Schut, Boelen and Stroebe2022). Multinational epidemiological studies report a median PGD prevalence of 16.8% (IQR 9.5–22.3) among COVID-19, which is higher than many pre-pandemic prevalence estimates (9.8%, 95% Cl 6.8–14.0) (Lundorff et al., Reference Lundorff, Holmgren, Zachariae, Farver-Vestergaard and O’Connor2017). However, the inconsistencies in reported prevalence and impact estimates during the COVID-19 pandemic underline the need for high-quality evidence synthesis to address these uncertainties.
This meta-analysis aims to comprehensively evaluate the prevalence of PGD in the context of the COVID-19 pandemic. Given the historical heterogeneity in diagnostic criteria before PGD’s formal recognition, this study will exclusively include research utilizing ICD-11 and DSM-5-TR standardized assessment tools to enhance diagnostic accuracy. Our objectives are to provide robust prevalence estimates, identify potential moderators of risk, and inform evidence-based mental health support strategies during global public health crises.
Methods
Search strategy and inclusion criteria
This systematic review and meta-analysis strictly followed PRISMA guidelines and was registered on PROSPERO (CRD42023463720) (Moher, Liberati, Tetzlaff, Altman, & Group, Reference Moher, Liberati, Tetzlaff, Altman and Group2009; Page et al., Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann, Mulrow and Moher2021). A comprehensive search was conducted in PubMed, EMBASE, and the Cochrane Library from database inception to 21/10/2024, employing a language bias mitigation strategy that allowed inclusion of non-English publications after professional translation. Data were first extracted on 24/10/2024. The search strategy used MeSH terms (COVID-19 and Prolonged Grief Disorder) combined with Boolean operators (as detailed in the Supplementary Information). Inclusion criteria required studies to: (1) involve adults (≥18 years) bereaved of spouses, blood relatives, or close social relationships during the COVID-19 pandemic; (2) assess PGD using validated tools such as the Revised Prolonged Grief Disorder Scale (PG-13-R, cut-off >30), Inventory of Complicated Grief (ICG, cut-off >25), or Brief Grief Questionnaire (BGQ, cut-off >4), aligned with diagnostic duration thresholds in ICD-11 (≥6 months) or DSM-5-TR (≥12 months); and (3) include cohort studies or cross-sectional designs. Exclusion criteria comprised studies mixing non-COVID-19-related bereavement cohorts, samples with comorbid severe mental disorders (e.g. schizophrenia spectrum disorders [ICD-10 F20-F29], major depressive disorders [F32-F33]), structured psychological interventions (e.g. CBT, EMDR), non-original research (e.g. case reports, conference abstracts), or gray literature lacking full- access. Studies reporting additional major stressors or trauma exposures were excluded to minimize confounding.
Two independent researchers (LS and QL) screened titles and abstracts, followed by full-text review. Data extraction covered study characteristics (title, first author, year), design, population demographics, PGD assessment tools and thresholds, prevalence estimates (with 95% confidence intervals), and intervention details (for interventional studies). Discrepancies were resolved through third-party arbitration (LX).
Data analysis
Single-arm meta-analyses were conducted to synthesize PGD prevalence rates, reported as pooled percentages with 95% confidence intervals. To stabilize variances and accommodate proportions close to 0 or 1, raw prevalence rates were transformed using the Freeman–Tukey double-arcsine transformation before pooling (Barendregt, Doi, Lee, Norman, & Vos, Reference Barendregt, Doi, Lee, Norman and Vos2013). The DerSimonian–Laird random-effects model and inverse-variance weighting were applied to account for heterogeneity (Higgins, Thompson, Deeks, & Altman, Reference Higgins, Thompson, Deeks and Altman2003). Heterogeneity was quantified using the I 2 statistic (with thresholds: ≤25% = low, 25–75% = moderate, >75% = high), interpreted alongside prediction intervals and clinical context due to limitations in single-rate analyses. Publication bias was assessed via visual inspection of funnel plot symmetry and Egger’s regression, and adjusted using trim-and-fill analysis (Egger, Davey Smith, Schneider, & Minder, Reference Egger, Davey Smith, Schneider and Minder1997). Results were visualized through forest plots, where box sizes reflected study weights and diamond markers indicated pooled effect estimates. Subgroup analyses stratified findings by country, assessment tool, and study design to identify heterogeneity sources. Meta-regression explored associations between PGD prevalence and covariates. For studies evaluating digital interventions, Hedges’ g (adjusted for small-sample bias) was calculated as the standardized mean difference and analyzed via random-effects models. Missing means and standard deviations were estimated from medians and interquartile ranges, with results depicted in forest plots.
Bias and quality assessment
Risk of bias was independently evaluated by two reviewers (LS, QL) using the Newcastle-Ottawa Scale (NOS), assessing domains such as sample representativeness, measurement validity, and confounder control (high quality defined as NOS score ≥ 7) (Peters, Sutton, Jones, Abrams, & Rushton, Reference Peters, Sutton, Jones, Abrams and Rushton2006). Evidence certainty was graded via the GRADE framework, incorporating adjustments for methodological limitations. Disagreements during quality assessment were resolved by a third reviewer (LJM) (Guyatt et al., Reference Guyatt, Oxman, Akl, Kunz, Vist, Brozek and Schunemann2011; Stang, Reference Stang2010).
Results
Characteristics of included studies
A total of 13 studies from Iran, the US, Canada, Australia, Brazil, the UK, and China were included in this systematic review and meta-analysis. The age range of participants varied significantly, with a minimum mean age of 19.29 years and a maximum of 70.32 years, reflecting the distribution of PGD across age groups. Diagnostic criteria were based on DSM-5 (7 studies) and ICD-11 (6 studies). Gender distribution exhibited regional variations, with the male proportion ranging from 11.0% to 55.5% (Figure 1; Table 1).
Flow diagram of study selection. Definitions of exclusion criteria: (1) Wrong study design: Studies not using cross-sectional or cohort designs (e.g. case reports, reviews, experimental studies without observational data on PGD prevalence); (2) Wrong outcome: Studies not reporting PGD-related outcomes (e.g. only reporting general grief symptoms without PGD-specific assessments); (3) Wrong population: Studies not focusing on COVID-19-bereaved populations (e.g. studies on bereavement due to other causes, non-bereaved populations).

The characteristics of the included study (Aliyaki et al., Reference Aliyaki, Momeni, Dolatshahi, Hosseinzadeh, Yousefi and Abdoli2024; Bedford, Trotter, Potter, & Schmidt, Reference Bedford, Trotter, Potter and Schmidt2023; Downar et al., Reference Downar, Parsons, Cohen, Besserer, Adeli, Gratton and Lawlor2022; Harrop et al., Reference Harrop, Medeiros Mirra, Goss, Longo, Byrne, Farnell and Selman2023; Lapenskie et al., Reference Lapenskie, Anderson, Lawlor, Kabir, Noel, Heidinger and Downar2024; Lobb et al., Reference Lobb, Maccallum, Phillips, Agar, Hosie, Breen and Glasgow2024; Lucena et al., Reference Lucena, Alves, Batista, Agra, Lordao and Costa2024; Rodriguez-Villar et al., Reference Rodriguez-Villar, Okegbola, Arevalo-Serrano, Duval, Mathew, Rodriguez-Villar and Prigerson2024; Schneider et al., Reference Schneider, Rerick, Cummings, McLean, Breen and Singer2023; Shevlin et al., Reference Shevlin, Redican, Hyland, Murphy, Karatzias, McBride and Bentall2023a; Stahl et al., Reference Stahl, Kazan, Lazzari, Krafty, Reynolds, Rollman and Gebara2024; Tang & Xiang, Reference Tang and Xiang2021; Tang, Yu, Chen, Fan, & Eisma, Reference Tang, Yu, Chen, Fan and Eisma2021)

PGD prevalence results
Thirteen studies comprising 5,766 participants were included in the analysis of COVID-19-related PGD prevalence. The IPGDS defined diagnostic criteria as follows: score ≥ 4 (indicating ‘often’ or ‘always’) for strict criteria and ≥ 3 for lenient criteria. Two studies by Shevlin et al. (Reference Shevlin, Redican, Hyland, Murphy, Karatzias, McBride and Bentall2023a, Reference Shevlin, Redican, Murphy, Hyland and Karatzias2023b) were separately analyzed under these criteria. Under strict criteria, the random-effects model yielded a pooled proportion of 0.24 (95% CI: 0.13–0.36), with significant variability across studies (range: 0.02–0.49) and high heterogeneity ( I 2 = 99%). Lenient criteria produced a higher pooled proportion of 0.26 (95% CI: 0.18–0.35), with individual estimates ranging from 0.08 to 0.49 (Supplementary Information). Despite minor numerical differences, overlapping confidence intervals between criteria indicated no substantial impact of IPGDS threshold adjustments on overall estimates (Figure 2).
Pooled meta-analysis prevalence rates for COVID-19-related PGD in strict criteria.

Subgroup and meta-regression analyses
In country subgroups, Canada reported higher prevalence estimates than the UK (0.29 [0.23–0.36] vs. 0.16 [0.03–0.35]), while Chinese samples showed the highest prevalence (0.43, 95% CI: 0.33–0.54). A formal test for subgroup differences indicated evidence of regional variation (Q between χ2 = 71.87, df = 6, p < 0.01). Overall heterogeneity across studies was high (I 2 = 98.2%), underscoring substantial between-study variability. Limited sample sizes (n ≤ 2) in regions such as the United States and Australia warrant cautious interpretation. Prevalence estimates did not differ materially between cross-sectional (0.27 [0.16–0.40]) and cohort studies (0.20 [0.07–0.37]) (p = 0.51), suggesting minimal design-related variability (Table 2).
Pooled estimates of PGD prevalence during the COVID-19 pandemic and intervention efficacy

Assessment tools exhibited dose-dependent effects: studies using the TGI-SR reported the highest prevalence (0.44 [0.38–0.49]), significantly exceeding rates derived from PG-13 (0.22 [0.08–0.42]), ICG (0.26 [0.16–0.38]), and IPGDS (0.18 [0.05–0.36]) (p < 0.01). Diagnostic duration comparisons showed no significant difference between ICD-11’s 6-month criterion (0.26 [0.12–0.42]) and DSM-5-TR’s 12-month standard (0.18 [0.09–0.29]) (p = 0.19).
Meta-regression revealed no statistically significant associations of PGD prevalence with mean age (β = −0.0008, p = 0.81) or female proportion (β = 0.0007, p = 0.78). Sensitivity analyses confirmed the robustness of subgroup findings across strict and lenient diagnostic thresholds (Supplementary Information; Figures 3– 6).
Pooled meta-analysis proportion for COVID-19-related PGD in national subgroups under strict criteria.

Pooled meta-analysis proportion for COVID-19-related PGD in follow-up time subgroups under strict criteria.

Pooled meta-analysis proportion for COVID-19-related PGD in scale subgroups under strict criteria.

Pooled meta-analysis proportion for COVID-19-related PGD in study design subgroups under strict criteria.

Efficacy of digital PGD interventions
Digital interventions utilizing online psychotherapy and mobile applications were aimed at alleviating grief symptoms. The pooled Hedges’ g of −0.38 (95% CI: −0.90 to 0.14) under random-effects models did not reach statistical significance but suggested potential clinical utility. The secondary analysis of digital interventions for PGD, including detailed methods, risk of bias assessment, and results, is available in the Supplementary Material (Supplementary Information S3), as this analysis was not pre-specified in the original study objectives.
Risk of bias and GRADE ratings
The GRADE framework rated the evidence certainty as very low, downgraded due to high heterogeneity and limited generalizability. Using the NOS, study quality scores ranged from 6 to 9 (out of 9), with seven studies achieving a total score of nine (indicating high quality). These studies demonstrated robust methodology in participant selection, comparability, and outcome assessment (Supplementary Information).
Discussion
The COVID-19 pandemic has triggered a significant increase in mortality rates, forcing individuals to confront bereavement amidst concurrent pandemic-related stressors. Existing research has documented a surge in psychological challenges such as anxiety and depression during this period. Additionally, disruptions to normative grief processes, caused by restrictions on end-of-life interactions and limitations on funeral rituals, have amplified the risk of developing prolonged grief responses, which can severely impair daily functioning. This underscores the importance of investigating PGD among pandemic-bereaved populations. Although recent studies have explored PGD prevalence and risk factors in various contexts, no systematic review or meta-analysis has examined its epidemiological shifts during large-scale public health emergencies like COVID-19. This study represents the first comprehensive synthesis of PGD prevalence and its determinants across 13 studies involving 5,766 participants in this distinctive context.
As a result of the pandemic, bereavement experiences have been fundamentally altered due to increased mortality rates and fragmented social support systems. Our meta-analysis revealed a pooled PGD proportion of 24% (95% CI: 13–36%) based on strict diagnostic criteria, exceeding most prevalence ranges reported in pre-pandemic works. Using lenient criteria, the estimated prevalence rose slightly 26% (18%–35%). Using Lenferink, van den Munckhof, de Keijser, and Boelen’s (Reference Lenferink, van den Munckhof, de Keijser and Boelen2021) dual-threshold framework, we found a marginal difference of 2% between the ‘moderate’ and ‘strict’ IPGDS criteria as defined by ICD-11, suggesting potential insensitivity in the calibration of diagnostic tools (Eisma et al., Reference Eisma, Boelen and Lenferink2020; Lenferink et al., Reference Lenferink, van den Munckhof, de Keijser and Boelen2021). These findings highlight the impact of structural traumas, e.g. restricted access at the deathbed and disruptions to traditional mourning practices, which may prolong grief pathology beyond typical acute stress responses.
Notably, geographical subgroup analysis revealed an exceptionally high PGD prevalence of 43% in Chinese samples, sharply contrasting with rates of 29% in Canada and 16% in the United Kingdom. This divergence from pre-pandemic cross-cultural patterns may reflect the psychosocial impacts of China’s ‘dynamic zero-COVID’ policy, which intensified social isolation and grief-related loneliness through prolonged lockdown measures. Moreover, regional heterogeneity was statistically significant (p < 0.01), whereas study design (cross-sectional vs. cohort) showed no group differences (p = 0.51), implying universal risk factors among COVID-19-bereaved populations. Sensitivity analyses confirmed the robustness of subgroup results across diagnostic thresholds. However, data gaps in low-income regions, such as Africa and South Asia, and a high-income-country bias (78% of studies) limit global generalizability, necessitating multinational validation (Aliyaki et al., Reference Aliyaki, Momeni, Dolatshahi, Hosseinzadeh, Yousefi and Abdoli2024; Lucena et al., Reference Lucena, Alves, Batista, Agra, Lordao and Costa2024). Beyond these immediate epidemiological findings, the high burden of PGD observed may suggest potential long-term neurocognitive risks. Given that prolonged grief is a chronic stressor linked to cognitive decline precursors, and considering Shan et al.’s findings on COVID-19-related dementia risk, bereaved individuals might face a ‘dual hit’ of viral and psychosocial stressors. This synergy could imply that high PGD prevalence may accelerate cognitive decline, warranting consideration of PGD monitoring as a strategy to mitigate future neurocognitive disorders (Shan, Wang, Crawford, & Holland, Reference Shan, Wang, Crawford and Holland2024).
Methodological heterogeneity in PGD research, resulting from divergent diagnostic criteria, compromises the comparability of results. For instance, DSM-5-TR, which establishes a 12-month duration threshold, yielded a PGD prevalence of 52% (95% CI: 44%–60%), significantly lower than ICD-11’s 76% prevalence with a 6-month threshold (χ2 = 18.3, p < 0.001), indicating moderate diagnostic agreement (κ = 0.51). This subgroup result differs from the overall pooled prevalence reported in the Abstract (DSM-5-TR: 18% vs ICD-11: 26%, p = 0.41), due to heterogeneity in duration thresholds across included studies. When aligning the ICD-11’s duration criterion to 12 months, concordance improved markedly (κ = 0.91), highlighting the critical impact of threshold duration on diagnostic validity. This may suggest that the stricter symptom requirements of DSM-5-TR become more distinguishable from ICD-11 only after the acute grief phase has passed. Additionally, the selection of assessment tools influenced prevalence estimates: studies employing the TGI-SR reported the highest prevalence (44%, 95% CI: 38–49%), compared to those using the PG-13-R (22%, 8–42%) (SMD = 1.23, p = 0.008). Although no significant difference in prevalence was observed between 6- and 12-month follow-ups (26% vs. 18%, p = 0.19), the limited number of studies and potential regional bias necessitate cautious interpretation.
Subgroup and meta-regression analyses further examined demographic influences on PGD prevalence. The disparities observed between Western countries (Canada: 29%, UK: 22%) and Eastern countries (China: 43%) may reflect variations in cultural contexts, support systems, and pandemic policies. Notably, China’s elevated PGD rate contradicts pre-pandemic trends, potentially linked to strict isolation measures that exacerbate grief-related isolation. Both cross-sectional (27%) and cohort studies (20%) displayed comparable prevalence (p = 0.51), suggesting minimal design-related biases. Meta-regression analyses revealed no significant associations with mean age (β = −0.0008, p = 0.81) or the proportion of female participants (β = 0.0007, p = 0.78). Sensitivity analyses confirmed the robustness of subgroup findings across diagnostic thresholds (Supplementary Information).
The formalization of PGD in DSM-5-TR and ICD-11 introduced critical diagnostic divergences. The DSM-5-TR employs an algorithmic approach that prioritizes specificity (core criteria plus at least seven symptoms), while ICD-11 uses a typological method that emphasizes qualitative grief features. Although the ICD-11’s simplicity enhances its applicability across diverse cultural contexts, its lower symptom threshold poses a risk of overdiagnosis compared to prior criteria (e.g. PCBD). Adjusting the ICD-11’s duration criterion to 12 months significantly improved alignment with the DSM-5-TR (κ = 0.91, up from κ = 0.51), emphasizing the critical role of duration in diagnostic accuracy. This aligns with our subgroup analysis, where DSM-5-TR (12-month threshold) yielded a 52% prevalence and ICD-11 (6-month threshold) yielded a 76% prevalence, a difference that narrowed when both were standardized to a 12-month threshold.
According to the NOS, seven studies demonstrated low risk of bias, while five exhibited higher risks. High-quality studies featured internet-based recruitment, strict age inclusion (≥18 years), and employed validated diagnostic tools, ensuring representativeness and comparability. Conversely, lower-quality studies suffered from high attrition rates (>15%), undermining their reliability. The GRADE framework classified the certainty of evidence as very low due to heterogeneity, limited generalizability, and methodological diversity across studies (Dominguez-Rodriguez et al., Reference Dominguez-Rodriguez, Sanz-Gomez, Gonzalez Ramirez, Herdoiza-Arroyo, Trevino Garcia, de la Rosa-Gomez and Miaja2023; Reitsma, Boelen, de Keijser, & Lenferink, Reference Reitsma, Boelen, de Keijser and Lenferink2023; Yu et al., Reference Yu, Liang, Guo, Jiang, Wang, Ke and Liu2024).
Future research should standardize assessment tools, employ randomized designs, and develop AI-driven dynamic scales for real-time symptom monitoring. Integrating biopsychosocial markers via wearable devices and ecological momentary assessment could enhance diagnostic precision, particularly during critical post-loss windows (6–12 months). Establishing multinational cohorts (e.g. WHO grief registry), along with evaluating the long-term efficacy of digital interventions, including machine learning-driven early warning systems and culturally adaptive platforms, could significantly advance evidence-based, personalized care paradigms. Meanwhile, clinical guidelines could evolve to recognize PGD screening as a primary prevention strategy against future dementia and mental health crises. Ultimately, addressing prolonged grief in pandemic-bereaved populations is not only about alleviating current suffering but also about safeguarding the long-term brain health and cognitive resilience of societies recovering from global trauma.
Conclusion
This systematic review and meta-analysis revealed a high pooled prevalence of PGD observed during the COVID-19 pandemic, indicating substantial geographical heterogeneity that may be potentially linked to cultural coping strategies and differences in the intensity of public health policies. Furthermore, the study underscored the sensitivity of PGD prevalence estimates to the choice of assessment tools and diagnostic criteria. While digital interventions demonstrated promising initial outcomes, their long-term clinical efficacy requires further investigation.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S0033291726104541.
Data availability statement
The data that support the findings of this study are available in the supplementary material of this article.
Author contribution
Study concept and design: MXL, LS, LJM; Data analysis: LS, QL, LX; Visualization: LS, LYY; Drafting of the Manuscript: MXL, LS, LJM, QL, LYY, LZN. All authors read and approved the final manuscript.
Funding statement
No funding was received for this research.
Competing interests
All authors declare no competing interests.

