Hostname: page-component-77f85d65b8-9nbrm Total loading time: 0 Render date: 2026-04-12T08:30:07.679Z Has data issue: false hasContentIssue false

The dual diversity crisis in EEG biomarker research for cognitive fatigue

Published online by Cambridge University Press:  27 March 2026

Thorsten Rudroff*
Affiliation:
PET Centre, Turku University Hospital, Turku, Finland
*
Corresponding author: Thorsten Rudroff; Email:thrudr@utu.fi
Rights & Permissions [Opens in a new window]

Abstract

Information

Type
Correspondence
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (http://creativecommons.org/licenses/by/4.0), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press

Linnhoff et al. present compelling evidence for the frontal aperiodic exponent as a cognitive fatigue biomarker (71% sensitivity, 74% specificity, area under the curve [AUC] = 0.78) (Linnhoff, Cohen Kadosh, & Zaehle, Reference Linnhoff, Cohen Kadosh and Zaehle2026). However, the study exemplifies what I term the “dual diversity crisis”: simultaneous neglect of population diversity (who was studied) and brain diversity (neurobiological heterogeneity within groups). These constitute validity threats that may fundamentally constrain clinical translation.

Beyond gender and age, demographic characterization remains limited – no information on race, ethnicity, socioeconomic status, or education. Recruitment from a single site in Magdeburg, Germany, likely produced a demographically homogeneous sample. If proposed cutoff values derive from such a sample, we cannot assume they generalize to diverse populations. The authors claim “transdiagnostic” value across Multiple Sclerosis (MS) and Long coronavirus disease (COVID), but transdiagnostic ≠ demographically universal. A biomarker performing consistently across two conditions within one population provides no evidence that it will perform equivalently across racial, ethnic, or socioeconomic groups. Previous research demonstrates systematic differences in brain structure and function across demographic groups (Falk et al., Reference Falk, Hyde, Mitchell, Faul, Gonzalez, Heitzeg, Keating, Langa, Martz, Maslowsky, Morrison, Noll, Patrick, Pfeffer, Reuter-Lorenz, Thomason, Davis-Kean, Monk and Schulenberg2013; Chiao & Cheon, Reference Chiao and Cheon2010).

If deployed clinically without proper validation, this biomarker risks systematic misclassification, potentially disadvantaging underserved groups. Moreover, with 119 participants distributed across four groups, statistical power to detect biomarker performance differences across demographic or neurobiological subgroups would be severely limited.

More fundamentally, the study overlooks brain diversity: substantial neurobiological heterogeneity even among individuals with identical symptom profiles. Recent Positron Emission Tomography (PET) neuroimaging provides direct empirical evidence. In a study examining frontal-striatal glucose metabolism in 27 participants (9 MS, 9 Long COVID, and 9 COVID-19 recovered controls) matched for identical fatigue severity scores, identical symptom presentations reflected completely different brain metabolic patterns (Rudroff, Reference Rudroff2024): MS patients showed significantly lower frontal glucose metabolism compared to both Long COVID patients and recovered controls; Long COVID patients exhibited significantly higher striatal metabolism than MS patients. These differences persisted despite matching for symptom severity, depression, age, and body mass index, with substantial effect sizes (Cohen’s d = 0.89–1.42 for frontal differences, 0.84–1.19 for striatal differences).

These divergent metabolic patterns imply that the Electroencephalogram (EEG) biomarker may be detecting different pathophysiological processes in different individuals – mechanistic pluralism masked by phenomenological similarity. This directly challenges the assumption underlying Linnhoff et al.’s approach that a single biomarker can reliably map onto fatigue regardless of underlying neurobiological heterogeneity. PET neuroimaging demonstrates that individuals experiencing identical cognitive fatigue arrive at that phenomenological endpoint via divergent neural mechanisms – hypometabolism in MS, potentially hypermetabolism in Long COVID.

The convergence of population and brain diversity concerns creates several validity threats. First, mechanistic pluralism: The authors interpret flatter aperiodic slopes as reflecting glutamatergic overload and disrupted E/I balance. However, if identical fatigue symptoms arise from distinct metabolic patterns (as PET demonstrates), the E/I balance captured by EEG may reflect different pathophysiological processes across individuals. A “flatter slope” in one person may represent compensatory hyperexcitation; in another, failed inhibitory control – phenomenologically similar, mechanistically distinct, potentially requiring different therapeutic approaches.

Second, classification performance variability: The reported AUC of 0.78 represents average discriminative ability. Brain diversity predicts that this metric will vary substantially across neurobiological subtypes. Individuals whose fatigue stems from frontal hypometabolism may show excellent biomarker sensitivity; those with preserved frontal function but striatal dysregulation may show poor sensitivity. Without examining performance across neurobiologically defined subgroups, we cannot know if the biomarker works well for everyone or exceptionally well for some and poorly for others.

Third, the symptom score equivalence fallacy: Standardized fatigue scores reflect behavioral outputs, not neural mechanisms. PET data demonstrate that identical scores can reflect opposite metabolic patterns (hypo- vs. hypermetabolism). Using symptom scores as ground truth for biomarker validation assumes neurobiological homogeneity that empirically does not exist.

The authors emphasize transdiagnostic convergence – reduced frontal aperiodic exponents in both MS and Long COVID – as evidence for “overlapping cortical mechanisms” and a “final common pathway.” However, transdiagnostic electrophysiological similarity does not establish mechanistic equivalence. If these conditions reach the same phenomenological endpoint through divergent metabolic pathways, the shared EEG signature may reflect downstream consequences rather than primary pathophysiology. This distinction matters clinically: Interventions targeting glutamate metabolism might benefit individuals with metabolic overactivity but prove ineffective or harmful for those with hypometabolism.

To address the dual diversity crisis in biomarker research, studies should: report comprehensive demographics including race, ethnicity, education, and socioeconomic indicators; conduct stratified analyses examining biomarker performance across demographic and neurobiologically defined subgroups; integrate multimodal neuroimaging (PET, Magnetic Resonance Spectroscopy (MRS), and functional Magnetic Resonance Imaging (fMRI)) to characterize heterogeneity; correlate EEG-derived E/I estimates with direct neurochemical measures; and determine if biomarker-guided treatment selection improves outcomes relative to symptom-based approaches.

The frontal aperiodic exponent represents valuable progress toward objective fatigue assessment. However, simultaneous neglect of population diversity (who was studied) and brain diversity (neurobiological heterogeneity within diagnostic groups) constitutes a dual validity threat that may fundamentally limit clinical translation. PET neuroimaging provides direct evidence that identical symptom presentations arise from divergent brain metabolic patterns, challenging the assumption that any single biomarker can reliably classify fatigue across neurobiologically diverse individuals.

Until we validate proposed biomarkers across both population and brain diversity, we risk systematically excluding patients who might benefit most from objective assessment tools. The goal should not be abandoning biomarker research but pursuing it with methodological rigor matching clinical ambitions.

Funding statement

This work received no specific funding.

Competing interests

The author declares none.

References

Chiao, J. Y., & Cheon, B. K. (2010). The weirdest brains in the world. Behavioral and Brain Sciences, 33(2–3), 8890. https://doi.org/10.1017/S0140525X10000282.CrossRefGoogle ScholarPubMed
Falk, E. B., Hyde, L. W., Mitchell, C., Faul, J., Gonzalez, R., Heitzeg, M. M., Keating, D. P., Langa, K. M., Martz, M. E., Maslowsky, J., Morrison, F. J., Noll, D. C., Patrick, M. E., Pfeffer, F. T., Reuter-Lorenz, P. A., Thomason, M. E., Davis-Kean, P., Monk, C. S., & Schulenberg, J. (2013). What is a representative brain? Neuroscience meets population science. Proceedings of the National Academy of Sciences of the United States of America, 110(44), 1761517622. https://doi.org/10.1073/pnas.1310134110.CrossRefGoogle ScholarPubMed
Linnhoff, S., Cohen Kadosh, R., & Zaehle, T. (2026). EEG-based frontal excitation/inhibition balance as an objective biomarker for cognitive fatigue across multiple sclerosis and long COVID. Psychological Medicine, 56(e21), 18. https://doi.org/10.1017/S0033291725103024.CrossRefGoogle Scholar
Rudroff, T. (2024). Frontal-striatal glucose metabolism and fatigue in patients with multiple sclerosis, long COVID, and COVID-19 recovered controls. Experimental Brain Research, 242(9), 21252136. https://doi.org/10.1007/s00221-024-06882-z.CrossRefGoogle ScholarPubMed