Hostname: page-component-5db58dd55d-smskv Total loading time: 0 Render date: 2026-05-31T13:01:52.079Z Has data issue: false hasContentIssue false

Machine learning prediction of dementia conversion in mild cognitive impairment: A two- to six-year follow-up study

Published online by Cambridge University Press:  13 January 2026

Valgeir Thorvaldsson*
Affiliation:
Department of Psychology, University of Gothenburg, Gothenburg, Sweden Center for Ageing and Health, University of Gothenburg, Gothenburg, Sweden
Johan Svensson
Affiliation:
Department of Internal Medicine and Clinical Nutrition, Institute of Medicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden Region Västra Götaland, Skaraborg Central Hospital, Department of Internal Medicine, Skövde, Sweden
Emir Basic
Affiliation:
R&D-Unit, Neuropsychiatry, Sahlgrenska University Hospital, Mölndal, Sweden
Michael Jonsson
Affiliation:
R&D-Unit, Neuropsychiatry, Sahlgrenska University Hospital, Mölndal, Sweden Department of Psychiatry and Neurochemistry, Institute of Neuroscience of Physiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Petronella Kettunen
Affiliation:
R&D-Unit, Neuropsychiatry, Sahlgrenska University Hospital, Mölndal, Sweden Department of Psychiatry and Neurochemistry, Institute of Neuroscience of Physiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
Anders Wallin
Affiliation:
R&D-Unit, Neuropsychiatry, Sahlgrenska University Hospital, Mölndal, Sweden Department of Psychiatry and Neurochemistry, Institute of Neuroscience of Physiology, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
*
Corresponding author: Valgeir Thorvaldsson; Email: valgeir.thorvaldsson@psy.gu.se
Rights & Permissions [Opens in a new window]

Abstract

Objectives:

Mild cognitive impairment (MCI) involves measurable cognitive decline that does not yet significantly disrupt daily functioning but may signal increased risk of dementia. Reliable prediction of dementia conversion in MCI is essential for early intervention and optimized clinical trial design. This study aimed to evaluate the predictive performance of various machine learning (ML) classification algorithms using clinical and neuropsychological data.

Methods:

Data were drawn from the Gothenburg MCI Study and included 347 patients from a memory clinic, of whom 84 (24%) converted to dementia within two to six years. We applied 11 ML classification algorithms (logistic regression, linear discriminant analysis, naïve Bayes, k-nearest neighbors, LASSO, ridge regression, elastic net, decision tree, random forest, gradient boosting, and support vector machine (SVM)) to predict dementia conversion based on 54 clinical predictors (e.g., cerebrospinal fluid biomarkers, neuropsychological test scores, comorbidities, and demographics). In a second step, we included delta scores reflecting change in neuropsychological test performance from baseline to follow-up.

Results:

Without delta scores, LASSO, ridge, elastic net, random forest, and SVM performed best, achieving accuracy ≥0.87, kappa = 0.64, and AUC-ROC ≥0.90. These models demonstrated high specificity (0.94) but moderate sensitivity (0.68). Including delta scores improved performance, with ridge and elastic net achieving accuracy = 0.90, kappa = 0.73 and 0.72, AUC-ROC = 0.94, specificity = 0.96, and sensitivity = 0.73. The elastic net model yielded a positive predictive value of 0.85 and a negative predictive value of 0.92.

Conclusions:

ML models incorporating clinical and cognitive change data can accurately predict dementia conversion in MCI, supporting their utility in clinical decision-making.

Information

Type
Research Article
Creative Commons
Creative Common License - CCCreative Common License - BY
This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright
© The Author(s), 2026. Published by Cambridge University Press on behalf of International Neuropsychological Society
Figure 0

Table 1. Frequency of dementia diagnoses at each follow-up in the Gothenburg MCI study (n = 84)

Figure 1

Table 2. Descriptive statistics for variables included in the machine learning prediction analysis of dementia conversion among individuals with mild cognitive impairment (MCI; excluding cognitive tests variables)

Figure 2

Table 3. Descriptive statistics for the neuropsychological cognitive test variables included in the machine learning prediction analyses of dementia conversion among individuals with mild cognitive impairment (MCI)

Figure 3

Table 4. Performance of machine learning algorithms in predicting dementia diagnosis two to six years after baseline among individuals with mild cognitive impairment (MCI; N = 347; prevalence = 84 cases, 24%)

Figure 4

Figure 1. Receiver operating characteristic (ROC) curves for the machine learning algorithms predicting dementia conversion two to six years after baseline among individuals with mild cognitive impairment (MCI).

Figure 5

Figure 2. The 20 most important variables for predicting subsequent dementia conversion two to six years after baseline among individuals with mild cognitive impairment (MCI) in the ridge regression and random forest models. Variable importance was standardized relative to the most important variable. Comparable plots for the other models are presented in the Supplementary Material (Figures S1–S3).

Figure 6

Table 5. Performance of machine learning algorithms in predicting dementia diagnosis two to six years after baseline among individuals with mild cognitive impairment (MCI), including cognitive delta scores (N = 347; prevalence = 84 cases, 24%)

Figure 7

Figure 3. Receiver operating characteristic (ROC) curves for the machine learning algorithms predicting dementia conversion two to six years after baseline among individuals with mild cognitive impairment (MCI), based on models including cognitive delta (i.e., difference) scores. The logistic regression model did not converge when delta scores were included.

Figure 8

Figure 4. The 20 most important variables for predicting subsequent dementia conversion two to six years after baseline among individuals with mild cognitive impairment (MCI) in the elastic net regression and gradient boosting models that included the cognitive delta (i.e., difference) scores as predictors. Variable importance was standardized relative to the most important variable. Comparable plots for the other models are presented in the Supplementary Material (Figures S4–S6).

Supplementary material: File

Thorvaldsson et al. supplementary material

Thorvaldsson et al. supplementary material
Download Thorvaldsson et al. supplementary material(File)
File 521.3 KB