Assessing anomia across cultures and languages: A head-to-head comparison of abbreviated versions of the Copenhagen Cross-Linguistic Naming Test and Naming Assessment in Multicultural Europe in a multicultural memory clinic population

Sofie Krogh Larsen; Sanne Franzen; Alfonso Delgado-Álvarez; Alvaro Lozano-Ruiz; Maria Özden; Juliette Palisson; Naaheed Mukadam; T. Rune Nielsen

doi:10.1017/S135561772610188X

Assessing anomia across cultures and languages: A head-to-head comparison of abbreviated versions of the Copenhagen Cross-Linguistic Naming Test and Naming Assessment in Multicultural Europe in a multicultural memory clinic population

Published online by Cambridge University Press: 30 March 2026

Sofie Krogh Larsen ,

Sanne Franzen

Alfonso Delgado-Álvarez ,

Naaheed Mukadam and

Sofie Krogh Larsen: Affiliation:
Department of Psychology, University of Copenhagen , Copenhagen, Denmark
Sanne Franzen: Affiliation:
Department of Neurology & Alzheimer Center, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
Alfonso Delgado-Álvarez: Affiliation:
Department of Neurology, Hospital Clinico San Carlos, San Carlos Institute for Health Research (IdiSSC), Universidad Complutense de Madrid, Madrid, Spain Department of Psychobiology & Behavioral Sciences Methods, Universidad Complutense de Madrid, Madrid, Spain
Alvaro Lozano-Ruiz: Affiliation:
Department of Health Sciences, Valencian International University – VIU, Valencia, Spain
Maria Özden: Affiliation:
Department of Brain and Spinal Cord Injury, The Neuroscience Centre, Copenhagen University Hospital – Rigshospitalet, Copenhagen, Denmark
Juliette Palisson: Affiliation:
Neurology Department, Avicenne Hospital, University Hospital Group (GHU), Assistance Publique – Hôpitaux de Paris (AP-HP), Bobigny, France
Naaheed Mukadam: Affiliation:
Division of Psychiatry, University College London, London, United Kingdom North London NHS Foundation Trust, London, United Kingdom
T. Rune Nielsen*: Affiliation:
Department of Psychology, University of Copenhagen , Copenhagen, Denmark Danish Dementia Research Centre, Department of Neurology, Copenhagen University Hospital - Rigshospitalet , Copenhagen, Denmark
*: Corresponding author: T. Rune Nielsen; Email: thomas.rune.nielsen.01@regionh.dk

Article contents

Abstract
Objective:
Methods:
Results:
Conclusion:
Statement of Research Significance
Introduction
Materials and methods
Results
Discussion
Funding statement
Competing interests
References

Rights & Permissions

Abstract

Objective:

This study aimed to make a head-to-head comparison of the diagnostic accuracy and cross-cultural applicability of abbreviated 20-item versions of the Copenhagen Cross-Linguistic Naming Test (C-CLNT20) and Naming Assessment in Multicultural Europe (NAME20).

Methods:

The present study was conducted in a multicultural and multilingual patient sample from memory clinics across five European countries. Receiver operating characteristic curve analysis was used to assess the diagnostic accuracy of C-CLNT20 and NAME20 in classifying dementia and mild cognitive impairment (MCI). Binary logistic regression analysis was performed to evaluate the influence of demographic and cultural factors on diagnostic accuracy.

Results:

C-CLNT20 and NAME20 showed acceptable diagnostic accuracy for dementia with areas under the curve (AUC) of .75 and .82, respectively, but had low accuracy for MCI (AUC of .64 and .62, respectively). Compared to C-CLNT20, NAME20 had slightly higher, but statistically non-significant, AUCs for dementia in both in the full sample and in participants with immigrant background. The diagnostic accuracy of the C-CLNT20 and NAME20 was not significantly influenced by education and immigrant status in the full sample, or by acculturation and use of an interpreter in participants with immigrant background.

Conclusion:

Both C-CLNT20 and NAME20 are promising brief alternatives to the full versions of the naming tests when time is limited. They also present a promising alternative to other established naming tests by maintaining diagnostic accuracy while showing minimal cross-cultural and cross-linguistic bias.

Keywords

Language impairment naming dementia mild cognitive impairment immigrant cross-cultural comparison

Information

Type: Research Article
Information: Journal of the International Neuropsychological Society , First View , pp. 1 - 10

DOI: https://doi.org/10.1017/S135561772610188X [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2026. Published by Cambridge University Press on behalf of International Neuropsychological Society

Statement of Research Significance

Research Question(s) or Topic(s): This study compared the diagnostic accuracy of abbreviated 20-item versions of the Copenhagen Cross-Linguistic Naming Test and Naming Assessment in Multicultural Europe for dementia and mild cognitive impairment in a multicultural memory clinic population and examined the influence of demographic and cultural factors on diagnostic accuracy. Main Findings: Both naming tests demonstrated moderate to high diagnostic accuracy for dementia and limited accuracy for mild cognitive impairment. Diagnostic accuracy of neither the Copenhagen Cross-Linguistic Naming Test or Naming Test and Naming Assessment in Multicultural Europe were influenced by immigrant status, acculturation, or administration with an interpreter, indicating little cultural and language bias. Study Contributions: The study supports the validity of the Copenhagen Cross-Linguistic Naming Test and Naming Assessment in Multicultural Europe for assessing anomia in patients with dementia in multicultural populations. Both naming tests appear to be valid time-saving alternatives to their full-length versions.

Introduction

Anomia is a common linguistic impairment seen in various neurological conditions, including stroke, acquired head injury, and in several dementia disorders (Kristensson et al., Reference Kristensson, Longoni, Östberg, Rödseth Smith, Åke and Saldert2024; Nørkær et al., Reference Nørkær, Halai, Woollams, Lambon Ralph and Schumacher2024; Strain et al., Reference Strain, Didehbani, Spence, Conover, Bartz, Mansinghani, Jeroudi, Rao, Fields, Kraut, Cullum, Hart and Womack2017; Vogel et al., Reference Vogel, Mellergaard and Frederiksen2025). Thus, it is standard clinical practice to assess anomia in patients referred to memory clinics, which is most frequently done with confrontation naming tests (Georgiou et al., Reference Georgiou, Prapiadou, Thomopoulos, Skondra, Charalampopoulou, Pachi, Anagnostopoulou, Vorvolakos, Perneczky, Politis and Alexopoulos2022).

The Boston Naming Test (BNT) (Kaplan et al., Reference Kaplan, Goodglass and Weintraub2001) is the most widely used confrontation naming (Maruta et al., Reference Maruta, Guerreiro, De Mendonça, Hort and Scheltens2011; Rabin et al., Reference Rabin, Paolillo and Barr2016). The original version consists of 60 black-and-white line drawings depicting objects of increasing difficulty, but several briefer 30-, 20- or 15-item versions have also been developed (Mack et al., Reference Mack, Freed, Williams and Henderson1992; Williams et al., Reference Williams, Mack and Henderson1989). However, the BNT has faced longstanding criticism for its cultural and linguistic bias, limiting its cross-cultural applicability (Harry & Crowe, Reference Harry and Crowe2014; March et al., Reference March, Worrall and Hickson2000). In addition, studies have shown systematic disparities across ethnic and linguistic groups, with lower scores observed in bilingual and multilingual individuals, and in minoritized populations (Baird et al., Reference Baird, Ford and Podell2007; Boone et al., Reference Boone, Victor, Wen, Razani and Ponton2007; Gollan et al., Reference Gollan, Fennema-Notestine, Montoya and Jernigan2007; Kohnert et al., Reference Kohnert, Hernandez and Bates1998; Roberts et al., Reference Roberts, Garcia, Desrochers and Hernandez2002). Different patterns in BNT performance have also been linked to other factors, such as education or rural versus urban upbringing (Kim et al., Reference Kim, Lee, Bae, Kim, Kim, Kim, Park, Cho and Chang2017).

In response to these concerns, there have been several efforts to develop cross-cultural naming tests, including the Cross-Linguistic Naming Test (CLNT) (Ardila, Reference Ardila2007) and Multilingual Naming Test (MINT) (Gollan et al., Reference Gollan, Weissberger, Runnqvist, Montoya and Cera2012). While these efforts represent meaningful progress, they also highlight the ongoing need for developing brief, cross-cultural confrontation tests with high diagnostic accuracy. For instance, while CLNT has shown promising cross-cultural properties it has low sensitivity for naming impairment associated with dementia due to a ceiling effect (Gálvez-Lara et al., Reference Gálvez-Lara, Moriana, Vilar-López, Fasfous, Hidalgo-Ruzzante and Pérez-García2015), and although MINT has shown high diagnostic accuracy for dementia (Ivanova et al., Reference Ivanova, Salmon and Gollan2013; Stasenko et al., Reference Stasenko, Jacobs, Salmon and Gollan2019), it is influenced by factors such as sex, education, and ethnic background (Stasenko et al., Reference Stasenko, Jacobs, Salmon and Gollan2019) and has been criticized for including items that are culturally unfamiliar for some populations (Li et al., Reference Li, Zeng, Neugroschl, Aloysi, Zhu, Xu, Teresi, Ocepek-Welikson, Ramirez, Joseph, Cai, Grossman, Martin, Sewell, Loizos and Sano2022).

The above criticisms are increasingly relevant with rising global migration. Neuropsychologists have frequently reported language differences as a primary barrier in neuropsychological assessments (Franzen et al., Reference Franzen, Papma, Van Den Berg and Nielsen2020, Reference Franzen, Watermeyer, Pomati, Papma, Nielsen, Narme, Mukadam, Lozano-Ruiz, Ibanez-Casas, Goudsmit, Fasfous, Daugherty, Canevelli, Calia, Van Den Berg and Bekkhus-Wetterberg2022; Nielsen et al., Reference Nielsen, Andersen, Kastrup, Phung and Waldemar2011; Nielsen et al., Reference Nielsen, De Mendonça, Frölich, Engelborghs, Gove, Lamirel, Calia and Waldemar2024), and recent surveys of European clinical dementia centers found that half of the centers found it more challenging to assess culturally and linguistically diverse patients (Nielsen et al., Reference Nielsen, De Mendonça, Frölich, Engelborghs, Gove, Lamirel, Calia and Waldemar2024). Misdiagnosis in these populations remains a well-documented issue (Hinton et al., Reference Hinton, Tran, Peak, Meyer and Quiñones2024; Lin et al., Reference Lin, Daly, Olchanski, Cohen, Neumann, Faul, Fillit and Freund2021; Nielsen, Andersen, et al., Reference Nielsen, Andersen, Kastrup, Phung and Waldemar2011; Nielsen, Vogel, Phung, et al., Reference Nielsen, Vogel, Phung, Gade and Waldemar2011). At the same time, dementia prevalence continues to rise globally. Thus, dementia is currently estimated to affect approximately 50 million people worldwide, and this number is expected to triple within the next 25 years (World Health Organization, 2021). This underscores the urgent need for appropriate tools to diagnose dementia disorders in a timely and accurate manner, no matter ethnicity, language, or cultural origin.

Two promising cross-cultural naming tests, the Copenhagen Cross-Linguistic Naming Test (C-CLNT) and the Naming Assessment in Multicultural Europe (NAME) were both developed for use across diverse cultures, languages, and educational backgrounds (Franzen et al., Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023; Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023). However, given the time constraints often present in clinical practices, longer assessment tools may be impractical due to fatigue, cognitive load, limited personnel resources, etc. (Calero et al., Reference Calero, Arnedo, Navarro, Ruiz-Pedrosa and Carnero2002). As such, abbreviated versions can be particularly valuable for rapid screening, as well as for other purposes such as in brief test batteries used in research and clinical trials. This study aimed to compare the diagnostic accuracy of the abbreviated 20-item versions of C-CLNT20 and NAME20 for dementia and mild cognitive impairment (MCI) in a multicultural memory clinic sample, and to examine the influence of demographic and cultural factors on diagnostic accuracy. We hypothesized that: 1) the C-CLNT20 and NAME 20 would have similar diagnostic accuracies for dementia and MCI, and 2) their diagnostic accuracies would be unrelated to cultural factors.

Materials and methods

Participants

Patients were recruited from multidisciplinary memory clinics across five European countries (Denmark, Spain, the Netherlands, France, and the United Kingdom). All patients underwent a comprehensive clinical assessment that included interviews with both the patient and, when possible, a close relative or caregiver. This was followed by neurological, physical, and psychiatric evaluations, incorporating cognitive screening tools such as the Mini-Mental State Examination (MMSE) (Folstein et al., Reference Folstein, Folstein and McHugh1975) or Rowland Universal Dementia Assessment Scale (RUDAS) (Storey et al., Reference Storey, Rowland, Conforti and Dickson2004). Standard laboratory tests, including blood work and electrocardiograms, and structural brain imaging with computed tomography or magnetic resonance imaging were also performed. Additional assessments, such as positron emission tomography scans, cerebrospinal fluid analysis, or in-depth neuropsychological and psychiatric evaluations were conducted when clinically indicated. A team of experienced clinicians established diagnoses based on evidence from all clinical and investigational results, except the C-CLNT20 and NAME20, using the 5^th edition of the Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Association, 2013) criteria for major neurocognitive disorder (i.e., dementia), and diagnostic research criteria for specific dementia subtypes (Gorno-Tempini et al., Reference Gorno-Tempini, Hillis, Weintraub, Kertesz, Mendez, Cappa, Ogar, Rohrer, Black, Boeve, Manes, Dronkers, Vandenberghe, Rascovsky, Patterson, Miller, Knopman, Hodges, Mesulam and Grossman2011; McKeith et al., Reference McKeith, Boeve, Dickson, Halliday, Taylor, Weintraub, Aarsland, Galvin, Attems, Ballard, Bayston, Beach, Blanc, Bohnen, Bonanni, Bras, Brundin, Burn, Chen-Plotkin and Kosaka2017; McKhann et al., Reference McKhann, Knopman, Chertkow, Hyman, Jack, Kawas, Klunk, Koroshetz, Manly, Mayeux, Mohs, Morris, Rossor, Scheltens, Carrillo, Thies, Weintraub and Phelps2011; Rascovsky et al., Reference Rascovsky, Hodges, Knopman, Mendez, Kramer, Neuhaus, Van Swieten, Seelaar, Dopper, Onyike, Hillis, Josephs, Boeve, Kertesz, Seeley, Rankin, Johnson, Gorno-Tempini, Rosen and Miller2011; Sachdev et al., Reference Sachdev, Kalaria, O’Brien, Skoog, Alladi, Black, Blacker, Blazer, Chen, Chui, Ganguli, Jellinger, Jeste, Pasquier, Paulsen, Prins, Rockwood, Roman and Scheltens2014), MCI (Winblad et al., Reference Winblad, Palmer, Kivipelto, Jelic, Fratiglioni, Wahlund, Nordberg, Bäckman, Albert, Almkvist, Arai, Basun, Blennow, De Leon, DeCarli, Erkinjuntti, Giacobini, Graff, Hardy and Petersen2004), and subjective cognitive decline (SCD) (Jessen et al., Reference Jessen, Amariglio, van Boxtel, Breteler, Ceccaldi, Chételat, Dubois, Dufouil, Ellis, K., van der Flier, Glodzik, van Harten, de Leon, McHugh, Mielke, Molinuevo, Mosconi, Osorio and Perrotin2014). Patients with primary affective or other psychiatric conditions or cognitive impairment due to causes other than dementia or MCI, were excluded. Participants with physical impairments likely to interfere with cognitive testing (e.g., significant movement disorders, uncorrected hearing or vision problems) were excluded.

Cognitively healthy participants were recruited from local community centers, general practice clinics or through social networks of the researchers working at the memory clinics. The exclusion criteria for cognitively healthy participants included severe psychiatric or neurological disorder, substance abuse, or scoring <24/30 points on the MMSE or <23/30 points on the RUDAS, or >5/15 points on the two-step 5/15-item Geriatric Depression Scale (GDS-5/15). (Weeks et al., Reference Weeks, McGann, Michaels and Penninx2003).

Participants with immigrant background were defined as first-generation immigrants or refugees residing in the country where the data was collected. European native-born participants were defined as participants without migration background, meaning those who were born in the country of data collection, and typically belonged to the majority ethnic group of that country (e.g., ethnic Danes in Denmark). All participants were included between March 2023 and August 2024.

Procedure

As part of the data collection process, participants completed an assessment of approximately one hour. During this assessment, demographic and medical information was collected, and a brief neuropsychological test battery was administered, including the C-CLNT20 and NAME20. To minimize bias, assessors were generally blinded to participants’ diagnostic classifications, except for the cognitively intact group (as these participants were recruited separately). Participants with immigrant background (n = 116) were assessed in their first language whenever possible, either by multilingual research staff or with the assistance of interpreters (n = 79). However, a subset of these participants (n = 37) was assessed in their second language. The study adhered to the Declaration of Helsinki for research involving human subjects and was assessed and approved by the Scientific Ethics Committees (reference no. 22007675) and Data Protection Agency (reference no. P-2022-444) for the Capital Region of Denmark as well as relevant local ethics and data protection authorities at other sites. All participants provided written consent.

Measures

C-CLNT20

C-CLNT is a newly developed cross-cultural naming test (Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023) that consists of 30 colored drawings, 20 of which depict objects and 10 depict actions. One point is given for each correctly named item, and participants are given 20 seconds to respond per item. Semantic cues may be provided when appropriate (e.g., in cases of visual misperception) (Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023) and a correct response following this count as correctly named. The abbreviated 20-item version of C-CLNT (C-CLNT20) was developed by including only the 20 object items, excluding the original test’s 10 action items (Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023). In the context of multilingualism and inherent language mixing, participants are allowed to respond in any language. A correct response in any language is considered correct.

NAME20

NAME represents another novel cross-cultural naming test (Franzen et al., Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023) that consists of 60 items, including colored photographs of objects, natural phenomena, animals, body parts, colors, occupations, and actions. One point is given for each correctly named item. When administering NAME, no cues are provided and there is no formal time limit. The abbreviated 20-item version of NAME (NAME20) was constructed by selecting the 20 items that best separated patients with Alzheimer’s disease (AD) and mixed dementia (AD/vascular dementia [VaD]) from the remainder of the sample in the original validation study (Franzen et al., Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023). NAME20 includes items representing all the original categories of NAME. Participants are allowed to respond in any language. A correct response in any language is considered correct. Figure 1 provides examples of items from C-CLNT20 and NAME20.

Figure 1. Examples of items from C-CLNT20 (bone and fly) and NAME20 (butcher and nose). Items are reproduced from the original C-CLNT and NAME papers (Franzen et al., Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023 and Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023) with permission from the authors.

Other measures

In addition to the C-CLNT20 and NAME20, participants were administered a brief battery of neuropsychological tests. These tests included RUDAS (Storey et al., Reference Storey, Rowland, Conforti and Dickson2004), Category fluency (animals and supermarket items; Lehman, Reference Lehman1970), Clock Reading Test (CRT; Schmidtke & Olbrich, Reference Schmidtke and Olbrich2007), and Interlocking Finger Test (ILFT; Moo, Reference Moo2003).

RUDAS is a cross-cultural cognitive screening tool for dementia. It takes approx. 10 minutes to administer and comprises subtasks covering six different domains: episodic memory, body orientation, visuo-spatial construction, practical coordination, judgement, and language function. Scores range from 0–30 (Storey et al., Reference Storey, Rowland, Conforti and Dickson2004).

In Category fluency, the participant is instructed to name as many words as possible belonging to a specific category within 60 seconds (Wright et al., Reference Wright, De Marco and Venneri2023). In this study, two versions were used: 1) animals, and 2) items found in a supermarket. The score is the number of correct words produced within 60 seconds.

CRT is a brief 12-item visuo-spatial test, where the participant is presented with 12 different clocks faces with no digits. All clock faces show different times, and the task is to read and report the time. The score range is 0–12 points (Schmidtke & Olbrich, Reference Schmidtke and Olbrich2007).

In the ILFT, the participant is shown four non-symbolic hand gestures that are to be imitated. The score range is 0–4 points (Moo, Reference Moo2003).

Also, to measure acculturation in participants with immigrant backgrounds, the Brief Acculturation Scale (Norris et al., Reference Norris, Ford and Bova1996) was used, which is a four-item self-report measure focusing on language use. Each item is rated on a five-point Likert scale, with a total score ranging from 4–20 points. Higher scores indicate more acculturation towards the mainstream majority culture (Norris et al., Reference Norris, Ford and Bova1996).

Statistical analyses

To determine the significance of group differences on categorical variables, Pearson’s χ ²-test or Fishers Exact Test were used, while differences between groups on continuous variables were determined using analysis of variance (ANOVA). All group differences were pretested for homogeneity of variances, and when homogeneity of variances was not met, Welch’s ANOVA was used. Effects sizes were calculated as Partial Eta Squared (η ²), with η ² = .01 considered a small effect, η ² = .06 a medium effect, and η ² = .14 a large effect. Spearman’s rank order correlations were used to determine correlations between performance on C-CLNT20 and NAME20, and other neuropsychological tests, with r = .00–.20 considered a negligible effect, r = .21–.40 a weak effect, r = .41–.60 a moderate effect, r = .61–.80 a strong effect, and r = .81–1.00 a very strong effect.

To assess diagnostic accuracy, receiver operating characteristics (ROC) analysis was conducted to compute area under the curve (AUC), sensitivity, specificity, and positive (LR+) and negative (LR–) likelihood ratios, using the consensus diagnosis provided by a team of experienced clinicians as the reference standard. In these analyses, patients with SCD were grouped with cognitively healthy participants to form a cognitively intact group. By definition, individuals with SCD have no objective cognitive impairment (Jessen et al., Reference Jessen, Amariglio, van Boxtel, Breteler, Ceccaldi, Chételat, Dubois, Dufouil, Ellis, K., van der Flier, Glodzik, van Harten, de Leon, McHugh, Mielke, Molinuevo, Mosconi, Osorio and Perrotin2014). Youden’s index was used for determining optimal cut-off values to maximize sensitivity and specificity. AUCs were compared using the DeLong-method (DeLong et al., Reference DeLong, DeLong and Clarke-Pearson1988). Binary logistic regression analyses were used to determine the influence of demographic and cultural variables on classification accuracy. Nagelkerke R ² was reported as a measure for the explained variance in diagnostic group status. All analyses were conducted in IBM SPSS Statistics version 29.0.2.0 or clinical calculators from VassarStats.com. All statistical significance was determined using a p-value of < .05 (two-tailed).

Results

Participant characteristics

A total of 192 participants were recruited for the study. Of these, 22 memory clinic patients were excluded due to being diagnosed with a primary affective disorder, and nine cognitively intact participants were excluded, seven due to scoring <23/30 points on the RUDAS, and two due to scoring >5/15 points on GDS-5/15. The final sample consisted of 161 participants (see Table 1), representing 36 different countries of origin and 30 different languages. A total of 56.5% (n = 91) of the sample had immigrant background, and among them seven originated from Europe, 31 from the Middle East, 13 from Africa, 19 from Asia, and 21 from Latin America.

Table 1. Participant characteristics and test performance (n = 161)

Note: BAS = Brief Acculturation Scale, C-CLNT20 = Copenhagen Cross-Linguistic Naming Test (20 items), CRT = Clock Reading Test, ILFT = Interlocking Finger Test, MCI = Mild Cognitive Impairment, NAME20 = Naming Assessment in Multicultural Europe (20 items), RUDAS = Rowland Universal Dementia Assessment Scale.

^* Group comparison is only based on participants with immigrant background.

Among the 86 memory clinic patients, 53 were diagnosed with dementia (32 AD, four VaD, three mixed dementia (AD/VaD), four frontotemporal dementia (FTD; including two behavioral variant FTD and two primary progressive aphasia [PPA]), one dementia with Lewy bodies (DLB), two other specified dementia (normal pressure hydrocephalus and HIV-associated neurocognitive disorder), and seven unspecified dementia cases. Furthermore, 33 were diagnosed with MCI, including 27 with amnestic MCI and six with non-amnestic MCI.

There were no significant differences in distribution of sex or years of education across the diagnostic groups, but the cognitively intact group was younger (F(2, 157) = 13.22, p < .001) and included a larger proportion of participants with immigrant background (χ ²(2, n = 161) = 12.77, p = .002). Analyses further showed significant differences on all neuropsychological tests across diagnostic groups: C-CLNT20 (Welch’s F(2, 70.56) = 18.92, p = <.001, η ² = .23), NAME20 (Welch’s F(2, 84.87) = 19.02, p < .001, η ² = .22), RUDAS (Welch’s F(2, 63.89) = 39.01, p < .001, η ² = .37), Category fluency (animals) (Welch’s F(2, 84.46) = 44.22, p < .001, η ² = .34), Category fluency (supermarket) (Welch’s F(2, 87.47) = 40.72, p < .001, η ² = .33), CRT (Welch’s F(2, 74.95) = 24.28, p < .001, η ² = .3) and ILFT (F(2, 158) = 13.87, p < .001, η ² = .15). However, no significant differences were found between patients with SCD and cognitively healthy participants on any of the neuropsychological tests.

When comparing participants with immigrant background to European native-born participants, participants with immigrant background were significantly younger (67.6 ± 7.5 years vs. 77.5 ± 5.7 years; Welch’s F(1, 157.83) = 89.58, p < .001), but there were no significant differences in years of education, distribution of sex, or performance on C-CLNT20 and NAME20. Out of the 91 participants with immigrant background, 31 were assessed with help from an interpreter, with no significant differences in interpreter use across diagnostic groups.

Construct validity of the abbreviated versions

Correlation analyses showed that C-CLNT20 and NAME20 were strongly correlated with each other (r = .67, p < .001). C-CLNT20 was also moderately correlated with RUDAS (r = .43, p < .001) and category fluency (animals: r = .41, p < .001), weakly to moderately correlated with category fluency (supermarket items: r = .34, p < .001), and CRT (r = .39, p < .001) and weakly correlated with ILFT (r = .25, p = .001). NAME20 correlated moderately to strongly with category fluency (animals) (r = .60, p < .001), moderately with RUDAS (r = .46, p < .001), category fluency (supermarket items) (r = .45, p < .001) and CRT (r = .51, p < .001), and weakly with ILFT (r = .25, p = .001).

Diagnostic accuracy

AUCs were .75 for C-CLNT20 and .82 for NAME20 in discriminating patients with dementia from other diagnostic groups (cognitively intact + MCI). This difference in AUC values was not statistically significant (z = −1.77, p = .076) (see Figure 2). With optimal cut-off values at ≤18/20 for both tests, sensitivity and specificity were .72 and .71 for C-CLNT20 and .76 and .80 for NAME20. Using prediction models adjusting for age, sex, years of education, and immigrant status (see below), slightly reduced the AUCs to .73 for C-CLNT20 and .79 for NAME20. In a sub-comparison between patients with dementia and cognitively intact participants only, both tests demonstrated slightly higher diagnostic accuracy, with marginally higher AUC and increased specificity (see Table 2). When using the tests to discriminate between patients with MCI and cognitively intact participants, the AUC for C-CLNT20 was .64 and the AUC for NAME20 was .62, which again was not a significant difference between the two naming tests (z = .37, p = .712). With optimal cut-off values at ≤19 for both tests for detecting MCI, sensitivity and specificity were .73 and .51 for C-CLNT20 and .55 and .67 for NAME20.

Figure 2. ROC-curves for C-CLNT20 and NAME20 for dementia.

Table 2. Diagnostic accuracy

Note: AUC = Area under the curve, C-CLNT20 = Copenhagen Cross-Linguistic Naming Test (20 items), NAME20 = Naming Assessment in Multicultural Europe (20 items), +LR = positive likelihood ratio, −LR = negative likelihood ratio.

^* Optimal cut-off scores were based on Youdens J.

In a subsample consisting only of participants with immigrant background (n = 91), AUCs for C-CLNT20 and NAME20 were .81 (95% CI: .71–.91) and .86 (95% CI: .78–.95), respectively, in discriminating dementia from other diagnostic groups (cognitively intact + MCI), which was not a significant difference (z = –1.23, p = .220). In a subsample of European native-born participants (n = 70), AUCs were .70 (95% CI: .56–.83) and .77 (95% CI: .66–.89) for C-CLNT20 and NAME20, respectively, which was also not a significant difference (z = −1.2, p = .231). Also, AUC values were not significantly different between participants with and without immigrant background on either of the tests (C-CLNT20: z = 1.25, p = .213; NAME20: z = 1.15, p = .252).

Influence of demographic and cultural factors on diagnostic accuracy

Binary logistic regression analyses were conducted to determine the effects of demographic and cultural variables on the diagnostic accuracy of C-CLNT20 and NAME20 (Tables 3 and 4). In a model, including C-CLNT20, age, sex, years of education, and immigrant status as covariates, the model predicted 36.2% (Nagelkerke R ²) of the variance in group status (dementia vs. cognitively intact + MCI), and correctly classified 80.5% of all cases. Lower C-CLNT20 score, older age, and female sex were significant predictors of dementia, while years of education and immigrant status were not. In a model including NAME20 and the same covariates, the model predicted 36.6% of the variance in group status, and correctly classified 79.9% of all cases. In this model, lower NAME20 score and older age were significant predictors, while there was a trend for female sex (p = .056).

Table 3. Logistic regression analyses for diagnosis of dementia in the full sample (n = 161)

Note: P.E. = parameter estimate (B-value), S.E. = standard error, OR = odds ratio.

Table 4. Logistic regression analyses for diagnosis of dementia in participants with immigrant background (n = 90)

Note: P.E. = parameter estimate (B-value), S.E. = standard error, OR = odds ratio.

The regression analyses were repeated in a subsample of participants with immigrant background only. In a model with C-CLNT20, age, sex, years of education, and acculturation as covariates, the model explained 54.7% of the variance in group status, and correctly classified 88.4% of all cases. Significant predictors were C-CLNT20 score (B = –0.9, p < .001, OR: 0.41 [95% CI: 0.26–0.64]), age (B = 0.13, p = .01, OR: 1.14 [95% CI: 1.03–1.26]) and years of education (B = 0.14, p = .035, OR: 1.15 [95% CI: 1.01–1.32]). In a model including NAME20 and the same covariates, the model explained 54.9% of the variance in group status, and correctly classified 86% of all cases. Significant predictors in this model were NAME20 score (B = −0.96, p < .001, OR: 0.38 [95% CI: 0.23–0.63]) and age (B = 0.11, p = .022, OR: 1.11 [95% CI: 1.02–1.22]). Adding the use of an interpreter to the regression analyses, did not significantly influence the diagnostic performance of either C-CLNT20 (p = .191) or NAME20 (p = .948).

Discussion

This study presents a head-to-head comparison of the diagnostic accuracy and cross-cultural applicability of abbreviated 20-item versions of the C-CLNT and NAME. Overall, both tests demonstrated moderate to high diagnostic accuracy for dementia, limited accuracy for MCI, and minimal bias related to cultural and demographic factors. Construct validity of both tests was supported by moderate to strong correlations with other language measures (Category fluency) and weaker correlations with visual measures (ILFT and CRT), indicating good convergent and acceptable divergent validity. Supporting our first hypothesis, ROC curve analyses showed similar AUCs of .75 for C-CLNT20 and .82 for NAME20 for classifying dementia, with acceptable levels of sensitivity and specificity for both tests at their optimal cut-offs of ≤18/20. Diagnostic accuracy for MCI was lower (AUC = .64 and .62, respectively), suggesting that the two confrontation naming tests are not sufficiently sensitive to MCI. Diagnostic accuracy did not significantly differ between European native-born and immigrant participants. Regarding the influence of demographic and cultural variables, in the full sample diagnostic accuracy of C-CLNT20 and NAME20 was influenced by age and sex and in a sub-analysis in participants with immigrant status alone, C-CLNT20 was additionally influenced by education. Notably, however, in support of our second hypothesis, neither NAME20 or C-CLNT20 were influenced by immigrant status, acculturation, or administration with an interpreter, indicating little cultural and language bias.

While the C-CLNT and NAME20 offer efficiency, abbreviating a test can affect its psychometric properties. The full versions of C-CLNT and NAME were not included in the present study, which limits the possibility of direct comparisons with their abbreviated counterparts. However, Nielsen et al. (Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023) conducted a direct comparison of C-CLNT and C-CLNT20 in the original validation sample and found only a minimal reduction in AUC (from .80 to .78), suggesting that C-CLNT20 retained much of the original test’s diagnostic value. No such comparison between NAME and NAME20 has been conducted within the same sample or using the same methodology. Future research should more systematically investigate differences in diagnostic accuracy between full and abbreviated versions within the same samples to better understand potential trade-offs between test length and performance. Differences in sample composition and methodology also apply when comparing the present findings to other studies on abbreviated naming tests, limiting the strength of cross-study comparisons. Nonetheless, with these limitations in mind, the findings of this study are generally consistent with previous research on abbreviated naming tests. A 24-item version of the MINT demonstrated very similar diagnostic accuracy (AUC = .81; sensitivity = .91; specificity = .59) (Vélez-Uribe et al., Reference Vélez-Uribe, Rosselli, Newman, Gonzalez, Gonzalez Pineiro, Barker, Marsiske, Fiala, Lang, Conniff, Ahne, Goytizolo, Loewenstein, Curiel and Duara2024). Regarding the gold standard BNT, Li et al. (Reference Li, Zeng, Neugroschl, Aloysi, Zhu, Xu, Teresi, Ocepek-Welikson, Ramirez, Joseph, Cai, Grossman, Martin, Sewell, Loizos and Sano2022) reported an AUC of .78 for a 30-item version, while Katsumata et al. (Reference Katsumata, Mathews, Abner, Jicha, Caban-Holt, Smith, Nelson, Kryscio, Schmitt and Fardo2015) found AUCs ranging from .85 to .92 for various 15-item versions of BNT. Notably, the relatively high AUCs reported by Katsumata et al. (Reference Katsumata, Mathews, Abner, Jicha, Caban-Holt, Smith, Nelson, Kryscio, Schmitt and Fardo2015) may reflect that the study used a very demographically homogeneous sample (i.e., 93% White participants with a mean of 16 years of education), which may have inflated diagnostic accuracy. In more diverse populations, AUCs for 15-item version of BNT have been reported as low as .59 (Nielsen et al., Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023), suggesting that diagnostic accuracy of BNT is lower in more demographically and culturally heterogeneous samples. In this context, both C-CLNT20 and NAME20 appear to perform comparably, or even favorably, relative to other abbreviated confrontation naming tests.

Both C-CLNT20 and NAME20 demonstrated limited diagnostic accuracy for identifying MCI, with AUCs of .64 and .62, respectively. This corresponds with results from Nielsen et al. (Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023) on the full-length C-CLNT, which had an AUC of .53 for MCI. The AUC for MCI patients was not formally examined for the original NAME (Franzen et al., Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023). However, the original study reports medians in the MCI group closer to those of control participants than to patients with AD/mixed dementia with notable variation in this group, however. These results regarding detection of MCI are consistent with findings from other abbreviated naming tests. Short forms of the BNT have shown AUCs between .58 and .70 (Katsumata et al., Reference Katsumata, Mathews, Abner, Jicha, Caban-Holt, Smith, Nelson, Kryscio, Schmitt and Fardo2015; Li et al., Reference Li, Zeng, Neugroschl, Aloysi, Zhu, Xu, Teresi, Ocepek-Welikson, Ramirez, Joseph, Cai, Grossman, Martin, Sewell, Loizos and Sano2022), and the 24-item MINT reported similar performance (AUC = .60), with acceptable sensitivity (.79) but very low specificity (.35) (Vélez-Uribe et al., Reference Vélez-Uribe, Rosselli, Newman, Gonzalez, Gonzalez Pineiro, Barker, Marsiske, Fiala, Lang, Conniff, Ahne, Goytizolo, Loewenstein, Curiel and Duara2024). The limited accuracy across naming tests may reflect that anomia is not a core symptom of MCI, and that a significant proportion of individuals with MCI do not exhibit any anomia (Joubert et al., Reference Joubert, Brambati, Ansado, Barbeau, Felician, Didic, Lacombe, Goldstein, Chayer and Kergoat2010). Additionally, the present study’s MCI group included both amnestic and non-amnestic subtypes, unlike earlier studies that focused almost solely on amnestic MCI (Katsumata et al., Reference Katsumata, Mathews, Abner, Jicha, Caban-Holt, Smith, Nelson, Kryscio, Schmitt and Fardo2015; D. Li et al., Reference Li, Yu, Hu, Zhang, Liu, Fan, Ruan and Wang2022; Vélez-Uribe et al., Reference Vélez-Uribe, Rosselli, Newman, Gonzalez, Gonzalez Pineiro, Barker, Marsiske, Fiala, Lang, Conniff, Ahne, Goytizolo, Loewenstein, Curiel and Duara2024). Since naming impairments are more common in the amnestic subtype (Liampas et al., Reference Liampas, Folia, Morfakidou, Siokas, Yannakoulia, Sakka, Scarmeas, Hadjigeorgiou, Dardiotis and Kosmidis2023), this broader inclusion criteria for MCI may have slightly reduced classification performance in this sample compared to other samples. These findings indicate that brief naming tests, such as the C-CLNT20 and NAME20, may have limited utility in detecting naming deficits in individuals with MCI. Prior research suggests that naming tests incorporating items with a stronger semantic load, such as names of famous people or culturally significant landmarks, may offer greater sensitivity in identifying such impairments (Vogel et al., Reference Vogel, Johannsen, Stokholm and Jørgensen2014). However, because semantic knowledge is inherently culture-specific, the inclusion of these items poses challenges for test adaptation and validity in cross-cultural contexts.

Lastly, C-CLNT20 and NAME20, like their full-length counterparts, demonstrated minimal cultural bias. Unlike other naming tests, their performance was not significantly influenced by immigrant status, level of acculturation, or use of interpreter. In contrast, previous studies have shown that MINT-24 was affected by education, and the full MINT was also influenced by cultural group (Vélez-Uribe et al., Reference Vélez-Uribe, Rosselli, Newman, Gonzalez, Gonzalez Pineiro, Barker, Marsiske, Fiala, Lang, Conniff, Ahne, Goytizolo, Loewenstein, Curiel and Duara2024). Similarly, performance on the BNT has been shown to vary with education level, acculturation, and immigration background among other factors (Boone et al., Reference Boone, Victor, Wen, Razani and Ponton2007; Nussbaum et al., Reference Nussbaum, May, Cutler, Abeare, Watson and Erdodi2022; Shaikh et al., Reference Shaikh, Zaidi, Wong Gonzalez, Dimech, Gilson, Stokes and Paterson2025). Compared to these findings, both C-CLNT20 and NAME20 appear to be more culturally fair.

Taken together, the head-to-head comparison between C-CLNT20 and NAME20 showed no statistically significant differences in diagnostic accuracy between the two tests across all subgroup comparisons. However, NAME20 systematically showed slight advantages in classifying dementia, with higher sensitivity, specificity, and AUC values. Subtle differences in diagnostic performance may partly stem from the different approaches to item selection. NAME20 item selection was based on a psychometric decision using the 20 items that best discriminated patients with dementia from cognitively healthy controls, including colored photographs of objects, natural phenomena, animals, body parts, colors, occupations, and actions. In contrast, C-CLNT20 selectively included colored drawings of objects, as some literature suggests noun naming may be more affected than verb naming in Alzheimer’s disease (Williamson et al., Reference Williamson, Adair, Raymer and Heilman1998). These differences in item selection may have affected diagnostic performance of NAME20 and C-CLNT20 as some studies indicate that deficits in noun and verb naming vary based on affected brain regions and specific dementia subtypes (Hillis et al., Reference Hillis, Oh and Ken2004; Pisoni et al., Reference Pisoni, Mattavelli, Casarotti, Comi, Riva, Bello and Papagno2018). For instance, a study showed that patients with FTD (including non-fluent PPA) showed greater difficulty with verb naming than noun naming (Hillis et al., Reference Hillis, Oh and Ken2004). Thus, the differences in diagnostic performance may partly be due to NAME20 being more sensitive to naming impairment associated with FTD. However, this needs to be established in future research. Ideally, such research should examine C-CLNT20 and NAME20 performance separately across larger, well-defined, dementia syndromes, including patients with clinically documented anomia. It would also be interesting to examine whether incorrect responses on these tests reflect naming impairment, or whether some errors may be due to impairments in other cognitive functions (e.g., gnosis).

A key strength of this study was the direct head-to-head comparison of C-CLNT20 and NAME20 within the same multicultural sample, reducing methodological disparities and enhancing the validity of comparative findings. Recruitment across five countries and inclusion of 36 different nationalities and 30 languages further supported the generalizability of results across countries and cultural contexts. However, the small sample sizes from some countries (four from France and one from the UK) represent a limitation. Another limitation is the lack of accurate matching across diagnostic groups on variables such as age and immigrant status. Although we tried to correct for this in the analyses, it cannot be ruled out that this may have exacerbated some group differences and influenced diagnostic performance of the tests. Furthermore, our dementia and MCI samples were too small to analyze the C-CLNT20 and NAME20 across specific dementia and MCI subtypes. Additionally, while the findings supported construct validity through correlations with category fluency tests, the lack of a well-established confrontation naming tests for multicultural populations, including the BNT and MINT, complicate this type of research and limits direct comparability with other studies using these measures. Finally, although interpreter assistance was generally provided when necessary, access to interpreter services and the quality of professional interpreter training vary considerably across European countries (Nielsen et al., Reference Nielsen, De Mendonça, Frölich, Engelborghs, Gove, Lamirel, Calia and Waldemar2024). At most participating sites, neuropsychologists collaborated with interpreters who lacked specific training in cognitive assessment. Nonetheless, the study served to cross-validate parts of C-CLNT and NAME in a new, large, and diverse sample, strengthening the findings reported by Franzen et al. (Reference Franzen, Van Den Berg, Ayhan, Satoer, Türkoğlu, Genç Akpulat, Visch-Brink, Scheffers, Kranenburg, Jiskoot, Van Hemmen and Papma2023) and Nielsen et al. (Reference Nielsen, Grollenberg, Ringkøbing, Özden, Weekes and Waldemar2023).

In conclusion, this study supports the validity of C-CLNT20 and NAME20 for assessing anomia in patients with dementia in multicultural populations. While both tests showed acceptable diagnostic accuracy for dementia, their sensitivity for MCI was limited, likely due to subtle or absent naming deficits in these patients and potential ceiling effects. In this study, NAME20 showed slight but consistent advantages over C-CLNT20. However, these findings should be confirmed by additional studies before any recommendations are made regarding the choice between the two tests. Both NAME20 and C-CLNT20 appear to be valid time-saving alternatives to their full-length versions and represent a meaningful progress towards cross-cultural naming tests.

Funding statement

This research was supported by THE VELUX FOUNDATIONS (grant number 00042578), which had no role in the formulation of research questions, choice of study design, data collection, data analysis or decision to publish. The Danish Dementia Research Centre is supported by the Danish Ministry of Health. Sanne Franzen is supported by grants from the Netherlands Organisation for Health Research and Development (#733050834 and #10510032120004). She also received consulting fees from Biogen in 2022 (unrelated to this work) and receives royalties on the Dutch version of the Five Digit Test and the modified Visual Association Test (published by Hogrefe).

Competing interests

T. Rune Nielsen and Maria Özden are coauthors on the original C-CLNT validation paper and Sanne Franzen is the main author on the original NAME validation paper.

References

American Psychiatric Association. (2013). Diagnostic and Statistical Manual of Mental Disorders (5th ed.). American Psychiatric Association. https://doi.org/10.1176/appi.books.9780890425596 Google Scholar

Ardila, A. (2007). Toward the development of a cross-linguistic naming test. Archives of Clinical Neuropsychology, 22(3), 297–307.10.1016/j.acn.2007.01.016CrossRef Google Scholar PubMed

Baird, A., Ford, M., & Podell, K. (2007). Ethnic differences in functional and neuropsychological test performance in older adults. Archives of Clinical Neuropsychology, 22(3), 309–318.10.1016/j.acn.2007.01.005CrossRef Google Scholar PubMed

Boone, K., Victor, T., Wen, J., Razani, J., & Ponton, M. (2007). The association between neuropsychological scores and ethnicity, language, and acculturation variables in a large patient population. Archives of Clinical Neuropsychology, 22(3), 355–365.10.1016/j.acn.2007.01.010CrossRef Google Scholar PubMed

Calero, M. D., Arnedo, M. L., Navarro, E., Ruiz-Pedrosa, M., & Carnero, C. (2002). Usefulness of a 15-item version of the Boston Naming Test in neuropsychological assessment of low-educational elders with dementia. The Journals of Gerontology Series B: Psychological Sciences and Social Sciences, 57(2), P187–P191.10.1093/geronb/57.2.P187CrossRef Google Scholar PubMed

DeLong, E. R., DeLong, D. M., & Clarke-Pearson, D. L. (1988). Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics, 44(3), 837–845.10.2307/2531595CrossRef Google Scholar PubMed

Folstein, M. F., Folstein, S. E., & McHugh, P. R. (1975). Mini-mental state. Journal of Psychiatric Research, 12(3), 189–198.10.1016/0022-3956(75)90026-6CrossRef Google Scholar PubMed

Franzen, S., Papma, J. M., Van Den Berg, E., & Nielsen, T. R. (2020). Cross-cultural neuropsychological assessment in the European Union: a Delphi expert study. Archives of Clinical Neuropsychology, 36(5), 815–830.Google Scholar

Franzen, S., Van Den Berg, E., Ayhan, Y., Satoer, D. D., Türkoğlu, Ö., Genç Akpulat, G. E., Visch-Brink, E. G., Scheffers, E. A., Kranenburg, J., Jiskoot, L. C., Van Hemmen, J., & Papma, J. M. (2023). The Naming Assessment in Multicultural Europe (NAME): Development and validation in a multicultural memory clinic. Journal of the International Neuropsychological Society, 29(1), 92–104.10.1017/S135561772100148XCrossRef Google Scholar

Franzen, S., European Consortium on Cross-Cultural Neuropsychology (ECCroN), Watermeyer, T. J., Pomati, S., Papma, J. M., Nielsen, T. R., Narme, P., Mukadam, N., Lozano-Ruiz, Á., Ibanez-Casas, I., Goudsmit, M., Fasfous, A., Daugherty, J. C., Canevelli, M., Calia, C., Van Den Berg, E., & Bekkhus-Wetterberg, P. (2022). Cross-cultural neuropsychological assessment in Europe: Position statement of the European Consortium on Cross-Cultural Neuropsychology (ECCroN). The Clinical Neuropsychologist, 36(3), 546–557.10.1080/13854046.2021.1981456CrossRef Google Scholar

Gálvez-Lara, M., Moriana, J. A., Vilar-López, R., Fasfous, A. F., Hidalgo-Ruzzante, N., & Pérez-García, M. (2015). Validation of the Cross-Linguistic Naming Test: a naming test for different cultures? A preliminary study in the Spanish population. Journal of Clinical & Experimental Neuropsychology, 37(1), 102–112.10.1080/13803395.2014.1003533CrossRef Google Scholar

Georgiou, E., Prapiadou, S., Thomopoulos, V., Skondra, M., Charalampopoulou, M., Pachi, A., Anagnostopoulou, A., Vorvolakos, T., Perneczky, R., Politis, A., & Alexopoulos, P. (2022). Naming ability assessment in neurocognitive disorders: A clinician’s perspective. BMC Psychiatry, 22(1), 837.10.1186/s12888-022-04486-xCrossRef Google Scholar PubMed

Gollan, T. H., Fennema-Notestine, C., Montoya, R. I., & Jernigan, T. L. (2007). The bilingual effect on Boston Naming Test performance. Journal of the International Neuropsychological Society, 13(02), 197–208.10.1017/S1355617707070038CrossRef Google Scholar PubMed

Gollan, T. H., Weissberger, G. H., Runnqvist, E., Montoya, R. I., & Cera, C. M. (2012). Self-ratings of spoken language dominance: A Multilingual Naming Test (MINT) and preliminary norms for young and aging Spanish–English bilinguals. Bilingualism: Language and Cognition, 15(3), 594–615.10.1017/S1366728911000332CrossRef Google Scholar

Gorno-Tempini, M. L., Hillis, A. E., Weintraub, S., Kertesz, A., Mendez, M., Cappa, S. F., Ogar, J. M., Rohrer, J. D., Black, S., Boeve, B. F., Manes, F., Dronkers, N. F., Vandenberghe, R., Rascovsky, K., Patterson, K., Miller, B. L., Knopman, D. S., Hodges, J. R., Mesulam, M. M., & Grossman, M. (2011). Classification of primary progressive aphasia and its variants. Neurology, 76(11), 1006–1014.10.1212/WNL.0b013e31821103e6CrossRef Google Scholar PubMed

Harry, A., & Crowe, S. F. (2014). Is the Boston Naming Test still fit for purpose? The Clinical Neuropsychologist, 28(3), 486–504.10.1080/13854046.2014.892155CrossRef Google Scholar

Hillis, A. E., Oh, S., & Ken, L. (2004). Deterioration of naming nouns versus verbs in primary progressive aphasia. Annals of Neurology, 55(2), 268–275.10.1002/ana.10812CrossRef Google Scholar PubMed

Hinton, L., Tran, D., Peak, K., Meyer, O. L., & Quiñones, A. R. (2024). Mapping racial and ethnic healthcare disparities for persons living with dementia: A scoping review. Alzheimer’s & Dementia, 20(4), 3000–3020.10.1002/alz.13612CrossRef Google Scholar PubMed

Ivanova, I., Salmon, D. P., & Gollan, T. H. (2013). The Multilingual Naming Test in Alzheimer’s disease: Clues to the origin of naming impairments. Journal of the International Neuropsychological Society, 19(3), 272–283.10.1017/S1355617712001282CrossRef Google Scholar

Jessen, F., Amariglio, R. E., van Boxtel, M., Breteler, M., Ceccaldi, M., Chételat, G. B., Dubois, l., Dufouil, B., Ellis, C., K., A., van der Flier, W. M., Glodzik, L., van Harten, A. C., de Leon, M. J., McHugh, P., Mielke, M. M., Molinuevo, J. L., Mosconi, L., Osorio, R. S., Perrotin, A., … Subjective Cognitive Decline Initiative (SCD‐I) Working Group. (2014). A conceptual framework for research on subjective cognitive decline in preclinical Alzheimer’s disease. Alzheimer’s & Dementia, 10(6), 844–852.10.1016/j.jalz.2014.01.001CrossRef Google Scholar PubMed

Joubert, S., Brambati, S. M., Ansado, J., Barbeau, E. J., Felician, O., Didic, M., Lacombe, J., Goldstein, R., Chayer, C., & Kergoat, M.-J. (2010). The cognitive and neural expression of semantic memory impairment in mild cognitive impairment and early Alzheimer’s disease. Neuropsychologia, 48(4), 978–988.10.1016/j.neuropsychologia.2009.11.019CrossRef Google Scholar PubMed

Kaplan, E., Goodglass, H., & Weintraub, S. (2001). Boston naming test (2nd ed.). Lippincott, Williams & Wilkins.Google Scholar

Katsumata, Y., Mathews, M., Abner, E. L., Jicha, G. A., Caban-Holt, A., Smith, C. D., Nelson, P. T., Kryscio, R. J., Schmitt, F. A., & Fardo, D. W. (2015). Assessing the discriminant ability, reliability, and comparability of multiple short forms of the Boston Naming Test in an Alzheimer’s disease center cohort. Dementia and Geriatric Cognitive Disorders, 39(3–4), 215–227.10.1159/000370108CrossRef Google Scholar

Kim, B.-S., Lee, D.-W., Bae, J.-N., Kim, J.-H., Kim, S., Kim, K. W., Park, J.-E., Cho, M. J., & Chang, S. M. (2017). Effects of education on differential item functioning on the 15-item modified Korean version of the Boston Naming Test. Psychiatry Investigation, 14(2), 126.10.4306/pi.2017.14.2.126CrossRef Google Scholar PubMed

Kohnert, K. J., Hernandez, A. E., & Bates, E. (1998). Bilingual performance on the Boston Naming Test: Preliminary norms in Spanish and English. Brain and Language, 65(3), 422–440.10.1006/brln.1998.2001CrossRef Google Scholar PubMed

Kristensson, J., Longoni, F., Östberg, P., Rödseth Smith, S., Åke, S., & Saldert, C. (2024). Anomia in left hemisphere stroke, multiple sclerosis and Parkinson’s disease – a comparative study. Disability & Rehabilitation, 46(11), 2294–2316.10.1080/09638288.2023.2219902CrossRef Google Scholar PubMed

Lehman, W. A. (1970). Missile Wounds of the Brain; A Study of Psychological Deficits. By Freda Newcombe. Oxford University Press (Oxford Neurological Monographs). 1969. Pp. 145, Price 42 s. British Journal of Psychiatry, 117(539), 461. https://doi.org/10.1192/bjp.117.539.461 CrossRef Google Scholar

Li, C., Zeng, X., Neugroschl, J., Aloysi, A., Zhu, C. W., Xu, M., Teresi, J. A., Ocepek-Welikson, K., Ramirez, M., Joseph, A., Cai, D., Grossman, H., Martin, J., Sewell, M., Loizos, M., & Sano, M. (2022). The 32-item Multilingual Naming Test: Cultural and linguistic biases in monolingual Chinese-speaking older adults. Journal of the International Neuropsychological Society, 28(5), 511–519.10.1017/S1355617721000746CrossRef Google Scholar PubMed

Li, D., Yu, Y.-Y., Hu, N., Zhang, M., Liu, L., Fan, L.-M., Ruan, S.-S., & Wang, F. (2022). A color-picture version of Boston Naming Test outperformed the black-and-white version in discriminating amnestic mild cognitive impairment and mild Alzheimer’s disease. Frontiers in Neurology, 13, 884460.10.3389/fneur.2022.884460CrossRef Google Scholar PubMed

Liampas, I., Folia, V., Morfakidou, R., Siokas, V., Yannakoulia, M., Sakka, P., Scarmeas, N., Hadjigeorgiou, G., Dardiotis, E., & Kosmidis, M. H. (2023). Language differences among individuals with normal cognition, amnestic and non-amnestic MCI, and Alzheimer’s disease. Archives of Clinical Neuropsychology, 38(4), 525–536.10.1093/arclin/acac080CrossRef Google Scholar PubMed

Lin, P.-J., Daly, A. T., Olchanski, N., Cohen, J. T., Neumann, P. J., Faul, J. D., Fillit, H. M., & Freund, K. M. (2021). Dementia diagnosis disparities by race and ethnicity. Medical Care, 59(8), 679–686.10.1097/MLR.0000000000001577CrossRef Google Scholar PubMed

Mack, W. J., Freed, D. M., Williams, B. W., & Henderson, V. W. (1992). Boston Naming Test: Shortened versions for use in Alzheimer’s disease. Journal of Gerontology, 47(3), P154–P158.10.1093/geronj/47.3.P154CrossRef Google Scholar PubMed

March, E. G., Worrall, L. E., & Hickson, L. M. H. (2000). Performance of an Australian older sample on the Boston Naming Test and comparability of short form test versions. Clinical Neuropsychological Assessment, 3, 179–192.Google Scholar

Maruta, C., Guerreiro, M., De Mendonça, A., Hort, J., & Scheltens, P. (2011). The use of neuropsychological tests across Europe: The need for a consensus in the use of assessment tools for dementia. European Journal of Neurology, 18(2), 279–285.10.1111/j.1468-1331.2010.03134.xCrossRef Google Scholar PubMed

McKeith, I. G., Boeve, B. F., Dickson, D. W., Halliday, G., Taylor, J.-P., Weintraub, D., Aarsland, D., Galvin, J., Attems, J., Ballard, C. G., Bayston, A., Beach, T. G., Blanc, F., Bohnen, N., Bonanni, L., Bras, J., Brundin, P., Burn, D., Chen-Plotkin, A., …Kosaka, K. (2017). Diagnosis and management of dementia with Lewy bodies: Fourth consensus report of the DLB consortium. Neurology, 89(1), 88–100.10.1212/WNL.0000000000004058CrossRef Google Scholar PubMed

McKhann, G. M., Knopman, D. S., Chertkow, H., Hyman, B. T., Jack, C. R., Kawas, C. H., Klunk, W. E., Koroshetz, W. J., Manly, J. J., Mayeux, R., Mohs, R. C., Morris, J. C., Rossor, M. N., Scheltens, P., Carrillo, M. C., Thies, B., Weintraub, S., & Phelps, C. H. (2011). The diagnosis of dementia due to Alzheimer’s disease: Recommendations from the National Institute on Aging-Alzheimer’s Association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimer’s & Dementia, 7(3), 263–269.10.1016/j.jalz.2011.03.005CrossRef Google Scholar PubMed

Moo, L. R. (2003). Interlocking finger test: A bedside screen for parietal lobe dysfunction. Journal of Neurology, Neurosurgery & Psychiatry, 74(4), 530–532.10.1136/jnnp.74.4.530CrossRef Google Scholar

Nielsen, T. R., Andersen, B. B., Kastrup, M., Phung, T. K. T., & Waldemar, G. (2011). Quality of dementia diagnostic evaluation for ethnic minority patients: A nationwide study. Dementia and Geriatric Cognitive Disorders, 31(5), 388–396.10.1159/000327362CrossRef Google Scholar PubMed

Nielsen, T. R., De Mendonça, A., Frölich, L., Engelborghs, S., Gove, D., Lamirel, D., Calia, C., & Waldemar, G. (2024). Assessment of dementia in minority ethnic groups in Europe: A 14-year follow-up survey. International Journal of Geriatric Psychiatry, 39(12), e70034.10.1002/gps.70034CrossRef Google Scholar PubMed

Nielsen, T. R., Franzen, S., Watermeyer, T., Jiang, J., Calia, C., Kjærgaard, D., Bothe, S., & Mukadam, N. (2024). Interpreter-mediated neuropsychological assessment: Clinical considerations and recommendations from the European Consortium on Cross-Cultural Neuropsychology (ECCroN). The Clinical Neuropsychologist, 38(8), 1775–1805.10.1080/13854046.2024.2335113CrossRef Google Scholar

Nielsen, T. R., Grollenberg, B. U., Ringkøbing, S. P., Özden, M., Weekes, B., & Waldemar, G. (2023). The copenhagen cross-linguistic naming test (C-CLNT): Development and validation in a multicultural memory clinic population. Journal of the International Neuropsychological Society, 29(10), 911–921.10.1017/S1355617723000437CrossRef Google Scholar

Nielsen, T. R., Vogel, A., Phung, T. K. T., Gade, A., & Waldemar, G. (2011). Over- and under-diagnosis of dementia in ethnic minorities: A nationwide register-based study. International Journal of Geriatric Psychiatry, 26(11), 1128–1135.10.1002/gps.2650CrossRef Google Scholar PubMed

Nielsen, T. R., Vogel, A., Riepe, M. W., De Mendonça, A., Rodriguez, G., Nobili, F., Gade, A., & Waldemar, G. (2011). Assessment of dementia in ethnic minority patients in Europe: A European Alzheimer’s disease consortium survey. International Psychogeriatrics, 23(1), 86–95.10.1017/S1041610210000955CrossRef Google Scholar PubMed

Nørkær, E., Halai, A. D., Woollams, A., Lambon Ralph, M. A., & Schumacher, R. (2024). Establishing and evaluating the gradient of item naming difficulty in post-stroke aphasia and semantic dementia. Cortex, 179, 103–111.10.1016/j.cortex.2024.07.007CrossRef Google Scholar PubMed

Norris, A. E., Ford, K., & Bova, C. A. (1996). Psychometrics of a brief acculturation scale for hispanics in a probability sample of urban hispanic adolescents and young adults. Hispanic Journal of Behavioral Sciences, 18(1), 29–38.10.1177/07399863960181004CrossRef Google Scholar

Nussbaum, S., May, N., Cutler, L., Abeare, C. A., Watson, M., & Erdodi, L. A. (2022). Failing performance validity cutoffs on the Boston Naming Test (BNT) is specific, but insensitive to Non-credible Responding. Developmental Neuropsychology, 47(1), 17–31.10.1080/87565641.2022.2038602CrossRef Google Scholar PubMed

Pisoni, A., Mattavelli, G., Casarotti, A., Comi, A., Riva, M., Bello, L., & Papagno, C. (2018). Object-action dissociation: A voxel-based lesion-symptom mapping study on 102 patients after glioma removal. NeuroImage: Clinical, 18, 986–995.10.1016/j.nicl.2018.03.022CrossRef Google Scholar

Rabin, L. A., Paolillo, E., & Barr, W. B. (2016). Stability in test-usage practices of clinical neuropsychologists in the United States and Canada over a 10-year period: A follow-up survey of INS and NAN members. Archives of Clinical Neuropsychology, 31(3), 206–230.10.1093/arclin/acw007CrossRef Google Scholar

Rascovsky, K., Hodges, J. R., Knopman, D., Mendez, M. F., Kramer, J. H., Neuhaus, J., Van Swieten, J. C., Seelaar, H., Dopper, E. G. P., Onyike, C. U., Hillis, A. E., Josephs, K. A., Boeve, B. F., Kertesz, A., Seeley, W. W., Rankin, K. P., Johnson, J. K., Gorno-Tempini, M.-L., Rosen, H., …Miller, B.L. (2011). Sensitivity of revised diagnostic criteria for the behavioural variant of frontotemporal dementia. Brain, 134(9), 2456–2477.10.1093/brain/awr179CrossRef Google Scholar PubMed

Roberts, P. M., Garcia, L. J., Desrochers, A., & Hernandez, D. (2002). English performance of proficient bilingual adults on the Boston Naming Test. Aphasiology, 16(4–6), 635–645.10.1080/02687030244000220CrossRef Google Scholar

Sachdev, P., Kalaria, R., O’Brien, J., Skoog, I., Alladi, S., Black, S. E., Blacker, D., Blazer, D. G., Chen, C., Chui, H., Ganguli, M., Jellinger, K., Jeste, D. V., Pasquier, F., Paulsen, J., Prins, N., Rockwood, K., Roman, G., & Scheltens, P. (2014). Diagnostic criteria for vascular cognitive disorders: A VASCOG statement. Alzheimer Disease & Associated Disorders, 28(3), 206–218.10.1097/WAD.0000000000000034CrossRef Google Scholar PubMed

Schmidtke, K., & Olbrich, S. (2007). The clock reading test: Validation of an instrument for the diagnosis of dementia and disorders of visuo-spatial cognition. International Psychogeriatrics, 19(2), 307–321.10.1017/S104161020600456XCrossRef Google Scholar PubMed

Shaikh, K. T., Zaidi, K. B., Wong Gonzalez, D., Dimech, C., Gilson, Z. M., Stokes, K. A., & Paterson, T. S. E. (2025). Cultural bias in the assessment of language: a closer look at the Boston naming test among multicultural Canadian older adults. Applied Neuropsychology: Adult, 1–13.Google Scholar

Stasenko, A., Jacobs, D. M., Salmon, D. P., & Gollan, T. H. (2019). The Multilingual Naming Test (MINT) as a measure of picture naming ability in Alzheimer’s disease. Journal of the International Neuropsychological Society, 25(08), 821–833.10.1017/S1355617719000560CrossRef Google Scholar PubMed

Storey, J. E., Rowland, J. T. J., Conforti, D. A., & Dickson, H. G. (2004). The Rowland Universal Dementia Assessment Scale (RUDAS): A multicultural cognitive assessment scale. International Psychogeriatrics, 16(1), 13–31.10.1017/S1041610204000043CrossRef Google Scholar

Strain, J. F., Didehbani, N., Spence, J., Conover, H., Bartz, E. K., Mansinghani, S., Jeroudi, M. K., Rao, N. K., Fields, L. M., Kraut, M. A., Cullum, C. M., Hart, J., & Womack, K. B. (2017). White matter changes and confrontation naming in retired aging national football league athletes. Journal of Neurotrauma, 34(2), 372–379.10.1089/neu.2016.4446CrossRef Google Scholar PubMed

Vélez-Uribe, I., Rosselli, M., Newman, D., Gonzalez, J., Gonzalez Pineiro, Y., Barker, W. W., Marsiske, M., Fiala, J., Lang, M. K., Conniff, J., Ahne, E., Goytizolo, A., Loewenstein, D. A., Curiel, R. E., & Duara, R. (2024). Cross-cultural diagnostic validity of the Multilingual Naming Test (MINT) in a sample of older adults. Archives of Clinical Neuropsychology, 39(4), 464–481.10.1093/arclin/acad093CrossRef Google Scholar

Vogel, A., Johannsen, P., Stokholm, J., & Jørgensen, K. (2014). Frequency and severity of semantic deficits in a consecutive memory clinic cohort. Dementia and Geriatric Cognitive Disorders, 38(3–4), 214–223.10.1159/000357794CrossRef Google Scholar

Vogel, A., Mellergaard, C., & Frederiksen, K. S. (2025). Different language profiles on neuropsychological tests in dementia with Lewy bodies and Alzheimer’s disease. Applied Neuropsychology: Adult, 32(4), 1171–1178.10.1080/23279095.2023.2247112CrossRef Google Scholar PubMed

Weeks, S. K., McGann, P. E., Michaels, T. K., & Penninx, B. W. J. H. (2003). Comparing various short-form geriatric depression scales leads to the GDS-5/15. Journal of Nursing Scholarship, 35(2), 133–137.10.1111/j.1547-5069.2003.00133.xCrossRef Google Scholar

Williams, B. W., Mack, W., & Henderson, V. W. (1989). Boston naming test in Alzheimer’s disease. Neuropsychologia, 27(8), 1073–1079.10.1016/0028-3932(89)90186-3CrossRef Google Scholar PubMed

Williamson, D. J. G., Adair, J. C., Raymer, A. M., & Heilman, K. M. (1998). Object and action naming in Alzheimer’s disease. Cortex, 34(4), 601–610.10.1016/S0010-9452(08)70517-3CrossRef Google Scholar PubMed

Winblad, B., Palmer, K., Kivipelto, M., Jelic, V., Fratiglioni, L., Wahlund, L.-O., Nordberg, A., Bäckman, L., Albert, M., Almkvist, O., Arai, H., Basun, H., Blennow, K., De Leon, M., DeCarli, C., Erkinjuntti, T., Giacobini, E., Graff, C.,Hardy, J., …Petersen, R.C. (2004). Mild cognitive impairment – beyond controversies, towards a consensus. Report of the International Working Group on Mild Cognitive Impairment. Journal of Internal Medicine, 256(3), 240–246.Google Scholar PubMed

World Health Organization. (2021). Global Status Report on the Public Health Response to Dementia (1st ed.). World Health Organization.Google Scholar

Wright, L. M., De Marco, M., & Venneri, A. (2023). Current understanding of verbal fluency in Alzheimer’s disease: Evidence to date. Psychology Research and Behavior Management, 16, 1691–1705.10.2147/PRBM.S284645CrossRef Google Scholar PubMed

Figure 1. Examples of items from C-CLNT20 (bone and fly) and NAME20 (butcher and nose). Items are reproduced from the original C-CLNT and NAME papers (Franzen et al., 2023 and Nielsen et al., 2023) with permission from the authors.

Table 1. Participant characteristics and test performance (n = 161)

Figure 2. ROC-curves for C-CLNT20 and NAME20 for dementia.

Table 2. Diagnostic accuracy

Table 3. Logistic regression analyses for diagnosis of dementia in the full sample (n = 161)

Table 4. Logistic regression analyses for diagnosis of dementia in participants with immigrant background (n = 90)

Article contents

Assessing anomia across cultures and languages: A head-to-head comparison of abbreviated versions of the Copenhagen Cross-Linguistic Naming Test and Naming Assessment in Multicultural Europe in a multicultural memory clinic population

Abstract

Keywords

Information

Statement of Research Significance

Introduction

Materials and methods

Participants

Procedure

Measures

C-CLNT20

NAME20

Other measures

Statistical analyses

Results

Participant characteristics

Construct validity of the abbreviated versions

Diagnostic accuracy

Influence of demographic and cultural factors on diagnostic accuracy

Discussion

Funding statement

Competing interests

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests