Statement of research significance
Research question(s) or topic(s): Cognitive deficits are frequently identified in cases of attention deficit hyperactivity disorder (ADHD), particularly in Executive Functions and Theory of Mind. The correlation between these cognitive domains has been observed in other clinical groups. However, its presence and magnitude in ADHD remain to be fully understood. To enhance clinical reasoning and inform future interventions, we analyzed this correlation by investigating its magnitude, potential differences across developmental stages (children and adults), and comparisons with a control group without the disorder.
Main findings: Our results confirmed a significant correlation of moderate magnitude between different components of Executive Functions and Theory of Mind. We found no differences between children and adults, nor did we observe significant differences between the ADHD and control groups.
Study contributions: This study provides new insights into Theory of Mind and Executive Function performances in ADHD and critically examines how both domains have been studied in this clinical context.
Introduction
Attention deficit hyperactivity disorder (ADHD) is characterized by a persistent inattention and/or hyperactivity-impulsivity behavioral pattern that leads to a significant functional impairment (Antshel & Barkley, Reference Antshel and Barkley2020). It is one of the most prevalent neurodevelopmental disorders, with global rates reaching 7.6% in children/adolescents (Salari et al., Reference Salari, Ghasemi, Abdoli, Rahmani, Shiri, Hashemian, Akbari and Mohammadi2023) and 6.7% in adults (Song et al., Reference Song, Zha, Yang, Zhang, Li and Rudan2021). Deficits in Executive Functions (EF) are common and significant in ADHD (Silverstein et al., Reference Silverstein, Faraone, Leon, Biederman, Spencer and Adler2018). EFs can be understood as a set of mental abilities that regulate behavior, providing the individual with self-control, concentration, focus, and adaptability in the face of changes in context (Diamond, Reference Diamond2013). Several theoretical models have been developed to explain this construct and its subdomains (e.g., Baddeley, Reference Baddeley1996; Barkley, Reference Barkley1997; Diamond, Reference Diamond2013; Miyake et al., Reference Miyake, Friedman, Emerson, Witzki, Howerter and Wager2000; Stuss & Alexander, Reference Stuss and Alexander2007; Zelazo et al., Reference Zelazo, Muller, Frye and Marcovitch2003), with the most commonly associated deficits in ADHD being inhibitory control (Jiménez-Figueroa et al., Reference Jiménez-Figueroa, Ardila-Duarte, Pineda, Acosta-López, Cervantes-Henríquez, Pineda-Alhucema, Cervantes-Gutiérrez, Quintero-Ibarra, Sánchez-Rojas, Vélez and Puentes-Rozo2017), working memory (Kofler et al., Reference Kofler, Singh, Soto, Chan, Miller, Harmon and Spiegel2020), cognitive flexibility (Roshani et al., Reference Kofler, Singh, Soto, Chan, Miller, Harmon and Spiegel2020), planning (Willcutt et al., 2005), and attention processes (Miklós et al., Reference Miklós, Futó, Komáromy and Balázs2019).
Alterations in these EF subdomains imply impaired self-regulation and social interaction (Barkley, Reference Barkley2012). Furthermore, other impairments in the cognitive sphere are also observed in ADHD, such as language (Milligan et al., Reference Milligan, Astington and Dack2007; San Juan & Astington, Reference San Juan and Astington2011), learning (Bora & Pantelis, Reference Bora and Pantelis2015; Tucci & Easterbrooks, Reference Tucci and Easterbrooks2020), memory (Imanipour et al., Reference Imanipour, Sheikh, Shayestefar and Baloochnejad2021) and social cognition (Uekermann et al., Reference Uekermann, Kraemer, Abdel-Hamid, Schimmelmann, Hebebrand, Daum, Wiltfang and Kis2010), with an emphasis on the Theory of Mind (ToM; Carlson et al., Reference Carlson, Mandell and Williams2004; Hughes, Reference Hughes1998; Milligan et al., Reference Milligan, Astington and Dack2007). It is worth noting that these cognitive domains exhibit significant and moderate correlations, particularly among individuals with ADHD. For example, Sarıyer et al. (2023) reported correlations between language skills and working memory ranging from 0.25 to 0.42. Similarly, Babarczy et al. (2024) found a correlation of 0.44 between ToM and pragmatic comprehension, while Çiray et al. (Reference Çiray, Özyurt, Turan, Karagöz, Ermiş, Öztürk and Akay2021) reported a correlation of 0.28 between ToM and text comprehension ability. Likewise, Şahin et al. (2018) identified correlations ranging from 0.27 to 0.48 between ToM and global intelligence, while Imanipour et al., (Reference Imanipour, Sheikh, Shayestefar and Baloochnejad2021) found a correlation of 0.60 between ToM and working memory.
ToM is defined as a set of skills necessary to capture information, interpret, and make inferences about the mental states, intentions, and desires of others, as well as oneself (Premack & Woodruff, Reference Premack and Woodruff1978). It is relevant in prosocial behaviors and when interacting with peers (Rix et al., 2023). Evidence shows that individuals with ADHD presented a significantly lower performance on ToM tasks than their healthy counterparts (Mary et al., Reference Mary, Slama, Mousty, Massat, Capiau, Drabs and Peigneux2015; Maoz et al., 2017). A meta-analysis including 17 studies found differences of a moderate effect size (Hedges’ g = 0.66; Nejati, Reference Nejati2022). Additionally, these impairments were found in samples from different age groups (Mary et al., Reference Mary, Slama, Mousty, Massat, Capiau, Drabs and Peigneux2015; Tatar & Cansız, Reference Tatar and Cansız2020).
The nature of ToM impairments is believed to be partially associated with pre-existing deficits in ADHD’s other central cognitive domains, such as EF (Russell, 1996). Extensive literature focuses on healthy individuals, supporting that EF, in general, and specifically its subdomains, are essential for performing the most complex mental inferences. These are also predictors of later performance in ToM tasks (Russel, 1996; Devine & Hughes, Reference Devine and Hughes2014; Putko, Reference Putko2009). However, the nature and direction of the association between EF and ToM remain undetermined (Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018). Some findings suggest a strict interdependence, where each skill reciprocally predicts the other, while others indicate partial dependence due to shared brain regions. These hypotheses may partially explain the robust correlation between EF and ToM throughout childhood, as observed in behavioral measures and findings on normative brain development.
Neuroimaging studies indicates the presence of neural networks activated both during the performance of specific ToM tasks and those related to EF (Molehnberg et al., Johnson, Henry, & Mattingley, 2016; Salehinejad et al., Reference Salehinejad, Ghanavati, Rashid and Nitsche2021). Evidence also indicates that the performance of ToM and EF tasks is mediated by cortical regions very close to each other; for example, the prefrontal cortex (specifically the medial, orbitofrontal, and dorsolateral areas) and the temporoparietal junction (Schurz et al., Reference Schurz, Radua, Aichhorn, Richlan and Perner2014). Nonetheless, depending on the type of stimulus task to which an individual is subjected, whether ToM or EF, distinct, diverse and independent cortical networks might also be activated (Molenberghs et al., 2016). These networks appear to be specially designed for mental representation or executive control (Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018), supporting the explanation of directional relationships (EF → ToM or even ToM → EF).
Pineda-Alhucema et al., (Reference Pineda-Alhucema, Aristizabal, Escudero-Cabarcas, Acosta-López and Vélez2018) conducted a systematic literature review to assess associations between EF and ToM among children with ADHD. They analyzed 15 studies, of which only eight analyzed data statistically. Inhibitory control was the EF subdomain, showing the strongest association, though working memory, cognitive flexibility, and attention were also subdomains associated with ToM. However, due to the studies’ limitations, the degree of prediction and predictability between ToM and EF was not established. To the best of our knowledge, no systematic literature reviews or meta-analyses have addressed samples of adults with ADHD to verify these associations.
Note that associations between EF and ToM are not exclusive to ADHD and have already been evidenced among healthy individuals (Wilson et al., Reference Wilson, Andrews, Hogan, Wang and Shum2018; Cho & Cohen, Reference Cho and Cohen2019) and other clinical groups, though with specificities. For example, such a correlation in Autism Spectrum Disorder is more robust in tasks that require social, linguistic, and emotional recognition skills for mental inferences (Jones et al., Reference Jones, Simonoff, Baird, Pickles, Marsden, Tregay and Charman2017). It tends to be more expressive in Schizophrenia, when the individual is faced with tasks that require inhibitory control and cognitive flexibility (López-Navarro, Reference López-Navarro2018; Li et al., Reference Li, Hu, Deng, Tao, Hu, Yang and Zhang2017). Hence, it is relevant to verify such an association in ADHD and whether any EF subdomain presents more significant correlations with ToM.
Although it remains uncertain whether EF is best represented by a single central executive component or by interactions between specialized, potentially dissociated from each other, this topic continues to be debated. Nonetheless, the findings highlight the importance of assessing EF at a more specific level of analysis, as different disorders involving EF impairments may exhibit distinct executive profiles (Diamond, Reference Diamond2013; Ozonoff & Jensen, Reference Ozonoff and Jensen1999; Robbins, Reference Robbins1996).
Another matter involves the developmental aspect: does the association between these domains persist throughout one’s life cycle and maintain the same specificities? Studies comparing different age groups indicate that specific deficits in EF subdomains explain the deterioration of ToM skills as age advances (Cho & Cohen, Reference Cho and Cohen2019). Phillips et al. (Reference Phillips, Bull, Allen, Insch, Burr and Ogg2011) verified the influence of age on EF and whether this influence could explain one’s performance on ToM tasks in a sample of young, middle-aged, and older adults. They concluded that differences between age groups in the performance of ToM tasks decreased significantly when the EF variable was controlled (especially working memory). Wilson et al. (Reference Wilson, Andrews, Hogan, Wang and Shum2018), in turn, assessed the performance of children and adolescents and found significant correlations between increased age and decreased errors when performing EF tasks, resulting in more accurate and faster responses and, consequently, an increased number of correct answers in ToM tasks. Therefore, this evidence shows the relevance of investigating the role of age in moderating the association between EF and ToM in more depth.
Given the foregoing arguments, this study’s primary objective was to conduct a systematic literature review, followed by a meta-analysis, to identify the magnitude of the associations between EF and ToM among individuals with ADHD. The secondary objective was to verify whether there are differences in the magnitudes of the correlations and variations depending on the different EF subdomains and developmental stages (children vs. adults). Additionally, as part of this objective, correlations with groups of healthy individuals were verified.
Based on previous studies conducted with healthy participants across different age groups and various clinical conditions (Chainay & Gaubert, Reference Chainay and Gaubert2020; Devine & Hughes, Reference Devine and Hughes2014; Jones et al., Reference Jones, Simonoff, Baird, Pickles, Marsden, Tregay and Charman2017; Kong et al., Reference Kong, Koo, Seo, Park, Lee and An2021; Lochmann et al., Reference Lochmann, Wyrobnik, Kupper, Rewitzer and Klostermann2023), it is hypothesized that there is a correlation between the constructs, with at least a moderate magnitude. Additionally, the magnitude of these correlations is expected to vary significantly depending on age group (Ferguson et al., Reference Ferguson, Brunsdon and Bradford2021; Meinhardt-Injac et al., Reference Meinhardt-Injac, Daum and Meinhardt2020), specific EF subdomain (Carlson et al., Reference Carlson, Moses and Breton2002; Pineda-Alhucema et al., Reference Pineda-Alhucema, Aristizabal, Escudero-Cabarcas, Acosta-López and Vélez2018), and clinical condition (Tatar & Cansız, Reference Tatar and Cansız2020).
Method
This study complied with the methodological guidelines recommended by the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA – Shamseer et al., Reference Shamseer, Moher, Clarke, Ghersi, Liberati and Petticrew2015) and was registered on the PROSPERO platform (CRD42023449055). A specialist librarian recommended a systematic research strategy to identify the studies in the following databases: PubMed, Web of Science, Embase, Scopus, PsycInfo, Redalyc, and Scielo (the last search was performed on January 8, 2024). The following combinations of keywords were based on the study by Pineda-Alhucema et al. (Reference Pineda-Alhucema, Aristizabal, Escudero-Cabarcas, Acosta-López and Vélez2018): (“attention deficit disorder with hyperactivity” OR “attention deficit disorders with hyperactivity” OR “attention deficit disorder” OR “attention deficit disorders” OR “attention deficit hyperactivity disorder” OR “attention deficit hyperactivity disorders” OR “ADHD” OR “ADDH” OR “hyperactivity” AND “theory of mind” OR “social cognition” OR “mentalizing” OR “false belief” OR “false beliefs” OR “mental attribution” OR “mindread” OR “mindreading” OR “mental attribution” OR “ToM abilities” OR “ToM ability” OR “mind-blindness” AND “executive functions” OR “executive control” OR “inhibitory control” OR “cognitive flexibility” OR “working memory” OR “shifting” OR “planning” OR “executive functioning”).
The following were included in the review: a) original articles regardless of the date of publication, language, or methodological design; b) addressing human subjects regardless of gender or age and including at least an independent/exclusive group of people diagnosed with ADHD according to the criteria of the Diagnostic and Statistical Manual of Mental Disorders (DSM) or the International Statistical Classification of Diseases and Related Health Problems (ICD) (note - the studies were not required to have a group of healthy controls); and c) simultaneously assessing EF (and/or its subdomains) and ToM and analyzing potential associations. Exclusion criteria were articles: a) using duplicate samples; b) addressing a sample of people with ADHD and comorbidities with other neurodevelopmental disorders; and c) missing raw data (even after a request was sent to the authors by email).
Two researchers, experts in mental health (DAF and FLO), independently verified the studies’ eligibility. Disagreements were discussed until a consensus was reached. Next, a manual search for articles was conducted based on the reference lists of the studies already included. The Rayyan software (Ouzzani et al., Reference Ouzzani, Hammady, Fedorowicz and Elmagarmid2016) was used in the identification and screening stages. Figure 1 and the Supplementary Material (SM1) present the selection process.

Figure 1. Flow diagram of the process of article search and selection for the systematic review based on the PRISMA protocol (by Page et al., Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann, Mulrow, Shamseer, Tetzlaff, Akl, Brennan, Chou, Glanville, Grimshaw, Hróbjartsson, Lalu, Li, Loder, Mayo-Wilson, McDonald and Moher2021).
The authors also extracted data independently using a form addressing the following variables: a) study characteristics (author(s), year, and country of publication); b) sociodemographic and clinical characteristics of the samples with ADHD and healthy controls (number of subjects, gender, age, education, recruitment source, diagnostic criteria, severity, subtype, comorbidities, intellectual quotient, and medications in use); c) methodological characterization (methodological design, primary objective, and instruments used to assess the outcomes); and d) primary results (comparison of performance between the ADHD and control groups – in the presence of a control group, and correlation coefficients with the respective levels of significance).
The studies’ methodological quality was assessed using the Newcastle-Ottawa Scale (NOS; Peterson et al., Reference Peterson, Welch, Losos and Tugwell2011). Three parameters were considered: sample selection (four items), comparability (two items), and exposure (three items). The researchers performed the analysis independently, then reviewed the scores and resolved disagreements: the more stars, the better the study’s methodological quality.
Independent meta-analyses were conducted according to the different EF subdomains and ADHD and healthy control groups. The highest magnitude was chosen when the studies presented more than one correlation measure. Analyses were only performed for subdomains and groups addressed by at least two studies. Subgroup analyses were performed to verify the influence of age groups on the outcome of interest; samples of children and adults were considered separately.
The meta-analytic measurement was estimated, considering the correlation coefficient as the outcome. A random effects model was adopted considering the presence of heterogeneity inherent to the studies (Dersimonian & Laird, Reference DerSimonian and Laird1986). Heterogeneity between the studies was assessed using the Q test (Cochran, Reference Cochran1954) and the I2 statistic (Higgins, Reference Higgins2003), analyzed according to the following: rates close to zero indicate the absence of heterogeneity; close to 25% indicate low heterogeneity; close to 50%, moderate heterogeneity; and rates close to or above 75% indicate high heterogeneity (Borenstein et al., Reference Borenstein, Hedges, Higgins, Rothstein and Higgins2009). Student residuals and Cook’s distances were used to verify whether studies would be outliers or influential in the model’s context. Studies with a studentized residual larger than the 100 × (1–0.05/(2 × k)th percentile of a standard normal distribution were considered potential outliers (Pope, Reference Pope1976). Studies with a Cook’s distance above the median plus six times the interquartile range of the Cook’s distances were considered influential (Cook, Reference Cook1979).
The Rank Correlation and Regression Tests were used to assess the presence of potential publication biases, and the standard error of the observed results was used as a predictor to verify the asymmetry of the Funnel Plot (Sterne & Egger, Reference Sterne and Egger2001).
The meta-analytic result was interpreted according to the parameters proposed by Streiner & Norman (Reference Streiner and Norman2003): correlations of magnitude between 0–0.25 were considered absent/weak; between 0.26–0.50, moderate; between 0.51–0.70, strong; and above 0.71 were considered very strong. Jamovi, version 2.3.28, was used to perform the analyses (Jamovi Project, 2023).
Results
Fifteen articles assessing the correlations between EF and ToM in samples of people with ADHD were analyzed (Abdel-Hamid et al., Reference Abdel‐Hamid, Niklewski, Heßmann, Guberina, Kownatka, Kraemer, Scherbaum, Dziobek, Bartels, Wiltfang and Kis2019; Aydin et al., Reference Aydın, Balıkçı, Sönmez, Ünal-Aydın and Spada2021; Bigorra et al., Reference Bigorra, Garolera, Guijarro and Hervas2016; Caillies et al., Reference Caillies, Bertot, Motte, Raynaud and Abely2014; Charman et al., Reference Charman, Carroll and Sturge2001; Dyck & Piek, Reference Dyck and Piek2012; Farahi et al., Reference Farahi, Moshirian Farahi, Asghari Ebrahim Abad and Hokm Abadi2014; Imanipour et al., Reference Imanipour, Sheikh, Shayestefar and Baloochnejad2021; Lavigne et al., Reference Lavigne, González-Cuenca, Romero-González and Sánchez2020; Mary et al., Reference Mary, Slama, Mousty, Massat, Capiau, Drabs and Peigneux2015; Miranda et al., Reference Miranda, Berenguer, Roselló, Baixauli and Colomer2017; Şahin et al., Reference Şahin, Hoşoğlu and Önal2021; Tatar & Cansiz, Reference Şahin, Hoşoğlu and Önal2021; Thoma et al., Reference Thoma, Sonnenburg, Marcinkowski, Juckel, Edel and Suchan2020; Yılmaz et al., Reference Yılmaz, Karaş and Tan2019). Of these, seven separately assessed the outcome of interest in samples of healthy controls (Charman et al., Reference Charman, Carroll and Sturge2001; Dyck & Piek, Reference Dyck and Piek2012; Caillies et al., Reference Caillies, Bertot, Motte, Raynaud and Abely2014; Farahi et al., Reference Farahi, Moshirian Farahi, Asghari Ebrahim Abad and Hokm Abadi2014; Thoma et al., Reference Thoma, Sonnenburg, Marcinkowski, Juckel, Edel and Suchan2020; Thoma et al., Reference Thoma, Sonnenburg, Marcinkowski, Juckel, Edel and Suchan2020; Imanipour et al., Reference Imanipour, Sheikh, Shayestefar and Baloochnejad2021). The studies were published from 2001 onwards, and the number of publications on this topic significantly increased in the last five years (N = 8; 53.3%), with 80% of the studies conducted in Euro-Asian countries (N = 12). Details about the sampling of the selected articles are presented in Table 1 and SM2.
Table 1. Main demographic and clinical characteristics of study samples included in the systematic review (N = 15)

Note. AD = Anxiety Disorder; ADs = Antidepressant; ADHS-SB = Attention Deficit/Hyperactivity Disorder Scale – Self Assessment; ASRS = Adult Attention-Deficit/ Hyperactivity Disorder Self-Report Scale; Aps = Antipsychotics; AUS = Australia; BEL = Belgium; CPRS-R = Connor’s Parents Rating Scale – Revised; DBRS-PV = Disruptive Behavior Rating Scale- Parent Version; DD = Depressive Disorder; DSM-5 = Diagnostic and Statistical Manual of Mental Disorders, 5th Edition; DSM-IV = Diagnostic and Statistical Manual of Mental Disorders, 4th Edition; DSM-IV-TR = Diagnostic and Statistical Manual of Mental Disorders, 4th Edition -Text Revision; ED = Elimination Disorder; ENG = England; F = Female; FRA = France; GAD = Generalized Anxiety Disorder; GER = Germany; ICD-10 = International Classification of Diseases, 10th Edition; IRA = Iran; LD = Learning Disorder; M = male; N = Number; NI = Not Informed; OCD = Obsessive Compulsive Disorder; ODD = Oppositional Defiant Disorder; PS = Psychostimulants; SAD = Social Anxiety Disorder; SCID-I/II = Structured Clinical Interview form for DSM-IV-TR axis I and II disorders; SD = Standard Deviation; SDr = Sleep Disorder; SPA = Spain; SWAN = Strengths and Weaknesses of ADHD-Symptoms and Normal Behavior Rating Scale based from DSM-IV; TUR = Turkey; WURS = Wender Utah Rating Scale; WURS-K = Wender Utah Rating Scale – Short Form.
x̅ = average
*The authors indicated that medications were withdrawn at least 24 hours before the assessment.
Table 1 and SM2 show a total of 560 subjects with ADHD and 377 healthy controls. The smallest sample comprised 15 participants and the largest 85; on average, there were 20.7 (± 12.7) participants with ADHD and 18 (± 9.3) in control groups. All studies addressed samples recruited in outpatient settings. On average, the adult participants were 28, and children were 9.5 years old (range 7–14). All studies included samples of both genders, except for Charman et al. (Reference Charman, Carroll and Sturge2001), in which the sample exclusively comprised male participants.
Regarding years of schooling, the adult participants with ADHD had 13 years on average, and the children had attended primary school. Only 60% of the studies (n = 9) assessed the participants’ intellectual quotient. Standardized instruments were used, and no impairment was found; the other studies did not provide such information.
ADHD was diagnosed or excluded mostly through clinical assessment performed by different mental health professionals (n = 11) or by consulting medical records (n = 03). The primary references used to verify and fulfill the diagnostic criteria were the DSM-IV-TR (n = 5) and DSM-5 (n = 7), often used together with other scales assessing the signs and symptoms of ADHD (n = 7). The ADHD subtype was specified in nine studies, with participants of the combined subtype predominating (n = 218 subjects). The severity of ADHD was seldom explored or reported (n = 6). Most studies (n = 13) investigated the presence of comorbidities with other neuropsychiatric conditions, but most participants did not present them. Note that only four studies addressed participants in the ADHD group who were medication-free or drug-naïve. The participants in 53% of the studies (n = 8) were using psychostimulants, which were interrupted 24 hours before the day of data collection; the other studies (n = 03) failed to report this information.
All studies, except Bigorra et al. (Reference Bigorra, Garolera, Guijarro and Hervas2016), adopted a cross-sectional observational design. Regarding methodological quality, only two of the cross-sectional studies achieved a quality percentage above 75% (Charman et al., Reference Charman, Carroll and Sturge2001; Aydın et al., Reference Aydın, Balıkçı, Sönmez, Ünal-Aydın and Spada2021). Half of the studies (n = 7) met between 56 and 67% of the quality criteria. The weakest points were in the items: a) “Exposure - Non-response rate,” given that no study presented the rates of participant loss; b) “Selection - representativeness of the cases,” since the samples in 80% of the studies were not representative or the information was not available; c) “Selection– definition controls,” as only 50% of the studies (n = 7) ensured that the control participants were selected from the same population as the cases; d) “Exposure – validated instruments,” given that less than half of the studies (n = 6) used instruments with adequate psychometric qualities to evaluate outcomes (EF and ToM). Further details are presented in SM3.
Regarding the outcomes of interest, EF was assessed through the following subdomains (non-exclusive categories): inhibitory control (N = 11; 73%), working memory (N = 8; 53%), cognitive flexibility (N = 5; 33%), attention processes (N = 4; 27%), planning (N = 3; 20%), and decision-making (N = 1; 7%). Different instruments were used to evaluate this outcome (Table 2); several studies adopted more than one instrument per subdomain (N = 12).
Table 2. Methodological aspects and main results of the studies included in the systematic review (N = 15)

Note. ADHD = Attention Deficit Hyperactivity Disorder; BRIEF = Behavior Rating Inventory of Executive Function; EF = Executive Function; HC = Health Control; IQ = Intelligence Quotient; MASC = Movie for Assessment of Social Cognition; NEPSY = Neuropsychological Assessment Battery) ns = not significative; O/CS = Observational / Cross-Sectional; RCT = Randomized Controlled Trial; RMET = Reading the Mind in the Eyes Task; TAP = Test for Attention Performance; ToM = Theory of Mind; WAIS-R = Wechsler Adult Intelligence Scale, Revised; WISC-IV = Wechsler Intelligence Scale for Children, Fourth Edition; WMS-III = Wechsler Memory Scale, Third Edition.
* The levels of ToM were classified using Castelli et al. (Reference Castelli, Pini, Alberoni, Liverta-Sempio, Baglio, Massaro and Nemni2011) as a reference.
Likewise, various instruments assessed the ToM domain (see Table 2). As proposed by Castelli et al. (Reference Castelli, Pini, Alberoni, Liverta-Sempio, Baglio, Massaro and Nemni2011), all studies adopted at least one instrument classified as “ToM - Higher-order,” which assesses the ability of an individual to understand the behavior of others in social situations involving complex mental states, such as gaffes, bluffs, and lies. The Reading the Mind in the Eyes Test (RMET; N = 7; 46%) and the Happé Strange Stories (N = 3; 20%) were the most frequently used. Six studies adopted more than one instrument to assess ToM (Table 2). Except for Miranda et al. (Reference Miranda, Berenguer, Roselló, Baixauli and Colomer2017), which used a self-report measure to assess EF, all other studies relied on performance-based measures for evaluating EF or ToM.
Regarding the results, each study’s data are described in detail in Table 2. All meta-analytic measurements are summarized in Table 3, Figures 2 and 3, and SM 4.

Figure 2. Metanalytic results of the correlations between Executive Functions (Subdomains of inhibitory control and working memory) and Theory of Mind (ToM) for ADHD group. note. A = adults≥18 years; C = children < 18 years; HC = healthy control; ADHD = attention deficit hyperactivity disorder. A - inhibitory control subdomain versus ToM in the total sample of ADHD; B - working memory subdomain versus ToM in the total sample of ADHD.

Figure 3. Metanalytic results of the correlations between executive functions (Subdomains of cognitive flexibility, planning and attention process) and Theory of Mind for attention deficit hyperactivity disorder group.
Table 3. Results of metanalytic measurements involving correlations between different domains of Executive Functions (EF) and Theory Of Mind (ToM)

Note. A = Adults samples; ADHD = Attention Deficit Hyperactivity Disorder samples; C = Children samples; CI = Confidence Interval; Classif. = classification of the magnitude of correlations based on Streiner & Norman (Reference Streiner and Norman2003); HC = Healthy Controls samples; I2 = I-squared; NS = Not Significant; Q = Cochran’s Q test; ToM = Theory of Mind.
a This study, if excluded, would not significantly alter the metanalytic measure; therefore, it was retained in the analysis because there was no justification for its exclusion.
- The analysis was not conducted due to lack of studies or the presence of only one study.
The meta-analytic correlation measures between ToM and EF were significant for all subdomains and presented a moderate magnitude in the ADHD groups. Subgroup analyses indicated the absence of significant differences between the correlations found between the samples of children and adults for the inhibitory control and working memory subdomains. Due to the restricted number of studies, such analysis cannot be performed for the other subdomains (cognitive flexibility, planning, and attention process). No heterogeneity was found between the studies for the planning and attention. However, moderate/high heterogeneity was found for the inhibitory, working memory, and cognitive flexibility subdomains and it did not decrease significantly in the subgroup analyses.
The correlations between ToM and EF in the healthy control groups were also significant. Although these correlations are of weak magnitude, they do not differ statistically from those observed in the ADHD group, as indicated by comparing confidence intervals in different analyses.
No indicators of potential publication bias were found for any of the study groups (Rank Correlation Test: p > 0.08; Regression Test: p > 0.06, Funnel Plot without indicators of asymmetry – see SM4A).
Discussion
The results of this systematic literature review with meta-analysis revealed significant and direct associations with moderate magnitudes between the different subdomains of EF and ToM among individuals with ADHD. No significant differences were found in the magnitudes of the correlations when comparing the children and adult samples. The correlations were also significant in the groups with healthy subjects. Despite a tendency for these magnitudes to be lower among the controls, no statistical differences were found from those presented by the ADHD group.
This study’s results reinforce evidence in the literature that has already indicated an association between these two cognitive domains in other clinical samples, such as ASD (Jones et al., Reference Jones, Simonoff, Baird, Pickles, Marsden, Tregay and Charman2017) and Schizophrenia (Li et al., Reference Li, Hu, Deng, Tao, Hu, Yang and Zhang2017), and also in non-clinical groups (Cho & Cohen, Reference Cho and Cohen2019; Wilson et al., Reference Wilson, Andrews, Hogan, Wang and Shum2018). However, researchers have differing views on the direction of the relationship between EF and ToM. This divergence can be partly attributed to methodological limitations in the field (Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018), as most studies, including those in this review, are correlational and do not allow for the establishment of cause-effect relationships.
Authors such as Russell (1996) argue that EF precedes and promotes the emergence of ToM, as a certain level of executive skills appears necessary to construct complex concepts of mental inferences. They consider that the ability to engage in goal-directed actions, inhibit responses, and overcome cognitive rigidity, among others, is necessary to reflect abstractly about the mental state of others and oneself. For Rowe et al. (Reference Rowe, Bullock, Polkey and Morris2001), EF favors a pragmatic understanding of a narrative, which consequently facilitates understanding non-explicit meanings, grasping different perspectives, and obtaining a deeper understanding of other peoples’ intentions, emotions, and mental states.
On the other hand, Perner & Lang (Reference Perner and Lang1999) consider that the relationship between these skills occurs in the opposite direction. They argue that a minimum level of understanding and representation of mental states (metarepresentation) is required to achieve better executive control. For Hughes & Ensor (Reference Hughes and Ensor2007), this association is explained by the fact that the two abilities share, at least in part, common aspects, such as the need for a conditional reasoning structure (Putko, Reference Putko2009) or common or strongly related brain structures (Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018). However, studies with predictive analysis, such as that by Hughes & Ensor (Reference Hughes and Ensor2007), Müller et al. (Reference Müller, Liebermann-Finestone, Carpendale, Hammond and Bibok2012), and Austin et al. (Reference Austin, Groppe and Elsner2014), showed greater support for the initial proposition that EF promotes performance in ToM tasks rather than to the proposition that ToM is a prerequisite for the development of EF.
Some studies indicate that the associations between EF and ToM are multifaceted and occur with specificities between the different subdomains (Hughes, Reference Hughes1998). For example, for Rowe et al. (Reference Rowe, Bullock, Polkey and Morris2001), inhibitory control plays a central role in understanding false beliefs, as the individual needs to inhibit his/her context’s knowledge to infer the other person’s belief. Working memory and cognitive flexibility gains importance in the face of second-order or advanced ToM tasks, which require the individual to have the ability to gather relevant information from the context and the flexibility to switch between his/her mental states and those of others.
The correlations between the different subdomains of EF and ToM showed similar magnitudes in this study, with inhibition and working memory being the most frequently studied subdomains. However, specific correlations between the EF subdomains and the different levels of mental attributions of ToM tasks were not analyzed, given that these levels were mainly of advanced order. The previous review by Pineda-Alhucema et al. (Reference Pineda-Alhucema, Aristizabal, Escudero-Cabarcas, Acosta-López and Vélez2018), which exclusively addressed child samples, presented findings very similar to this study’s, as all EF subdomains correlated with ToM. Based on a qualitative analysis of the study results, Pineda-Alhucema et al. (Reference Pineda-Alhucema, Aristizabal, Escudero-Cabarcas, Acosta-López and Vélez2018) also highlighted a stronger correlation between ToM and the inhibitory control subdomain.
It is also worth noting that the magnitudes of the correlations found in this study for samples with ADHD are very close to those evidenced in other meta-analyses that also intended to verify correlations between EF and ToM in other clinical groups. The meta-analytic correlation measure found by Joseph & Tager-Flusberg (Reference Joseph and Tager-flusberg2004) when addressing ASD samples was 0.59 (p < 0.01). The meta-analysis by Thibaudeau, Achim, Parent, Turcotte, & Cellard (2020) showed a measure of 0.30 (95%CI: .26 to .32, p < 0.01) for schizophrenia, while Devine & Hughes (Reference Devine and Hughes2014) conducted a meta-analysis and found a value of 0.38 (95%CI: 0.35 to 0.41) for healthy controls. These findings are noteworthy, as EF is the cognitive domain most frequently impaired in ADHD, which suggest that more robust correlations with ToM might be expected in this clinical group, particularly in comparison to other disorders where EF is not the primary area of impairment. Future studies are suggested to analyze in more depth whether this relationship suffers specific influences depending on the clinical group and the severity of the pathology.
Another aspect to be highlighted is that a moderate magnitude of the correlation between these two domains is found in most of the literature. Such a finding is supported by studies involving neuroimaging, which verified the same activations of specific brain regions (e.g., prefrontal cortex, temporoparietal junction) in both EF and ToM tasks, indicating the existence of neural network operations mediating both abilities (Molenberghs et al., Reference Molenberghs, Johnson, Henry and Mattingley2016; Menon & D’Esposito, Reference Menon and D’Esposito2021). At the same time, findings regarding the activation of independent regions also indicated the existence of specific neural processes involved in each domain (Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018). Furthermore, other factors are known to directly or indirectly influence ToM skills, explaining part of the variability in the performance of tasks that assess them, such as language (Devine & Hughes, Reference Devine and Hughes2014; Wade et al., Reference Wade, Prime, Jenkins, Yeates, Williams and Lee2018), learning (Bora & Pantelis, Reference Bora and Pantelis2015; Tucci & Easterbrooks, Reference Tucci and Easterbrooks2020), and cultural context (Sabbagh et al., Reference Sabbagh, Xu, Carlson, Moses and Lee2006), among others.
The age variable might be another mediating factor of EF and ToM skills (Ferguson et al., Reference Ferguson, Brunsdon and Bradford2021; Meinhardt-Injac et al., Reference Meinhardt-Injac, Daum and Meinhardt2020), as maturational and developmental aspects are closely linked to the acquisition and impairment of these skills (Gigi & Papirovitz, Reference Gigi and Papirovitz2022). Devine and Hughes (Reference Devine and Hughes2014) consider it important to investigate whether the associations between EF and ToM become more specific with age as EF develops. For example, studies analyzing the associations between ToM, aging, and EF emphasize the role of inhibition (Bailey & Henry, Reference Bailey and Henry2008) and working memory (Phillips et al., Reference Phillips, Bull, Allen, Insch, Burr and Ogg2011) in differences in the performance of ToM tasks, depending on age.
Therefore, this study tested whether age significantly influences the magnitude of correlations between these skills. This aspect could only be analyzed in the inhibitory control and working memory subdomains, and contrary to previous studies, no significant differences were found when comparing children and adult samples. Although this finding is not consensual in the literature (Bailey & Henry, Reference Bailey and Henry2008), such differences may not have been captured by the studies addressed here, as they predominantly assessed 9-10 years old and young adults aged 28 on average. There were no samples from more extreme age groups where specific developmental differences could be observed. Additionally, some samples included participants within age ranges that could exhibit differences, even if subtle, in EF development (e.g., Dyck & Piek, Reference Dyck and Piek2012; Imanipour et al., Reference Imanipour, Sheikh, Shayestefar and Baloochnejad2021; Sahin et al., 2021—groups aged 7-12). These aspects should be taken into account. According to Best and Miller (Reference Best and Miller2010), EF develops rapidly during childhood, with the most pronounced age effects observed in children aged 5–7; moderate effects are found among those aged 8–15, and adolescents aged 15–17 show minor effects. Hence, future studies should investigate this aspect further.
The studies analyzed here presented some limitations that must be accounted for in the generalization of results. As previously mentioned, one limitation concerns the samples, which were poorly represented by children in the early stages of development and older adults. Moreover, clinical samples were poorly identified according to the subtype of ADHD and were predominantly composed of people on the continuous use of medication to control ADHD symptoms.
Another important limitation concerns the lack of consensus on the understanding and systematizing of the EF structure, subdomains, and their measurement instruments. According to Snyder et al. (Reference Snyder, Miyake and Hankin2015), several neuropsychological instruments used to evaluate EF assess more than one of their subdomains, as well as non-EF-related skills. In their view, such a context favors inconsistency of responses and low reliability and sensitivity of the measure of interest, which can impact the evaluation of the construct and the level of correlation with other measures.
Furthermore, self-report and performance-based instruments may assess EF differently (individual perception vs. objective measures of executive skills), potentially leading to biased results. Previous studies have shown inconsistent relationships between these measures, ranging from no correlation to moderate correlations, suggesting that self-report measures have limited ability to assess EF cognitive performance (Heilmann, Reference Heilmann2022). Nevertheless, since the analyses did not indicate Miranda et al. (Reference Miranda, Berenguer, Roselló, Baixauli and Colomer2017) - the only study using self-report measures - as an outlier, it was retained in the dataset. However, researchers should consider this variable in future studies.
This framework hinders more robust analyses, such as meta-regressions, due to the considerable variability of instruments and the scarcity of studies accounting for these measurement specificities. Future research should further explore and systematize these aspects.
Other limitations relate the instruments used to assess ToM, as many of them contain verbally demanding tasks. Given that verbal difficulties are relatively common in ADHD, verbal assessments may introduce bias. Furthermore, Quesque & Rossetti (Reference Quesque and Rossetti2020) point out that not all of ToM instruments require the participants to engage in mental representation or distinguish between their mental states and those of others. These instruments fail to meet the two main criteria characteristic of ToM measures: “mentalization” and “nonmerging.” In this sense, it is worth highlighting the instrument most frequently used in the studies reviewed—the RMET. We argue that the RMET cannot be considered a ToM instrument, as it does not meet either of the previously mentioned criteria; rather, it is regarded as a measure of visual discrimination and state identification (Quesque & Rossetti, Reference Quesque and Rossetti2020). However, other researchers contend that the RMET enables individuals to make mental inferences through the direct reading of facial expressions, and for this reason, it serves as a ToM index (Baron‐Cohen et al., Reference Baron‐Cohen, Wheelwright, Hill, Raste and Plumb2001; Putko, Reference Putko2009).
It should also be noted that many of the instruments used to assess both outcomes, especially ToM, lacked adequate psychometric indicators, which affects the methodological quality of the studies. These observations highlight the need to strengthen the theoretical foundation of the concepts, terminologies, and instruments related to EF and ToM. Addressing these challenges remains a key priority for researchers in the field.
The conclusion is that this study’s hypotheses were partially confirmed, as the association between EF and ToM was significant with a moderate effect size. However, no significant differences were found based on age, ADHD, or the specific executive function domain assessed. Future studies should focus on expanding the age groups and addressing the previously mentioned limitations and confounding variables regarding the constructs and instruments to achieve more robust conclusions.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S1355617725000190.
Availability of data and materials
This metanalysis includes detailed search parameters and MeSH terms to ensure replicability of the search and findings. The data supporting the findings of this study are available from the corresponding author upon reasonable request.
Authors contributions
All authors contributed to the study’s conception and design. DAF and FLO prepared the material, collected data, and analyzed it. All authors read and approved the final manuscript.
Financial funding
This work was supported by Coordination for the Improvement of Higher Education Personnel / Academic Excellence Program (CAPES/ PROEX) Process N°: 88,887.701264/2022-00.
Competing interests
The authors declare no conflict of interest.
Role of the sponsor
The supporters had no role in this study’s design, analysis, interpretation, or publication.