Introduction
Although post-traumatic stress disorder (PTSD) can be addressed with a variety of treatments, even gold standard options fail to significantly improve symptoms in up to half of patients (Steenkamp, Litz, Hoge, & Marmar, Reference Steenkamp, Litz, Hoge and Marmar2015; Steenkamp, Litz, & Marmar, Reference Steenkamp, Litz and Marmar2020). The heterogeneity of PTSD is becoming increasingly recognized as one impediment to treatment success (Cloitre, Reference Cloitre2015; Herzog & Kaiser, Reference Herzog and Kaiser2022; Steenkamp & Litz, Reference Steenkamp and Litz2014). PTSD can exhibit a variety of symptom combinations, comorbidities, personality features, neurobiological correlates, and other characteristics (Morgan-Lopez et al., Reference Morgan-Lopez, Killeen, Saavedra, Hien, Fitzpatrick, Ruglass and Back2020; Young, Lareau, & Pierre, Reference Young, Lareau and Pierre2014), and at least some expressions appear to be partially undergirded by causes not well-suited to current PTSD treatments (Campbell, Trachik, Goldberg, & Simpson, Reference Campbell, Trachik, Goldberg and Simpson2020; Campbell-Sills et al., Reference Campbell-Sills, Sun, Choi, He, Ursano, Kessler and Stein2021). Developing alternative treatments for these differing presentations, and identifying the patients that best respond to them, could significantly increase the proportion of PTSD patients who experience symptom relief (Siegel & Laska, Reference Siegel and Laska2022). Such efforts are consistent with the broader goals of precision psychiatry to more effectively match patients to treatments tailored to their specific symptoms (Quinlan et al., Reference Quinlan, Banaschewski, Barker, Bokde, Bromberg, Buchel and Consortium2020; Scala, Ganz, & Snyder, Reference Scala, Ganz and Snyder2023).
One common PTSD correlate shown to impact treatment responsiveness is impaired executive functioning (EF). EF broadly refers to the ability to maintain control of complex goal-directed behavior (Alvarez & Emory, Reference Alvarez and Emory2006; McCabe et al., Reference McCabe, Roediger, McDaniel, Balota and Hambrick2010), and is typically measured by performance in areas like working memory, attention, response inhibition, flexibility, and planning (Aupperle, Melrose, Stein, & Paulus, Reference Aupperle, Melrose, Stein and Paulus2012). Individuals with PTSD consistently exhibit lower EF compared to controls (Polak, Witteveen, Reitsma, & Olff, Reference Polak, Witteveen, Reitsma and Olff2012; Woon et al., Reference Woon, Farrer, Braman, Mabey and Hedges2017). Moreover, impoverished EF is believed to exacerbate PTSD symptoms by disrupting the ability to disengage from trauma-related memories and reminders or inhibit the negative emotions they elicit (Aupperle et al., Reference Aupperle, Melrose, Stein and Paulus2012; Bomyea & Lang, Reference Bomyea and Lang2015). Neurobiologically, impoverished EF putatively reflects dysfunction in frontal areas necessary for PTSD recovery processes like fear extinction (Quirk, Garcia, & Gonzalez-Lima, Reference Quirk, Garcia and Gonzalez-Lima2006; Wicking et al., Reference Wicking, Steiger, Nees, Diener, Grimm, Ruttorf and Flor2016) and emotion regulation (Andrewes & Jenkins, Reference Andrewes and Jenkins2019; New et al., Reference New, Fan, Murrough, Liu, Liebman, Guise and Charney2009). Importantly, first-line PTSD treatments do not directly address deficient EF (Crocker et al., Reference Crocker, Jurick, Thomas, Keller, Sanderson-Cimino, Boyd and Jak2018; Haaland, Sadek, Keller, & Castillo, Reference Haaland, Sadek, Keller and Castillo2016), which is among the strongest predictors of nonresponse to traditional trauma-focused therapies according to a recent meta-analysis (Keyan et al., Reference Keyan, Garland, Choi-Christou, Tran, O’Donnell and Bryant2024). Hence, PTSD treatments that directly target EF deficits may bring relief to a subgroup of patients that would not typically respond to standard interventions.
Cognitive training has emerged as an increasingly popular tool to enhance cognitive functioning (Jaeggi, Buschkuehl, Jonides, & Shah, Reference Jaeggi, Buschkuehl, Jonides and Shah2011; Keshavan et al., Reference Keshavan, Vinogradov, Rumsey, Sherrill and Wagner2014). Cognitive training for EF has been shown to effectively improve EF abilities in a variety of age groups (Ludyga et al., Reference Ludyga, Held, Rappelt, Donath and Klatt2023; Nguyen, Murphy, & Andrews, Reference Nguyen, Murphy and Andrews2019; Wass, Reference Wass2015; Wass, Scerif, & Johnson, Reference Wass, Scerif and Johnson2012) and in neurodevelopmental or neurocognitive disorders that may be marked by EF impairments (Chen et al., Reference Chen, Yu, Zhang, Zhang, Zhang and Wang2021; Pasqualotto et al., Reference Pasqualotto, Mazzoni, Bentenuto, Mule, Benso and Venuti2021; van de Ven, Murre, Veltman, & Schmand, Reference van de Ven, Murre, Veltman and Schmand2016). To the extent that low EF impedes recovery from PTSD symptoms, intervening upon EF should improve PTSD symptoms. Indeed, several studies have found computerized EF training to produce greater PTSD symptom improvement than control or alternative training interventions (Badura-Brack et al., Reference Badura-Brack, Naim, Ryan, Levy, Abend, Khanna and Bar-Haim2015; Bomyea, Stein, & Lang, Reference Bomyea, Stein and Lang2015; Echiverri-Cohen et al., Reference Echiverri-Cohen, Spierer, Perez, Kulon, Ellis and Craske2021). Additionally, EF training has been found to enhance the psychological processes theorized to mediate PTSD recovery (e.g. regulation of intrusive thoughts: [Bomyea & Amir, Reference Bomyea and Amir2011]; emotion regulation: [Sabzi, Mihandoost, Nademi, & Parandin, Reference Sabzi, Mihandoost, Nademi and Parandin2022; Schweizer et al., Reference Schweizer, Grahn, Hampshire, Mobbs and Dalgleish2013]) or their associated neural underpinnings (e.g. enhanced inferior frontal gyrus [IFG] activation: [Li et al., Reference Li, Wu, Yuan, Wang, Wang, Zhong and Fox2022]; increased IFG–amygdala connectivity: [Cohen et al., Reference Cohen, Margulies, Ashkenazi, Schaefer, Taubert, Henik and Okon-Singer2016]).
Although initial studies of EF training for PTSD have been promising, it is unclear which PTSD patients are most likely to benefit from this training. Recent evidence suggests that EF impairments are only present in a subgroup of PTSD patients (~15%) who also possess distinct neural characteristics and symptom trajectories relative to normal or high-EF PTSD patients (Jagger-Rickels et al., Reference Jagger-Rickels, Stumps, Rothlein, Park, Fortenbaugh, Zuberer and Esterman2021; Jagger-Rickels et al., Reference Jagger-Rickels, Rothlein, Stumps, Evans, Bernstein, Milberg and Esterman2022). As such, EF training might only benefit the minority of PTSD patients who possess the neurocognitive deficits it targets. However, trials of EF training for PTSD have typically investigated the intervention in PTSD patients as a whole and not for the specific subgroup of patients whom the intervention might benefit most (Badura-Brack et al., Reference Badura-Brack, Naim, Ryan, Levy, Abend, Khanna and Bar-Haim2015; Bomyea et al., Reference Bomyea, Stein and Lang2015). Identifying the subgroup of patients who benefit most from EF training in terms of both cognitive and symptomatic improvement is a critical step toward maximizing the efficacy of this intervention for patients with PTSD.
Accordingly, the goal of the present study was to identify, validate, and characterize the subgroup of PTSD patients who respond to cognitive training for EF. Here, we examined data from a recently completed randomized clinical trial of Veterans with PTSD undergoing working memory training (WMT). WMT is a commonly used method to enhance EF (Karbach & Verhaeghen, Reference Karbach and Verhaeghen2014; Morrison & Chein, Reference Morrison and Chein2011; Salminen, Strobach, & Schubert, Reference Salminen, Strobach and Schubert2012) and is particularly relevant to PTSD given that concentration issues (which reflects poor WM) are a PTSD symptom (APA, 2013) and because deficits in WM systems are thought to influence management of other PTSD symptoms (Norte, Vargas, & de Carvalho Silveira, Reference Norte, Vargas and de Carvalho Silveira2022). PTSD subjects also tend to exhibit lower working memory capacity (Honzel, Justus, & Swick, Reference Honzel, Justus and Swick2014; Nejati, Salehinejad, & Sabayee, Reference Nejati, Salehinejad and Sabayee2018) and be more affected by emotional distractors during WM tasks (Morey et al., Reference Morey, Dolcos, Petty, Cooper, Hayes, LaBar and McCarthy2009; Schweizer & Dalgleish, Reference Schweizer and Dalgleish2016), which putatively makes detaching from trauma-related memories and emotions more difficult. There is also preliminary evidence that training-related improvements in WM lead to improvements in PTSD symptoms (Bomyea & Amir, Reference Bomyea and Amir2011; Larsen et al., Reference Larsen, Lotfi, Bennett, Larson, Dean-Bernhoft and Lee2019), thus further supporting the role of WM deficits in maintaining PTSD symptoms.
In this study, we specifically investigated (a) whether there was a distinct subgroup who responded to WMT based on improved working memory scores across training (i.e. identification); (b) whether this group also showed improvements in a non-training working memory task and in PTSD symptoms across training (i.e. external validation); and (c) whether this group could be adequately identified with baseline demographic, behavioral, and neural data (i.e. characterization). We hypothesized that there would be a distinct subgroup of PTSD patients who exhibit improved working memory and PTSD symptoms across training and further hypothesized that this subgroup would be characterized by low working memory scores and attenuated activation in working memory-related neural areas at baseline, consistent with WMT working best for patients with deficiencies in its putative treatment target.
Materials and methods
Participants
Participants were Veterans recruited from the VA San Diego Healthcare System. Qualifying participants were aged 21–65 with a current PTSD diagnosis, no history of severe mental illness (e.g. psychosis) or traumatic brain injury, and no recent severe substance use or suicidality, among other qualifications (see Supplementary Section 2.1 for full inclusion/exclusion criteria). N = 79 participants were eligible, enrolled, and attended the minimum number of training sessions (2). Of these 79 participants, n = 45 attended the functional magnetic resonance imaging (fMRI) session and had usable fMRI data while n = 56 completed the full neuropsychological battery at pretreatment. Of note, the relatively lower sample size for the fMRI scan and neuropsychological assessment was due to the study overlapping with the COVID-19 pandemic for two years, which prevented access to these in-person visits. Full information on participant flow can be found in the CONSORT diagram (Figure 1S of the Supplementary Material) and in Section 2.1 of the Supplementary Material. Demographics and clinical characteristics of the final sample can be found in Table 1S of the Supplementary Material.
Procedure
Following an initial eligibility screening, participants completed a baseline intake appointment where they completed self-report scales, neuropsychological tasks, and interviews to confirm full eligibility (see Supplementary Section 2.2 for specifics), and an fMRI scan. Following the baseline assessment, participants were assigned to 16 sessions of WMT occurring twice weekly for eight weeks. Each WMT session involved repeated completion of a modified complex reading span (R-SPAN) task, which required the participant to remember presented stimuli while doing a secondary problem-solving task. Participants were randomized to one of two WMT conditions: low interference control (LIC), which included both letter and number stimuli, and high interference control (HIC), which included only letters, thereby increasing proactive interference during recall (see Bomyea et al., Reference Bomyea, Stein and Lang2015 and Supplementary Section 2.3 for more details about WMT protocol and LIC/HIC conditions). Differences between LIC and HIC groups were not examined in this study since (a) our hypotheses were about WMT in general and not interference control specifically, (b) the LIC and HIC were identical in all other key parameters related to WMT (e.g. identical stimulus spans), and (c) because the identified WMT subgroups (see Results Section ‘WMT subgroup identification’) were already relatively small (lowest n = 16) and further dividing analyzed cells into treatment conditions would further reduce statistical power. However, we did examine differences in LIC/HIC randomization in the identified subgroups to ensure such subgroups were not merely a recapitulation of treatment conditions. A brief set of self-report questionnaires, including an assessment of PTSD symptoms (PCL-5), was conducted midway through treatment (i.e. following session 8) and at the conclusion of treatment (i.e. following session 16). Participants also completed the same working memory assessments from baseline at posttreatment. All participants provided written informed consent, and the trial protocol was approved by the VA San Diego Institutional Review Board. The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.
Measures
Working memory
Changes in working memory across treatment were assessed with the modified R-SPAN task used to train working memory. The R-SPAN task instructs participants to remember stimuli while simultaneously completing a processing task. At the start of each trial, a fixation cross appears for 500 ms. Next, the screen displays a processing task (e.g. ‘Jane walks her car in the park’). After the processing task, the screen displays a question about the task (e.g. ‘does this sentence make sense?’) and prompts the participant to choose ‘True’ or ‘False’. The processing task is followed by the presentation of a to-be-remembered-item for 500 ms. The subsequent trials follow the same pattern – the presentation of a processing task followed by an item. At the end of a set, a recognition screen is displayed with twelve items and participants were instructed to choose and arrange previously displayed to-be-remembered items in serial order. Then, new trials began and followed the same pattern. Sets contain between two and six trials (i.e. between 2 and 6 to-be-remembered items out of the 12 total items presented on the recognition screen). Each session contained three blocks of 15 sets. The scanner R-SPAN task was identical to the treatment R-SPAN except that it included sets of two, four, or six letters or numbers, with each set size being repeated seven times in pseudorandom order Correct responses were separately tabulated for each of the three runs during the session (range per run = 0–60), and the highest R-SPAN total served as the participant’s score for that session as a reflection of maximum working memory ability (for a detailed description of this task see Supplementary Sections 2.3.2. and 2.4.3.2).
PTSD symptoms
Changes in PTSD symptoms were captured using the PTSD Checklist for DSM-5 (PCL-5; Blevins et al., Reference Blevins, Weathers, Davis, Witte and Domino2015), which was administered at baseline, mid-treatment, and posttreatment.
Neurocognition
Identified WMT subgroups were compared on both neurocognitive measures meant to assess EF and clinical and demographic variables that might affect cognition. Neurocognitive baseline variables related to EF included: verbal working memory ability (R-SPAN) (Bomyea, Taylor, Spadoni, & Simmons, Reference Bomyea, Taylor, Spadoni and Simmons2018), spatial working memory ability (symmetry span [S-SPAN]; Unsworth, Reference Unsworth2010), cognitive flexibility (letter-number switching raw score from Delis–Kaplan EF System Trail Making Test [D-KEFS TMT]; Delis, Kaplan, & Kramer, Reference Delis, Kaplan and Kramer2001), and response inhibition (switching-inhibition raw score from the D-KEFS color word interference test [D-KEFS CWI]). We also assessed two processing speed variables – visual scanning from the D-KEFS TMT and word reading from the D-KEFS CWI – to assess whether differences between WMT subgroups on EF variables were specific to EF or related to cognition more generally.
Clinical and demographic variables
Clinical baseline variables included: insomnia (insomnia severity index; Morin, Belleville, Bélanger, & Ivers, Reference Morin, Belleville, Bélanger and Ivers2011), depression (Beck Depression Inventory-II [BDI-II]; Beck, Steer, & Brown, Reference Beck, Steer and Brown1996), time (in months) since trauma, and TBI history classification using established TBI guidelines (O’Neil et al., Reference O’Neil, Carlson, Storzbach, Brenner, Freeman, Quiñones and Kansagara2013). We also examined age differences between WMT subgroups given its potential role in neurocognition, and number of sessions attended to assess whether differences in response to training were driven by engagement in training.
Neural data
Participants were scanned in a 3-Tesla GE 750 scanner using a 32-channel head array coil. The scanning session included collection of a three-plane scout scan, of a sagittally acquired sequence for acquiring T1-weighted images (parameters: FOV 256 mm; matrix: acquisition matrix =256 × 256 with 0 mm gap; 208 slices; thickness: 1 mm; TR 6900 ms, TE = 2.9 ms, flip angle: 8°) and one T2*-weighted axially acquired multiband echo-planar imaging scan to measure blood oxygen level dependent (parameters: FOV = 216 cm, acquisition matrix = 90 × 90 with a 0 mm gap; TR = 800 ms; TE = 30 ms; flip angle = 52° [whole brain]).
Statistical analyses
WMT subgroups were identified using growth mixture modeling (GMM), a technique used to identify unobserved subsamples in a dataset based on differences in longitudinal change on one or more outcomes (Ram & Grimm, Reference Ram and Grimm2009). GMMs were run using the hlme command within the lcmm package in R (Proust-Lima, Philipps, & Liquet, Reference Proust-Lima, Philipps and Liquet2017). Session-by-session peak R-SPAN scores served as the dependent variable while session number (1–16) served as the within-subject predictor variable. Bayesian Information Criterion (BIC) values were examined for model fit. We first identified the best-fitting (lowest BIC) model in terms of the number of latent subgroups (between 1 and 4), random effects (none, intercept, or intercept and slope) and slope shapes (linear and logarithmic). The best-fitting model was then selected for interpretation of subgroup-specific intercepts and slopes (see Supplementary Section 2.4.1 for more information). To ensure generalizability of the solution, we also reran the model including only participants who completed the baseline neuropsychological assessment (n = 56).
For subgroup validation, we utilized mixed-effects models (MEMs) to test whether the extracted GMM subgroups exhibited different patterns of change on two exogenous outcomes of interest: spatial working memory (S-SPAN) and PTSD symptoms (PCL-5). We examined changes in S-SPAN scores across WMT subgroups to verify that subgroup differences on the training task (R-SPAN) generalized to a working memory task from a different modality (S-SPAN). Changes in PCL-5 scores between WMT subgroups tested whether group differences in WM improvement translated into differences in PTSD improvement. We also repeated the same model with BDI-II scores to verify that any significant results for PCL-5 were specific to PTSD and not shared with depression. Each MEM included WMT subgroup, time, and their interaction, which tested whether the extracted subgroups showed different slopes of spatial working memory change and PTSD change across training. Random effects (intercept and slope) and slope shapes (linear vs. logarithmic) were selected according to how much they lowered BIC values. S-SPAN was only measured at pre- and post-WMT so logarithmic slopes could not be tested. Since WMT subgroup was a categorical variable, MEMs were tested with contrast coding in which the group with the greatest R-SPAN improvement during training was treated as the reference group. A significant Group × Time interaction for PCL-5 total was followed up by examining the same model for the four PCL-5 subscales. For PCL-5 subscale models, we applied a Benjamini–Hochberg (BH) correction for multiple comparisons (Benjamini & Hochberg, Reference Benjamini and Hochberg1995), wherein p values are ranked from smallest to largest and evaluated at a threshold of α×(I/M), where α is the original significance threshold (.05), I is the p value ranking, and M is the total number of tests.
For subgroup characterization, we compared the WMT subgroups on baseline variables of interest (see Section ‘Measures’ for full lists) using separate one-way ANOVAs and a BH correction for multiple comparisons. Chi square tests were used to examine differences between WMT subgroups in randomization to LIC versus HIC, since this test involved count data.
Neural data were processed utilizing Analysis of Functional NeuroImages and Advanced Normalization Tools Software in R (see Supplementary [Section: 2.4.3.2 Neural data] for a detailed description of fMRI data processing pipeline and control for multiple corrections). To analyze neural functioning during the R-SPAN task, response regressors were created for each phase of the task. Three phases were determined by the unique timing of presentation for each phase for each participant, including sentence reading and verification, stimuli encoding, and stimuli recall. Neural baseline differences between WMT subgroups were analyzed using a 3dLME of the R-SPAN task encoding phase weighted according to set size (i.e. the number of stimuli presented in a single trial), within an a priori defined working memory mask using neuroanatomical atlases of regions relevant to working memory (NeuroSynth v5 ‘working memory’ Topic 045). To further evaluate the clinical significance of these regions, extractions from each ROI were used as predictors of PTSD symptom changes across training using an identical MEM as was used to test the effects of WMT subgroup. These tests were also subject to BH correction for multiple comparisons based on the number of tested ROIs.
Results
WMT subgroup identification
The best-fitting model included a random intercept and slope, a logarithmic slope shape, and three latent subgroups (see Table 2S for full model statistics). The latent subgroups extracted from the best-fitting model included: (1) a high-WM/shallow improvement subgroup (54.5% of the total sample), characterized by a high intercept (b = 51.97, 95% CI = [50.03, 53.91], p < .001) and a shallow but significant positive slope (b = 1.59, 95% CI = [0.97, 2.21), p <. 001); (2) a low-WM/steep improvement subgroup (20.3% of the sample), characterized by a low intercept (b = 29.21, 95% CI = [22.99, 35.43], p < .001) and a steep positive slope (b = 8.03, 95% CI = [6.35, 9.71], p < .001); and (3) and a low-WM/no improvement subgroup (25.2% of the sample), characterized by a low intercept (b = 29.65, 95% CI = [22.01, 37.29], p < .001) and a nonsignificant negative slope (b = −1.28, 95% CI = [−2.98, 0.42], p = .131). Rerunning the analysis including only those who completed the baseline neuropsychological assessment yielded virtually identical findings, including the same best-fitting model and the same subgroup parameters (see Supplementary Table 2S for complete statistics). A fitted value plot of session-by-session R-SPAN score changes for each WMT subgroup can be found in Figure 1.

Figure 1. Fitted value plot of working memory trajectories across WMT sessions for each GMM subgroup. Shaded areas represent the 95% CI of the trajectory. R-SPAN highest total score is the highest R-SPAN score among the three R-SPAN runs that were completed at each session. Subgroups were extracted from the best-fitting GMM of session-by-session highest R-SPAN scores across the 16 WMT sessions. Note: WMT, ‘Working memory training’; R-SPAN, ‘Reading Span’; GMM, ‘Growth mixture model’; 95% CI, ‘95% confidence interval’.
WMT subgroup validation
The S-SPAN MEM revealed a significant interaction between time (pre- and post-training) and WMT subgroup, F = 3.28, p = .044, which follow-up analyses revealed was driven by significantly steeper improvement in the low-WM/steep improvement subgroup relative to both the low-WM/no improvement subgroup (b = −3.20, 95% CI = −6.05, −0.35, p = .029) and the high-WM/shallow improvement subgroup (b = −2.83, 95% CI = −5.22, −0.44, p = .021). Thus, the low-WM/steep improvement subgroup exhibited superior working memory improvement across training relative to the other two subgroups on both the verbal WMT task and a non-training spatial working memory task (see Figure 2 for fitted value plots).

Figure 2. Fitted value plot of (a) spatial working memory [S-SPAN] and (b) PTSD symptoms [PCL-5] across WMT for each WMT subgroup. Shaded areas represent the 95% CI of the trajectory. Subgroups were extracted from the best-fitting GMM of session-by-session highest R-SPAN scores across the 16 WMT sessions. Note: S-SPAN, ‘Symmetry span’; PCL-5, ‘PTSD Checklist for DSM-5’; WMT, ‘Working memory training’; R-SPAN, ‘Reading Span’; GMM, ‘Growth mixture model’; 95% CI, ‘95% confidence interval’.
The PCL-5 MEM was best characterized with a logarithmic slope and the Log time × WMT subgroup interaction was significant at the level of a trend (p = .056). Follow-up analyses revealed that the low-WM/steep improvement subgroup exhibited a significantly steeper reduction in PCL-5 scores relative to the low-WM/no improvement subgroup (b = 10.00, 95% CI = 1.77, 18.21, p = .018) but not the high-WM/shallow improvement subgroup (b = 4.28, 95% CI = −2.68, 11.23, p = .223). Nonetheless, the overall pattern of results suggested that subgroups with greater working memory improvement showed greater PTSD improvement. In contrast, none of the Group × Time interactions for the PCL-5 subscales reached statistical significance after applying a BH correction for multiple tests (p > .018). Similarly, no significant WMT Subgroup × Log Session interactions were found for BDI-II, indicating changes in depression symptoms across WMT did not differ significantly between subgroups (ps > .130). Comparison of model fit statistics for each candidate MEM can be found in Table 3S of the Supplementary Material. Coefficients in the best-fitting MEMs used to predict changes in S-SPAN and PCL-5 from WMT subgroup can be found in Table 4S of the Supplementary Material.
WMT subgroup characterization
Self-report and neuropsychological data
One-way ANOVAs revealed that WMT subgroups differed on all EF-related measures at baseline, including R-SPAN, F = 37.30, η2 = 0.50, p < .001, S-SPAN, F = 20.03; η2 = 0.35, p < .001, TMT switching, F = 8.65, η2 = 0.25, p < .001, and CWI inhibition/switching, F = 9.20, η2 = 0.26, p < .001. Post hoc pairwise comparisons revealed that these differences were, in almost all cases, driven by the high-WM/shallow improvement subgroup scoring significantly higher than the other two subgroups (ps < .021), which not differ from each other (ps > .559). The lone exception was that the high-WM/shallow improvement and low-WM/steep improvement subgroups did not significantly differ on TMT switching (p = .090). No other differences in baseline variables between subgroups emerged after correcting for multiple comparisons (ps > .030). Full statistics for all ANOVAs comparing subgroups on baseline measures can be found in Table 1.
Table 1. Differences between WMT subgroups on demographics and baseline variables

Note: Education was self-reported on a 7-point ordinal scale spanning 1 (did not finish high school) to 7 (graduate degree). TBI severity levels were computed from TBI history questions and ranged from 0 (none) to 3 (severe). WMT subgroups were formed based on differences in working memory changes across WMT. Comparisons were evaluated at a BH-corrected significance threshold based on the number tested covariates (14).
Abbreviations: WMT, Working memory training; PCL-5, PTSD Checklist for DSM-5; ISI, Insomnia Severity Index; BDI-II, Beck Depression Inventory II; TBI, Traumatic brain injury; R-SPAN, Reading span; S-SPAN, Symmetry span; TMT Scanning, Trail Making Test visual scanning subset; TMT Switching, Trail Making Test letter-number switching subtest; CWI Color Naming, Color Word Interference color naming subtest; CWI Inhibit/Switch, Color Word Interference, Inhibition/Switching subtest. N = 79.
a Categorical variable tested with chi-squared (χ2 and V statistics).
b Measure available for only a subset of participants (n = 56).
Overall, baseline measures of neuropsychological functioning were able to successfully distinguish the high-WM/shallow improvement group from the other two subgroups (i.e. higher EF in the high-WM/shallow improvement subgroup) but were not able to distinguish the other two subgroups (low-WM/steep improvement and low-WM/no improvement subgroups) from each other. Because the high-WM/shallow improvement could be readily distinguished from the other two subgroups using neuropsychological testing, we did not include this subgroup in our analyses of neural differences, as fMRI is only clinically justifiable in cases in which more accessible measures fail to yield phenotypic differences.
Neuroimaging data
Within the a priori defined working memory mask, the low-WM/steep improvement subgroup exhibited significantly lower neural activity during R-SPAN encoding in five ROIs relative to the low-WM/no improvement subgroup (ps < .004; see Figure 3 for images of neural regions and Table 5S for coordinates). These ROIs spanned the bilateral middle frontal gyrus (MFG), left precentral gyrus (PCG), left PCG extending into the left MFG, and the left IFG. To verify that this pattern of low neural responding in WM regions was unique to the low-WM/steep improvement subgroup, we conducted a supplemental analysis comparing the low-WM/steep improvement subgroup to both other subgroups. This analysis revealed differences in 13 ROIs (see Table 6S for full list and coordinates), with five of seven largest ROIs being the same as those found in the two-group analysis (i.e. bilateral MFG, left PCG, left PCG extending into the left MFG, and left IFG). Pairwise comparisons for these regions again showed lower activation in the low-WM/steep improvement subgroup compared to the other two subgroups, which did not differ from each other (see Figure S3 of the Supplement for an illustration of subgroup differences). Hence, findings from the three-group analysis confirmed that lower neural activation was unique to the low-WM/steep improvement subgroup.

Figure 3. Standardized estimates of neural activation (z-score) in the right middle frontal gyrus (MFG), left precentral gyrus (PCG), left PCG extending into left MFG, the left MFG, and the right inferior frontal gyrus (IFG) during the stimulus encoding portion of the baseline R-SPAN task. Bars represent activation in the low-WM/steep improvement subgroup (blue) relative to the low-WM/no improvement subgroup (orange).
MEMs predicting PCL-5 changes revealed significant interactions with log time for activity in each ROI (ps < .026; see Table 7S of the Supplementary Material for full statistics). In each case, lower activity in the ROI predicted a steeper reduction in PCL-5 scores across WMT. A fitted value plot depicting PCL-5 changes across training for participants with high versus low activity in the largest ROI (right MFG) can be found in Figure 4. Since effects were similar for the other ROIs, their corresponding fitted value plots were not constructed.

Figure 4. Fitted value plot of PCL-5 trajectories across pre-, mid-, and post-WMT for participants exhibiting high versus low baseline neural activity in the right MFG at baseline. Shaded areas represent the 95% CI of the trajectory. Models were evaluated in the WMT low-WM/no improvement and low-WM/steep improvement subgroups only. Neural activity was measured during encoding phase of the R-SPAN task at baseline. High and low activity reflect estimated neural activation at +1SD and −1SD below the mean, respectively. Note: PCL-5, ‘PTSD Checklist for DSM-5’; WMT, ‘Working memory training’; MFG, ‘Middle frontal gyrus’; 95% CI, ‘95% confidence interval’; SD, ‘standard deviation’.
Discussion
The purpose of this study was to identify, validate, and characterize the subgroup of PTSD patients who respond to WMT. Results confirmed the existence of a distinct subgroup of patients who exhibited steep improvement in working memory across training, as well as two less responsive subgroups – one group that began with high working memory and showed only modest improvement and another group that began with low working memory and showed no improvement. The low-WM/steep improvement subgroup also exhibited greater improvement in spatial working memory compared to the other two subgroups and greater improvement in PTSD symptom improvement relative to the low-WM/no improvement subgroup. At baseline, the low-WM/steep improvement subgroup could be distinguished from the high-WM/shallow improvement subgroup by relatively lower EF across most baseline measures and from the low-WM/no improvement subgroup by lower activity in brain areas linked to working memory (MFG, left PCG, left IFG), which also predicted steeper PTSD improvement across training. Overall, these findings highlight the potential of WMT to improve working memory and PTSD symptoms in a distinct minority of patients who exhibit lower EF and lower working memory-related neural activity prior to training.
Findings from the present study are broadly consistent with an experimental therapeutics model in which a treatment’s efficacy is maximized when it is delivered to patients with the deficit it is designed to target (Gordon, Reference Gordon2017). Patients who responded best to WMT generally exhibited lower baseline levels of the target process on both a neurocognitive level (lower EF performance) and a neural level (lower activity in brain areas considered integral to working memory). Although the subgroup with superior baseline EF (high-steep improvement) exhibited some evidence of improvement in working memory and PTSD symptoms, progress was overall modest. Higher baseline cognitive performance has also been linked to reduced cognitive improvement in other cognitive training studies (Matysiak, Kroemeke, & Brzezicka, Reference Matysiak, Kroemeke and Brzezicka2019; Rahe et al., Reference Rahe, Becker, Fink, Kessler, Kukolja, Rahn and Kalbe2015; van der Donk et al., Reference van der Donk, Hiemstra-Beernink, Tjeenk-Kalff, van der Leij and Lindauer2020), perhaps reflecting an EF ceiling effect that impedes more substantial improvement. Conversely, although the low-WM/no improvement subgroup exhibited low EF at baseline, they exhibited greater engagement of the treatment target on a neural level compared with the low-WM/steep improvement (i.e. relatively higher activity in working memory-related regions). That this subgroup showed greater activity than the low-WM/steep improvement group in areas underlying working memory but still exhibited similarly poor working memory performance could indicate that they had already reached their full working memory potential, thereby explaining their subsequent lack of improvement during WMT. However, it is also possible that the lack of improvement in this group stemmed from other neural differences that we did not measure in this study. These differences could include differences in functional connectivity, either within working memory areas or between working memory and nonworking memory-related areas, or differences in neural activation involving areas that were not included in our analysis, either because they were too small or because they were outside of our predefined working memory mask.
Importantly, subgroups that exhibited greater WM improvement also tended to show more improvement on PTSD symptoms, echoing findings of prior training paradigms in which training-related enhancement of EF mediated improvement in PTSD symptoms (Badura-Brack et al., Reference Badura-Brack, Naim, Ryan, Levy, Abend, Khanna and Bar-Haim2015). This finding suggests that enhancing working memory-related brain activity might improve top-down control of intrusive trauma-related memories and emotions (Aupperle et al., Reference Aupperle, Melrose, Stein and Paulus2012). Indeed, improvement in top-down control has been linked to PTSD improvement in both traditional and novel PTSD treatments (Adenauer et al., Reference Adenauer, Catani, Gola, Keil, Ruf, Schauer and Neuner2011; Shaw et al., Reference Shaw, Nicholson, Ros, Harricharan, Terpou, Densmore and Lanius2023). Furthermore, lower activity in frontal brain areas thought to be involved in both working memory and top-down control have been shown to predict greater PTSD symptom improvement among subgroups with low baseline EF (Nicholson et al., Reference Nicholson, Rabellino, Densmore, Frewen, Paret, Kluetsch and Lanius2017). Accordingly, impoverished baseline activity in areas relevant to top-down control may afford the opportunity to improve engagement in those areas during training, facilitating a reduction in PTSD symptoms.
Clinically, our findings have at least two important implications. The first is that WMT could offer an alternative treatment option for PTSD if delivered to the right patients. The low-WM/steep improvement subgroup was estimated to experience a 12-point decline on the PCL-5, which is above the threshold for reliable change (Blanchard et al., Reference Blanchard, Johnson, Campbell, Reed, Chen, Heagerty and Fortney2023; Marx et al., Reference Marx, Lee, Norman, Bovin, Sloan, Weathers and Schnurr2022). Although this degree of symptom reduction is similar to gold standard PTSD therapies and was experienced by a smaller proportion of patients (~20% vs. ~50% in gold standard PTSD therapies) (Eftekhari et al., Reference Eftekhari, Ruzek, Crowley, Rosen, Greenbaum and Karlin2013), it was accomplished with a less time- and resource-intensive treatment and with a subgroup of patients (i.e. those with low EF) who tend not to respond to traditional PTSD therapies (Keyan et al., Reference Keyan, Garland, Choi-Christou, Tran, O’Donnell and Bryant2024). Thus, although further research is required, our findings preliminarily implicate WMT as a promising alternative for PTSD patients with low EF who might not otherwise respond to traditional therapies. The second implication is that successfully identifying patients who will respond to WMT may require both neuropsychological and neuroimaging assessments. At baseline, neuropsychological testing could not adequately distinguish the groups with the most divergent outcomes (i.e. low-WM/steep improvement and low-WM/no improvement subgroup), as they could only be differentiated when comparing their working memory-related brain activity. Although neuroimaging largely remains cost prohibitive for universal clinical use, a less costly and invasive tool (e.g. EEG) could potentially provide similar mechanistic clarity of poor EF performance while still allowing successful WMT candidates to be identified. Regardless, our results indicate that WMT may be best suited for those exhibiting deficits in working memory and/or EF prior to therapy, as about 50% of patients with low WM/EF scores demonstrated a strong therapeutic response. Future studies of this training protocol should consider restricting the study to PTSD patients exhibiting lower WM and/or EF scores.
The present study is not without limitations. First, our sample size was relatively small, particularly for neurocognitive outcomes and subgroup sizes. Second, PTSD symptoms were not assessed intensively during training, which precluded us from verifying that working memory changes facilitated PTSD changes (as opposed to the other way around). Of note, our training intervention involved repeated completion of a working memory task and contained no techniques that directly targeted other cognitive or behavioral mechanisms, making it more likely that working memory changes facilitated PTSD changes rather than vice versa. Third, our study did not include a control condition that involved an intervention other than WM training (e.g. a psychosocial intervention, a cognitive training targeting a different domain), so we cannot fully rule out the possibility that the low WM/steep improvement subgroup is a generally responsive group rather than specifically responsive to WM training. Poor baseline neurocognitive functioning typically predicts poorer responses to PTSD treatment (Crocker et al., Reference Crocker, Jurick, Thomas, Keller, Sanderson-Cimino, Boyd and Jak2018; Tanev et al., Reference Tanev, Federico, Greenberg, Orr, Goetter, Resick and Pitman2020; Van Praag et al., Reference Van Praag, Van Den Eede, Wouters, Wilson and Maas2021), which contradicts the explanation that the low-WM/steep improvement subgroup would have the best response; however, the steep improvement in this group also suggests a higher propensity for learning and adaptation, which theoretically could make them more responsive to other treatment modalities (Cenkner et al., Reference Cenkner, Asnaani, DiChiara, Harb, Lynch, Greene and Scott2021). Thus, continued work is needed to determine whether the observed effects are global or specific to WM training. Relatedly, we cannot conclude that the steep changes observed in this group were due to the WM training versus elements shared by other interventions. While the lack of robust cognitive gains reported from other interventions deems this explanation less likely (Susanty et al., Reference Susanty, Sijbrandij, van Dijk, Srisayekti, de Vries and Huizink2022), it nonetheless requires empirical investigation. Fourth, we did not have enough posttreatment EF data to investigate whether training-related improvements in working memory generalized to other EF domains, which should be verified in future studies. Finally, the difficulty of our WMT procedure was not adaptive. This lack of adaptability may have impeded the opportunity for improvement in the high-WM/shallow improvement group (i.e. by making the training more challenging) and the low-WM/no improvement group (i.e. by making the training less challenging). Notably, the low-WM/no improvement group had similar working at baseline as the low-WM/steep improvement group, suggesting that adapting the training based on initial working memory would not have made a difference. However, adaptation that continues throughout training could allow greater success for patients who are failing to improve, which should be a focus of future research into the effectiveness and prediction of WM training.
Despite these limitations, the present study still offers promising evidence that WMT could be a viable alternative for PTSD patients with low EF and that these patients might be identified using a combination of neuropsychological testing and neuroimaging. Future research should explore whether patients with these same neurocognitive and neural markers show poorer responses to traditional PTSD therapies and whether these markers can be used to successfully match PTSD patients to WMT in a prospective manner. Such work would help verify the clinical utility of WMT as a viable alternative for PTSD patients who tend not to respond to traditional therapies.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S0033291726103298.
Data availability statement
Requests for de-identified data, analysis code, and research materials should be made to Jessica Bomyea, jbomyea@health.ucsd.edu.
Funding statement
This study was funded through a career development award from VA CSR&D awarded to JB (IK2CX001600). Individual author’s contributions were supported by grant funding from VA CSR&D (1IK2CX002640, CH; CX002485, JB), VA RR&D (RX003793, JB), VA BLR&D (I01-BX005918; AS), and by the VA Center of Excellence for Stress and Mental Health (CESAMH; CH, JB, AJ, AS).
Competing interests
The authors declare none.


