Introduction
Cognitive behavior therapy is the main evidence-based psychological treatment for paranoid ideation in patients with psychotic disorders. International treatment guidelines recommend that all patients with psychotic disorders should be offered cognitive behavior therapy for paranoid ideation (CBTp) in addition to antipsychotic medication (GGZ Standaarden, 2017; Haddock et al., Reference Haddock, Eisner, Boone, Davies, Coogan and Barrowclough2014). The efficacy of CBTp for psychotic symptoms, however, is modest, with estimated effect sizes of 0.24–0.49 (Bighelli et al., Reference Bighelli, Salanti, Huhn, Schneider-Thoma, Krause, Reitmeir, Wallis, Schwermann, Pitschel-Walz, Barbui, Furukawa and Leucht2018; Van der Gaag, Valmaggia, & Smit, Reference Van der Gaag, Valmaggia and Smit2014). For paranoid delusions specifically, a meta-analysis found a mean effect size of 0.27 compared to treatment as usual (Mehl, Werner, & Lincoln, Reference Mehl, Werner and Lincoln2015). New developments in CBTp interventions may enhance efficacy, such as targeting factors like worry rather than delusions directly (Freeman, Emsley, et al., Reference Freeman, Emsley, Diamond, Collett, Bold, Chadwick, Isham, Bird, Edwards, Kingdon, Fitzpatrick, Kabir, Waite, Carr, Causier, Černis, Kirkham, Lambe, Lister and Twivy2021) or virtual reality (VR)-CBTp (Salahuddin et al., Reference Salahuddin, Schütz, Pitschel-Walz, Mayer, Chaimani, Siafis, Priller, Leucht and Bighelli2024). Initial randomized controlled trials (RCTs) of VR-CBTp found large effect sizes (Freeman et al., Reference Freeman, Bradley, Antley, Bourke, DeWeever, Evans, Černis, Sheaves, Waite, Dunn, Slater and Clark2016; Pot-Kolder et al., Reference Pot-Kolder, Geraets, Veling, Staring, Gijsman, Delespaul and van der Gaag2018) but did not compare VR-CBTp with standard CBTp.
VR-CBTp may be more effective than standard CBTp because it facilitates faster and more consistent behavior interventions aimed at reducing avoidance and dropping safety behaviors, factors that have been shown to be critical for treatment effect. Specifically, VR facilitates controllable and repeated exposure on demand, lowers the threshold for patients to engage in behavior exercises, and is likely to increase the time spent on behavior interventions in sessions. According to the cognitive model of paranoia, paranoid ideation is maintained by safety behaviors such as avoidance of potentially threatening events or maladaptive coping behavior as an active effort to reduce the threat during perceived threatening events (Freeman, Reference Freeman2016). Therefore, dropping safety behavior is a key treatment target to reduce negative appraisal and fear by experiencing no bad consequences in perceived threatening situations. In a previous RCT of VR-CBTp by our group, a decrease in safety behavior but not cognitive biases was related to a reduction of paranoid ideation (Pot-Kolder et al., Reference Pot-Kolder, Geraets, Veling, Staring, Gijsman, Delespaul and van der Gaag2018). Also, safety behavior was the single moderator of the effect of VR-CBTp (Berkhof et al., Reference Berkhof, van der Stouwe, Pot‐Kolder, van der Gaag, Veling and Geraets2024). Standard CBTp generally requires starting with several sessions of cognitive interventions to reduce the conviction of paranoid thoughts to enhance motivation to start behavior interventions. With the possibility to practice safely in VR environments, tailor the VR environments to individual needs, and have complete control over what happens in VR, the threshold for engaging in exposure and behavior experiments is lower in VR-CBTp.
These characteristics of VR-CBTp may not only result in a stronger effect on paranoid ideation compared to CBTp, but they may also increase treatment efficiency. The minimal treatment duration of CBTp for paranoid ideation is currently unclear. A meta-analysis on brief CBTp (defined as 6–10 sessions) reported beneficial effects and concluded that the number of sessions did not moderate treatment outcomes (Naeem et al., Reference Naeem, Khoury, Munshi, Ayub, Lecomte, Kingdon and Farooq2016). However, a more recent individual participant data meta-analysis demonstrated that patients who received a higher number of CBTp sessions had lower total psychotic symptoms after treatment than those who received fewer sessions (Turner et al., Reference Turner, Reijnders, van der Gaag, Karyotaki, Valmaggia, Moritz, Lecomte, Turkington, Penadés, Elkis, Cather, Shawyer, O’Connor, Li, de Paiva Barretto and Cuijpers2020). The NICE guidelines in the United Kingdom recommend a minimum of 16 CBTp sessions, and the Dutch psychosis care guidelines suggest a range of 16–35 sessions (Haddock et al., Reference Haddock, Eisner, Boone, Davies, Coogan and Barrowclough2014). In clinical practice, there seems to be a considerable variation in the number of sessions delivering CBTp, which may be caused by a lack of clear evidence and the heterogeneity in the problems and goals of patients, but is also explained by variations in human and financial resources (Burgess-Barr et al., Reference Burgess-Barr, Nicholas, Venus, Singh, Nethercott, Taylor and Jacobsen2023). Given the limited availability of trained psychologists in mental healthcare, increasing treatment efficiency is crucial for improving access to psychological treatment for patients with a psychotic disorder.
We conducted a randomized clinical trial comparing VR-CBTp for paranoid ideation to standard CBTp (Berkhof et al., Reference Berkhof, van der Stouwe, Lestestuiver, van’t Hag, van Grunsven, de Jager, Kooijmans, Zandee, Staring, Pot-Kolder, Vos and Veling2021). The main aim was to investigate differences between VR-CBTp and CBTp in their effect on the level of paranoid ideation in daily life, the level of social activities, the proportion of time spent in social company, levels of distress, anxiety, and depression. In addition, we aimed to determine whether the number of required treatment sessions differs between CBTp and VR-CBTp. We hypothesized that VR-CBTp would show greater symptom reduction (i.e. paranoia in daily life) and larger effects on secondary outcomes (e.g. distress, depression, anxiety, self-esteem, and safety behavior). Furthermore, we expected VR-CBTp to reach these effects in fewer treatment sessions than CBTp.
Methods
Study design
We performed a pragmatic, single-blind, multicenter, randomized clinical superiority trial comparing two arms: VR-CBTp and CBTp (Berkhof et al., Reference Berkhof, van der Stouwe, Lestestuiver, van’t Hag, van Grunsven, de Jager, Kooijmans, Zandee, Staring, Pot-Kolder, Vos and Veling2021). Participants were outpatients recruited from six mental healthcare centers in the Netherlands and one in Belgium. The centers were dispersed over the Netherlands and Belgium, ensuring the recruitment of people from diverse environments, including rural, urban, and suburban environments. Therapists were employed at these centers and delivered the treatments in the trial as part of their regular clinical work. The trial was approved by the medical ethical committee of the University Medical Centre Groningen (number NL66850.042.18), and prospectively registered in the Dutch trial register (migrated to the Central Committee on Research Involving Human Subjects [CCMO] register) with trial number N7758.
Participants
Inclusion criteria were a Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, diagnosis of a schizophrenia spectrum or other psychotic disorder, age 18–65 years, having an indication for treatment by a clinician, and paranoid ideation verified by a minimum score of 40 in the Green et al. Paranoid Thoughts Scale (GPTS) (Green et al., Reference Green, Freeman, Kuipers, Bebbington, Fowler, Dunn and Garety2008). Exclusion criteria were an estimated IQ below 70, insufficient command of the Dutch language, and having received CBTp for paranoid ideation in the past 12 months.
Procedures
Patients were recruited through advertisements (posters and flyers) at the participating centers and through clinicians who informed and referred patients. If a patient was willing to participate, written informed consent was obtained, after which the GPTS was completed to check eligibility (GPTS score > 40).
Participants were randomly allocated after the baseline assessment. Block randomization was used, with blocks of eight assignments generated by www.randomizer.org for each center. Randomization was carried out by an independent coordinator of the University Medical Center Groningen. Participants could receive additional types of treatment as usual during the study period, including antipsychotic medication, except for any form of CBT.
Participants were assessed at baseline (t0), posttreatment within 2 weeks after completing treatment (t1), and at the 3-month follow-up (t2). Assessors were blinded to treatment allocation and had to be replaced in case of unblinding. Blinding was evaluated with a self-report form. In 12 out of 160 posttreatment and follow-up assessments, assessors indicated that they were unblinded (2 indicated the wrong group). Two assessors were replaced; one participant only wanted to be assessed by the unblinded assessor, and in nine cases, replacement was undesirable due to increased burden when an additional assessment had to be planned. Sensitivity analyses were performed, testing effects for interview outcomes where assessors were completely unblinded (see Supplement 1).
VR-CBTp and CBTp interventions
Both VR-CBTp and CBTp consisted of a maximum of 16 individual sessions delivered within 8–12 weeks, and sessions lasted a maximum of 75 min. For CBTp, the Dutch CBT protocol for paranoid ideation was used (www.gedachtenuitpluizen.nl). The VR-CBTp protocol was based on this protocol as well. The CBTp protocol starts with constructing individual case formulations in collaboration with the patient to create an understanding of the paranoid ideas, and related feelings and behavior (in the first two sessions). Further, the protocol consists of cognitive techniques, such as cognitive restructuring, and behavior interventions, including exposure and behavior experiments. It focuses strongly on exercises (and planning and reflection thereof) aimed at the reappraisal of paranoid beliefs to reduce distress and improve the coping of patients.
Therapy could be completed before 16 sessions in case goal behaviors were achieved, all parts of the safety circle were effectively treated, and participants scored 0 on the Subjective-Units-of-Distress scale for identified fear cues in two consecutive sessions. In addition, session measurements should show a decrease in paranoid ideation, distress, and avoidance. Before stopping, the case had to be discussed with a supervisor.
The main difference between the two protocols was that VR-CBTp exercises were performed in VR during sessions, for ±40 min per session (from session 3 onwards), using social situations that triggered paranoid ideation and distress. In VR, patients were guided by therapists to drop safety behaviors, test paranoid beliefs, and engage in new behavior. Exercises could be tailored to the participant’s needs and goals and could be repeated. In contrast, in vivo exposure and behavior exercises during CBTp started after session 7.
For VR-CBTp, the CE-marked Social Worlds software (CleVR, Delft, The Netherlands) was used to role-play personalized and interactive scenarios. The VR was offered with an Oculus Rift head-mounted display (Reality Labs, Menlo Park, USA), and patients navigated through VR environments using a controller. Multiple VR environments were used: a café, a shopping street, a supermarket, a bus ride, an office, and a living room. The difficulty level could be modified by adjusting the number, gender, gestures, and ethnic appearance of virtual characters (avatars) present in the VR social situations. Moreover, the level of hostility and suspicious behavior could be modified. During scenarios, therapists used a microphone with voice distortion to be able to speak as an avatar.
Both VR-CBTp and CBTp were delivered by psychologists with at least a postmaster qualification in CBT (100 h of accredited CBT training) and a minimum of 6 months of experience in psychosis treatment. Therapists were trained in both protocols during a 4-day training and supervised by experienced psychologists registered at the Dutch Association of Behaviour and Cognitive Therapies (VGCt). Individual case conceptualizations were guided and evaluated by a supervisor after session 2. Therapists participated in monthly 2-h group supervision sessions. Treatments were audio recorded, and two recordings per therapist were rated for treatment fidelity by author MH (psychologist) using the Cognitive Therapy Rating Scale. Both the session number and participant were assigned randomly to ensure unbiased selection for rating.
Outcomes
The primary outcome was momentary paranoia in daily life at posttreatment, measured with the experience sampling method (ESM). ESM is a structured diary method; participants completed diary questionnaires on their mobile phones at semi-random moments 10 times a day for 7 days. ESM affect items were scored on visual analogue scales (VAS) ranging from 0 (not at all) to 100 (very). Momentary paranoia was assessed by the following items: ‘I feel that others might hurt me’, ‘I feel that others dislike me’, and ‘I feel suspicious’. A minimum of 30% of the questionnaires had to be completed for analyses. Mean total scores were analyzed. We performed a principal component analysis with oblique rotation and Kaiser normalization on the person-centered items to confirm subscales from the literature. The negative effect, positive effect, and three-item momentary paranoia subscale (factor loadings of momentary paranoia ranged from 0.63 to 0.91) were confirmed.
Secondary ESM outcomes concerned the proportion of time in social company (‘With whom have I been since the previous measurement?’) and social activities, in terms of social threat both in the current moment and since the previous measurement. Social threat was assessed using the mean of the following items: (i) ‘I like this company [reversed score]’, (ii) ‘In this company, I feel accepted [reversed score]’, and (iii) ‘In this company, I feel unsafe’). To avoid/endure threat situations, the mean of the following items was assessed: (i) ‘Since the previous measurement, I have avoided situation or activities to avoid danger/fear’ and (ii) ‘Since the previous measurement, I have had difficulty coping with situations or activities that I could not avoid’).
Secondary interview outcomes were the Psychotic Symptom Rating Scales (PSYRATS) (Haddock, McCarron, Tarrier, & Faragher, Reference Haddock, McCarron, Tarrier and Faragher1999) and the Safety Behaviours Questionnaire–persecutory delusions (Freeman, Garety, & Kuipers, Reference Freeman, Garety and Kuipers2001). Self-report questionnaire outcomes were the GPTS (additionally, R-GPTS scores were calculated) (Freeman, Loe, et al., Reference Freeman, Loe, Kingdon, Startup, Molodynski, Rosebrock, Brown, Sheaves, Waite and Bird2021), Social Interaction Anxiety Scale (Mattick & Clarke, Reference Mattick and Clarke1998), Inventory of Depressive Symptomatology Self-Report (Rush et al., Reference Rush, Gullion, Basco, Jarrett and Trivedi1996), Self-Esteem Rating Scale–Short Form (Lecomte, Corbière, & Laisné, Reference Lecomte, Corbière and Laisné2006), Interpersonal Sensitivity Measure (Boyce & Parker, Reference Boyce and Parker1989), Brief Core Schema Scales (Fowler et al., Reference Fowler, Freeman, Smith, Kuipers, Bebbington, Bashforth, Coker, Hodgekins, Gracie, Dunn and Garety2006), Davos Assessment of Cognitive Biases (van der Gaag et al., Reference van der Gaag, Schütz, ten Napel, Landa, Delespaul, Bak, Tschacher and de Hert2013), social functioning (PSP) (Kawata & Revicki, Reference Kawata and Revicki2008), and Penn State Worry Questionnaire (Brown, Antony, & Barlow, Reference Brown, Antony and Barlow1992).
Statistical analyses
A previous RCT comparing VR-CBTp with waiting list reported a large effect size (ES) (d = 1.6) for our primary outcome, that is, momentary paranoia (Pot-Kolder et al., Reference Pot-Kolder, Geraets, Veling, Staring, Gijsman, Delespaul and van der Gaag2018). The ES was expected to be smaller with standard CBTp as a comparison; therefore, we aimed to detect an ES of 0.6. Assuming a power of 0.8, α of 0.05, and a standard deviation (SD) of 1.1, the sample size was calculated to be 106 to detect an ES of 0.6 between groups. While allowing a 15% attrition rate, the total sample size was calculated to be 122.
Data were analyzed by intention to treat with SPSS (version 29.0.1.0). A p-value < .05 was considered significant. The Mann–Whitney U test was done to analyze differences in the number of sessions. Multilevel regression analyses with a random intercept for participants, identity covariance structure, and maximum likelihood method for estimation were performed. The treatment effect was established with the group-by-time interaction for posttreatment and follow-up separately, as well as the main effects of group and time. Cohen’s d effect sizes were calculated (by dividing the estimated mean difference by the pooled SD) for the primary outcome and related paranoia measures (PSYRATS and GPTS).
Results
A total of 103 patients were enrolled between October 1, 2019, and July 31, 2023 (Figure 1). Three participants were excluded because of a GPTS < 40, and two withdrew consent before randomization. In total, 98 were randomized to VR-CBTp (n = 48) or CBTp (n = 50). Of the randomized patients, 27% were female, 73% were male, and the mean age was 35.8 years (SD = 2.6) (Table 1). Baseline characteristics were well-balanced between groups, although schizophrenia was more common in the VR-CBTp group.

Figure 1. Flowchart of the trial.
Table 1. Baseline characteristics

Note: Data are n (%) or mean (standard deviation).
A total of 18 participants (37.5%) dropped out of VR-CBTp and 12 (24.0%) of CBTp (Table 2). Of those, four in the VR-CBTp group never started therapy because of symptom reduction (n = 2), unwillingness (n = 1), and other treatment required (n = 1). Reasons for dropout during VR-CBTp were other treatment required (n = 3) lost contact/unknown (n = 3), treatment too intensive (n = 3), unwillingness (n = 2), assessment too intensive (n = 1), severe physical illness ( = 1), and exacerbated symptoms/no trust in care (n = 1). CBTp participants dropped out before the start of treatment due to imprisonment (n = 1), other treatment required (n = 2), assessment too intensive (n = 1), treatment too intensive (n = 1), and exacerbated symptoms (n = 1). During CBTp, people dropped out because of exacerbated symptoms/no trust in care (n = 3), other treatment required (n = 1), unwillingness (n = 1), and lost contact (n = 1).
Table 2. Treatment characteristics

Note: Data are n (%) or mean (standard deviation).
Treatment dropout was higher during the coronavirus disease 2019 (COVID-19) restriction period (March 2020–March 2022) for VR-CBTp (40.0% vs. 34.8%) and CBTp (25.9% vs. 21.7%). Participants who dropped out did not have higher baseline paranoia (PSYRATS and GPTS) than completers. When considering only treatment completers, VR-CBTp took 12.7 (SD 3.9) sessions and CBTp took 15.1 sessions (SD = 2.5, U = 738.5, p < .001). Of the completers, 56.7% of the VR-CBTp group were early completers, completing treatment before the 16th session, and for CBTp, this was 21.1%.
In total, 24 therapists participated in the trial, of whom 4 had no recordings. Of the 1,036 sessions, over three-fifths had recordings. A total of 40 sessions (2 per therapist) were rated for treatment fidelity and showed good to very good adherence in both groups (M CBT = 4.7, SD = 0.6 and M VR-CBGTp = 4.8, SD = 0.7) on a 6-point scale. The session with the lowest adherence scored 3.5, and the highest session score was 5.7.
At baseline, 81 of 98 randomized participants sufficiently completed ESM, with a mean of 47.9 self-assessments (SD = 13.9). At posttreatment, 56 participants (57% of all randomized participants and 69% of those with baseline data) completed ESM (M = 44.4, SD = 12.7). At follow-up, 50 participants (51% of randomized participants and 62% of those with baseline data) completed the ESM (M = 42.5, SD = 12.5).
Momentary paranoia (primary outcome) improved significantly between baseline and posttreatment in both groups (ES = 0.65), as well as between baseline and follow-up (ES = 0.57) (see Tables 3 and 4). Participants in the VR-CBTp group improved on average by 12.6 points and the CBTp group by 9.2 points between baseline and follow-up. A significant interaction effect in favor of VR-CBTp was found at posttreatment (ES = 0.62) but not at follow-up (ES = 0.31).
Table 3. Means and standard deviations of outcomes over time

Note: Data are mean (SD). Deviating n: a= 47, b= 36, c = 40, d= 43, e= 26, f= 42, g= 35, h= 34, i = 45, j = 38, k = 48.
Table 4. Test results (main and interaction effects) of multilevel analyses

Note: CBTp was coded as 0 and VR-CBTp as 1. p-values derived from Type III F tests of fixed effects, confidence intervals from estimates of t fixed effects.
The PSYRATS interview confirmed the change in paranoid delusions; both groups reported reductions in paranoid delusions over time, with the biggest change occurring between baseline and posttreatment (ESt0–t1 = 0.66 and ESt0–t2 = 0.74). No significant interaction effects were found for posttreatment (ES = 0.17) nor follow-up (ES = 0.32); thus, the two therapies did not produce significantly different outcomes on these paranoia measurements. Similarly, the GPTS showed that patients in both treatment groups showed reduced ideas of social reference (ESt0–t1 = 1.15 and ESt0–t2 = 1.01) and ideas of persecution (ESt0–t1 = 1.02 and ESt0–t2 = 0.83) with strong improvements at the post-assessment. No significant interaction effects for ideas of social reference (ESt0–t1 = 0.17 and ESt0–t2 = 0.18) and persecution were found (ESt0–t1 = 0.00 and ESt0–t2 = 0.01).
Participants showed significantly less safety behavior over time. This effect was stronger in the VR-CBTp group, as shown by the significant interaction effect at posttreatment for avoidance and in-situation safety behavior, but this effect did not reach significance at follow-up. With regard to social interaction anxiety, patients in both treatment arms showed reduced social interaction anxiety post-treatment (main effects of time significant at p < .001 for both time points). There was no significant interaction effect at posttreatment, but we did observe stronger improvements in the VR-CBTp group at follow-up. In addition, symptoms of depression, worry, cognitive problems and biases, and social functioning showed improvements over time when looking at both groups (p < .001). Only for depression, an interaction effect was found, reflecting stronger improvement in the VR-CBTp group at posttreatment but not at follow-up. Interpersonal sensitivity also improved slightly in both groups, but this was only significant for posttreatment.
For self-esteem and core schemas, significant main effects of time were found for posttreatment and follow-up, except regarding positive others’ core schemas. For positive and negative self-esteem, significant interaction effects were established for both time points, reflecting that self-esteem improved over time in the VR-CBTp group but remained stable over time in the CBTp group.
Regarding secondary ESM outcomes, a significant improvement was found for the avoid/endure threat situations scale over time between baseline and posttreatment, as well as follow-up. No significant improvements over time were found for proportion in time in the company or social threat subscales. There was a main effect of group for the social threat scales that was attributable to the difference at baseline.
A total of 15 serious adverse events occurred during the trial (n = 2 before randomization; n = 3 VR-CBTp; n = 10 CBTp). Two serious events (admissions) were possibly related to the trial, specifically to the intensity of the baseline assessment and more awareness of behavior patterns that led to an increase in suffering after the first session.
Discussion
In this RCT comparing VR-CBTp to standard CBTp for paranoid ideation in patients with a psychotic disorder, we found that following both treatments, patients reported significantly reduced momentary paranoia and severity of paranoid delusions, safety behavior, social anxiety, depression, worry, and dysfunctional cognitive schemes. Furthermore, posttreatment, patients reported improved self-esteem and social functioning, and most effects were maintained 3 months after treatment. At posttreatment, VR-CBTp had significantly stronger effects than standard CBTp on momentary paranoia, safety behavior, depressive symptoms, and self-esteem, of which the difference in effects on self-esteem and social anxiety remained at follow-up. In VR-CBTp, 37% of the participants dropped out versus 24% in the CBTp group. Completers, on average, received 12.7 (VR-CBTp) and 15.1 (CBTp) sessions. These findings indicate that CBTp and VR-CBTp are both efficacious treatments for paranoid ideation but that VR-CBTp may be somewhat more effective directly after treatment completion.
Treatment dropout rates were substantially higher in the VR-CBTp group (37%) than in the CBTp group (24%). A substantial proportion of the dropouts (22% and 50%, respectively) did not start therapy. Reasons for dropout were often unrelated to the interventions (prioritizing other treatment, e.g. for trauma, assessment too intensive, imprisonment, and physical illness). With regard to VR-CBTp, sometimes participants had to travel further than usual, as VR sets were located at specific locations, which may have influenced willingness to participate. Moreover, high dropout could be partly caused by the COVID-19 pandemic, as outside the pandemic, dropout was lower. It may also be attributed to the rapid emphasis on exposure in VR-CBTp, which might be less tolerable for some, or to a perceived mismatch between the possibilities for behavior exercises in VR and the goals of participants. The strong focus on exposure/behavior exercises might make the treatment less tolerable or suitable for some, but it is also a major strength of VR-CBTp. Lastly, dropout may be attributed to the pragmatic design in which therapy was integrated into a clinical setting. However, this design enabled a realistic image of feasibility in clinical practice.
To improve engagement and reduce dropout, hybrid treatments – combining VR with in vivo exercises and telehealth options where patients can be partly treated at home – could form a solution. Also, more precise treatment indication (as dropout was often related to a mismatch with the patient’s current needs), combined with shared decision-making regarding treatment choices, could enhance the effectiveness and acceptability of the interventions. Finally, portable VR sets or VR sets at more treatment locations are necessary to counter patients’ traveling.
When we look at those participants who completed therapy, we observe that similar treatment effects were attained in VR-CBTp with fewer sessions. The VR-CBTp protocol explicitly includes extensive practice in VR, stimulating patients and therapists to engage in exposure for 40 min per session, whereas research has shown that in vivo exposure is frequently underutilized (de Jong et al., Reference de Jong, Lommen, van Hout, de Jong and Nauta2020). This enabled participants to practice in threatening social situations earlier, more often, and more intensively and in a personalized way. The working mechanism of behavior exercises is dropping safety behaviors to experience high levels of anxiety and to subsequently learn that one is safe and can endure distress; that is, threat beliefs are falsified, and maladaptive fear associations may be replaced with new, more adaptive ones. This approach may thus facilitate quicker and stronger reduction of safety behavior than in CBTp. Consequently, it could have increased the experience of positive learning effects and impacted self-esteem. Possibly, the direct VR exposures are such an empowering experience that improves self-esteem.
This is consistent with current cognitive behavior models of paranoid delusions that view delusions as threat beliefs maintained by the use of safety strategies and psychological factors such as worry and low self-esteem (Freeman, Reference Freeman2016). If treatment successfully targets these maintaining factors, (Freeman, Emsley, et al., Reference Freeman, Emsley, Diamond, Collett, Bold, Chadwick, Isham, Bird, Edwards, Kingdon, Fitzpatrick, Kabir, Waite, Carr, Causier, Černis, Kirkham, Lambe, Lister and Twivy2021), patients will more easily enter feared situations and will experience that they are safe, as a result of which paranoid ideation will gradually disappear. VR-CBTp centers around behavior exercises and lowers the threshold for engaging in threatening virtual social situations because patients know it is not real, virtual scenarios can be completely controlled, and the therapist is present for guidance. Indeed, results from our previous VR-CBTp RCT confirmed that the reduction in safety behavior over time was related to diminished levels of paranoid ideation (Pot-Kolder et al., Reference Pot-Kolder, Geraets, Veling, Staring, Gijsman, Delespaul and van der Gaag2018).
This finding of potentially higher treatment efficiency of VR-CBTp is important both for patients and mental healthcare, especially when replicated. While both conditions used similar symptom-oriented protocols focusing on paranoid ideation, we cannot rule out the possibility that VR-CBTp may have had a somewhat narrower focus than CBTp. Nonetheless, as results indicated that there was generalization to real-life situations for both treatments with similar effects on a wide range of outcomes, they imply higher efficacy of the VR-CBTp protocol. Quicker recovery is advantageous for people suffering from paranoid delusions, as these symptoms have been associated with high levels of distress and hospital admissions (Freeman et al., Reference Freeman, Startup, Dunn, Wingham, Černis, Evans, Lister, Pugh, Cordwell and Kingdon2014). A 16% reduction in treatment duration (2.6 of 16 sessions) with VR-CBTp compared to standard CBTp is relevant to the accessibility of psychological treatment, as more patients can be treated with the same human resources, which is crucial given the low proportion of patients who currently receive CBTp (Burgess-Barr et al., Reference Burgess-Barr, Nicholas, Venus, Singh, Nethercott, Taylor and Jacobsen2023). Cost-effectiveness studies still need to be conducted, but preliminary findings suggest that VR-CBTp is cost-effective in the short term from a societal perspective (Pot-Kolder et al., Reference Pot-Kolder, Veling, Geraets, Lokkerbol, Smit, Jongeneel, Ising and Van Der Gaag2020). If developments of (partly) automated VR therapies continue, treatment efficiency and cost-effectiveness are likely to increase (Freeman et al., Reference Freeman, Lambe, Kabir, Petit, Rosebrock, Yu, Dudley, Chapman, Morrison, O’Regan, Aynsworth, Jones, Murphy, Powling, Galal, Grabey, Rovira, Martin, Hollis and West2022).
Our study had several limitations. First, the number of participants was lower than planned (103 instead of 122). Thus, the study was somewhat underpowered. Second, ESM was missing for 17% of the participants at baseline and almost half at posttreatment and follow-up. The frequency and intensity of assessments seem too burdensome. The same ESM scheme was not problematic in our previous RCT (Pot-Kolder et al., Reference Pot-Kolder, Geraets, Veling, Staring, Gijsman, Delespaul and van der Gaag2018); however, a special device was used, and stronger emphasis on completion was put (e.g. completion was needed for randomization). We do not know the impact of the missing data on our findings and the potential biases it could cause; missing ESM data could be related to both better and worse outcomes. In the case of better outcomes, assessments may have been missed because individuals were engaged in other activities, such as work, leading them to ignore the prompts. On the other hand, missing ESMs related to worse outcomes may be due to being preoccupied with symptoms, lack of motivation, or paranoia toward technology or smartphones. The significant finding on our primary outcome was consistent with most secondary outcomes. For secondary outcomes, the completion rate was 83% for posttreatment and 81% for follow-up. Future trials should carefully consider using ESM as a primary outcome; on the one hand, this method enables ecologically valid data; but on the other hand, using daily diary questionnaires has a high risk of missing data. Finally, our findings may have limited generalizability due to the predominantly Dutch participant pool, which may not reflect populations in non-European or non-Western countries. Strengths of the study included the comparison of VR-CBTp with the gold standard CBTp treatment, a rigorous pragmatic multicenter RCT design providing information on feasibility in ‘real-world’ clinical practice, and high-quality treatment in both groups, evidenced by experienced therapists, adequate training and supervision, and high treatment fidelity.
Over the past decade, several promising VR treatments for patients with psychotic disorders have been developed and tested in pilot studies, qualitative studies, and RCTs (Wiebe et al., Reference Wiebe, Kannen, Selaskowski, Mehren, Thöne, Pramme, Blumenthal, Li, Asché, Jonas, Bey, Schulze, Steffens, Pensel, Guth, Rohlfsen, Ekhlas, Lügering, Fileccia and Braun2022). VR-CBTp is currently being investigated in another RCT, also with CBTp as a control treatment condition (Jeppesen et al., Reference Jeppesen, Due, Mariegaard, Pinkham, Vos, Veling, Nordentoft and Glenthøj2022). If the results of our RCT are replicated and extended, VR-CBTp may be implemented in routine mental healthcare. To achieve this, actions are needed in all parts of the implementation process (ranging from technical support and training to integration in daily workflow and costs), using theoretical implementation frameworks and integrated implementation strategies at multiple levels.
Future research should investigate whether integrating VR-CBTp with in vivo behavior interventions, such as exposure, might give even better results than the current interventions. Moreover, there is a need to investigate which patients benefit most from VR-CBTp, such as those who are unable or unwilling to engage in in vivo exposures. Until present, only two studies have performed such research for VR-CBTp, showing larger benefits for those with a high level of safety behavior and agoraphobia (Berkhof et al., Reference Berkhof, van der Stouwe, Pot‐Kolder, van der Gaag, Veling and Geraets2024; Freeman et al., Reference Freeman, Lambe, Kabir, Petit, Rosebrock, Yu, Dudley, Chapman, Morrison, O’Regan, Aynsworth, Jones, Murphy, Powling, Galal, Grabey, Rovira, Martin, Hollis and West2022), but no other moderators are yet known.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/S0033291725100949.
Acknowledgments
The authors thank the study participants, the participating mental health centers (Altrecht, GGZ Delfland, GGZ Noord-Holland-Noord, Katholieke Universiteit Leuven, Parnassia, ProPersona, Rivierduinen, and UMCG), and the supervisors, therapists, and research assistants involved in this study.
Funding statement
This trial was funded by the Dutch Brain Foundation, grant number HA2017.01.04.
Competing interests
The authors declare none.
Ethical standard
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.