Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations (AVATAR) therapy for voice hearers: systematic review and meta-analysis

Felix Opper; Sebastian Henges; Pawel Weinstein; Dana Arnheim; Laura Fässler; Olivier Percie du Sert; Izabela Stefaniak; Michel Sabé; Louise Birkedal Glenthøj; Neil Thomas; Chih-Sung Liang; Brendon Stubbs; Kerem Böge

doi:10.1192/bjo.2026.11014

Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations (AVATAR) therapy for voice hearers: systematic review and meta-analysis

Published online by Cambridge University Press: 13 April 2026

Olivier Percie du Sert ,

Izabela Stefaniak ,

Michel Sabé ,

Louise Birkedal Glenthøj and

Neil Thomas

...Show all authors

Show author details

Felix Opper*: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany
Sebastian Henges: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, Netherlands
Pawel Weinstein: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Faculty of Psychology and Neuroscience, Maastricht University, Maastricht, Netherlands
Dana Arnheim: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Zachai Division of Psychiatry, Sheba Medical Center Israel and Sackler Faculty of Medicine, Tel Aviv University, Tel Aviv, Israel
Laura Fässler: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Department of Education and Psychology, Freie Universität Berlin, Berlin, Germany
Olivier Percie du Sert: Affiliation:
Prevention and Early Intervention Program for Psychoses, Douglas Research Centre, Montreal, Quebec, Canada Department of Psychiatry, McGill University, Montreal, Quebec, Canada
Izabela Stefaniak: Affiliation:
Faculty of Medicine, Lazarski University, Warsaw, Poland
Michel Sabé: Affiliation:
Psychiatry Department, Faculty of Medicine, University of Geneva, Geneva, Switzerland Division of Adult Psychiatry, Department of Psychiatry, University Hospitals of Geneva, Geneva, Switzerland
Louise Birkedal Glenthøj: Affiliation:
Department of Psychology, University of Copenhagen, Copenhagen, Denmark VIRTU Research Group, Mental Health Center Copenhagen, Copenhagen University Hospital, Mental Health Services CPH, Copenhagen, Denmark
Neil Thomas: Affiliation:
Centre for Mental Health and Brain Sciences, Swinburne University of Technology, Melbourne, Australia
Chih-Sung Liang: Affiliation:
Department of Psychiatry, Tri-Service General Hospital, National Defense Medical Center, Taipei, Taiwan Department of Psychiatry, Beitou Branch, Tri-Service General Hospital, Taipei, Taiwan
Brendon Stubbs: Affiliation:
Institute of Psychiatry, Psychology and Neuroscience (IoPPN), King’s College London, London, UK
Kerem Böge: Affiliation:
Department of Neuroscience and Psychiatry, Charité – Universitätsmedizin Berlin, corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany Medical University Brandenburg-Theodor Fontane, Neuruppin, Germany German Center of Mental Health (DZPG), Berlin/Potsdam, Germany
*: Correspondence: Felix Opper. Email: felix.opper@charite.de

Article contents

Abstract
Background
Aims
Method
Results
Conclusions
Method
Results
Discussion
Supplementary material
Data availability
Author contributions
Funding
Declaration of interest
References

Rights & Permissions

Abstract

Background

Auditory verbal hallucinations (AVHs) are common and distressing symptoms across a range of psychiatric disorders, including schizophrenia spectrum disorders. Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations (AVATAR) is an innovative therapeutic approach that facilitates dialogue with a digital avatar representing the voices that patients hear.

Aims

This systematic review and meta-analysis aimed to assess the efficacy, tolerability and acceptability of AVATAR therapy in reducing voice-related symptoms.

Method

Following preregistration, we conducted a systematic review and meta-analysis of controlled trials of AVATAR therapy in samples primarily diagnosed with schizophrenia spectrum disorders. PubMed, CINAHL, Embase, PsycInfo, ClinicalTrials.gov, ISRCTN and Web of Science were searched in March 2025. We assessed bias and certainty with the Cochrane Risk-of-Bias 2 tool and the GRADE approach. Random-effects models were used to synthesise outcomes.

Results

Eight AVATAR trials (N = 978) were included. Compared with usual treatment, waitlist and active control groups, AVATAR therapy decreased the primary outcome of AVH severity at post-treatment (Hedges’ g = −0.40, 95% CI −0.54 to −0.25) and short-term follow-up (Hedges’ g = −0.25, 95% CI −0.40 to −0.10). AVH subscales showed small significant effect sizes at post-treatment (frequency: Hedges’ g = −0.38, 95% CI −0.52 to −0.24; distress: Hedges’ g = −0.32, 95% CI −0.46 to −0.18), which were maintained at short-term follow-up. The certainty of evidence was rated moderate for AVH severity at post-treatment. AVATAR therapy was largely tolerable and acceptable, with adverse events mostly unrelated to the treatment and a comparable drop-out rate to control groups.

Conclusions

Findings suggest that AVATAR therapy is effective at reducing AVH symptoms. Considering heterogeneous control groups and less clear evidence for secondary outcomes and longer follow-ups, further research is warranted.

Keywords

AVATAR digital intervention voice hearing schizophrenia spectrum disorders virtual reality

Information

Type: Review
Information: BJPsych Open , Volume 12 , Issue 3 , May 2026 , e104

DOI: https://doi.org/10.1192/bjo.2026.11014 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution and reproduction, provided the original article is properly cited.
Copyright: © The Author(s), 2026. Published by Cambridge University Press on behalf of Royal College of Psychiatrists

Auditory verbal hallucinations (AVHs), the experience of hearing voices without an external source,^{Reference Beck and Rector1} can occur across a wide range of psychiatric disorders, including schizophrenia spectrum disorders, affective disorders, borderline personality disorder, post-traumatic stress disorder, as well as in non-clinical populations.^{Reference Maijer, Begemann, Palmen, Leucht and Sommer2} Despite their cross-diagnostic presence, AVHs are most commonly associated with schizophrenia spectrum disorders, where approximately 70% of patients experience them at some point during their lives.^{Reference McCarthy-Jones, Trauer, Mackinnon, Sims, Thomas and Copolov3} Phenomenologically, AVHs are highly heterogeneous, varying in content, emotional valence and perceived agency.^{Reference Woods, Jones, Alderson-Day, Callard and Fernyhough4} They may be personified or non-personified, and experienced as persecutory, abusive, obscene, derogatory, threatening or critical, but also potentially helpful, affirming or inspirational.^{Reference McCarthy-Jones, Trauer, Mackinnon, Sims, Thomas and Copolov3}

Although not inherently pathological, AVHs can become clinically relevant when they are experienced as intrusive, uncontrollable or malevolent.^{Reference Larøi, Bless, Laloyaux, Kråkvik, Vedul-Kjelsås and Kalhovde5} In such cases, AVHs are often associated with heightened distress, functional impairment and increased psychopathology.^{Reference Toh, Thomas, Hollander and Rossell6}

In the treatment of schizophrenia spectrum disorders, guidelines emphasise a multidisciplinary approach, with antipsychotic medication as a central component.^{Reference Hasan, Falkai and Lehmann7–Reference Kuipers, Yesufu-Udechuku, Taylor and Kendall9} Although antipsychotics are effective for a substantial proportion of patients,^{Reference Samara, Nikolakopoulou, Salanti and Leucht10} approximately 20–35% do not experience clinically meaningful improvement.^{Reference Diniz, Fonseca and Rocha11–Reference Siskind, Orr and Sinha13} Limitations are further underscored by high relapse rates upon medication discontinuation,^{Reference Bogers, Hambarian, Walburgh Schmidt, Vermeulen and Haan14,Reference Zipursky, Menezes and Streiner15} and the risk of burdensome side-effects.^{Reference Leucht, Priller and Davis16} Notably, around 30% of treatment-resistant symptoms involve persistent AVHs,^{Reference Goghari, Harrow, Grossman and Rosen17–Reference Nathou, Etard and Dollfus19} highlighting the urgent need for additional, targeted interventions for individuals who continue to experience significant voice-related distress despite standard pharmacological care.

In addition, psychological interventions such as cognitive–behavioural therapy (CBT) are recommended.^{Reference Hasan, Falkai and Lehmann7–Reference Kuipers, Yesufu-Udechuku, Taylor and Kendall9} CBT has demonstrated small effects on psychotic symptoms in numerous domains, including overall positive^{Reference Bighelli, Salanti, Huhn, Schneider-Thoma, Krause and Reitmeir20} and AVH-related symptoms.^{Reference Turner, Burger, Smit, Valmaggia and Gaag21} However, CBT shows relevant limitations including a large number of sessions required for effectiveness.^{Reference Lincoln, Jung, Wiesjahn and Schlier22} More recently, a shift toward symptom-specific approaches has gained momentum, allowing for more personalised treatment strategies.^{Reference Freeman23} For AVHs, relational therapies have emerged as a promising line of intervention, drawing on the person-like qualities of voices and conceptualising AVHs as embedded within a dynamic, relationship-like framework.^{Reference Smailes, Alderson-Day, Fernyhough, McCarthy-Jones and Dodgson24} Among these, the therapy intervention Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations (AVATAR) represents a novel therapeutic development explicitly aimed at improving outcomes for individuals experiencing distressing and persistent voices.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25}

Initially developed by Julian Leff in 2008,^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26} AVATAR therapy enables real-time interactions with a digital representation of a person’s most dominant AVH. The approach involves the creation of a patient-designed digital avatar displayed either on a screen or through a virtual reality headset. A clinician then animates the avatar by voicing it according to the patient’s descriptions, and the avatar’s facial and head movements are synchronised to simulate natural speech. This simulated ‘face-to-face’ interaction with the avatar functions as a form of exposure to anxiety-provoking stimuli,^{Reference Rus-Calafell, Ward, Zhang, Edwards, Garety and Craig27} with the therapeutic aim of gradually increasing the individual’s sense of control, reducing fear-based appraisals and altering the relational dynamics with the voice. By enabling patients to assert themselves and challenge previously threatening voices, AVATAR therapy may reduce the distress associated with AVHs.^{Reference Rus-Calafell, Ehrbar, Ward, Edwards, Huckvale and Walke28} In line with this, maladaptive appraisals of voices, such as beliefs about omnipotence or malevolence, have shown associations with voice-related distress, whereas more positive interpretations were modestly associated with reduced distress.^{Reference Tsang, Bucci and Branitsky29}

Since its creation, AVATAR therapy has been evaluated in several clinical trials.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25} Previous reviews have examined AVATAR therapy among virtual reality-based treatments in mental disorders or positive symptoms of schizophrenia spectrum disorders,^{Reference Spark, Pot-Kolder, Dzafic, Nelson, Byrne and Lum30,Reference Zeka, Clemmensen, Valmaggia, Veling, Hjorthøj and Glenthøj31} but they have not provided a comprehensive perspective of AVATAR therapy for AVHs beyond its role within the broader context of virtual reality-based treatments. Two previous systematic reviews and meta-analyses have examined AVATAR trials and have reported promising effects on AVH-related symptoms.^{Reference Aali, Kariotis and Shokraneh32,Reference Hsu, Tseng, Hsu, Yang, Changchien and Lin33} However, these were limited by the inclusion of few studies in the meta-analysis, by analysis of short follow-ups and by the lack of secondary aspects such as tolerability and acceptability of the treatment. These aspects are especially relevant for clinical decision-makers and guideline developers in evaluating the real-world utility of emerging therapies. Moreover, further high-quality clinical trials may have since been published, revealing the necessity for an updated review. In light of these limitations, we aimed to comprehensively assess the efficacy of AVATAR-based interventions for AVHs, including both clinical and functional outcomes, as well as their tolerability, acceptability and the overall quality of available evidence.

Method

This systematic review and meta-analysis was preregistered at PROSPERO on 15 March 2025 (identifier CRD420251005545) and followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) reporting guidelines (see Supplementary Tables 1 and 2 available at https://doi.org/10.1192/bjo.2026.11014).^{Reference Page, McKenzie, Bossuyt, Boutron, Hoffmann and Mulrow34}

Search strategy

On 21 March 2025, the publication databases Embase, PubMed and PsycINFO, as well as the grey literature and trial registration databases Web of Science Core Collection, CENTRAL, ClinicalTrials.gov and ISRCTN, were searched for relevant studies, to minimise the potential of publication bias. Indexing and general terms were applied in the search (see Supplementary Table 3).^{Reference Bramer, De Jonge, Rethlefsen, Mast and Kleijnen35} References of prior reviews and included articles were checked for additional studies, and experts were contacted for knowledge of further suitable trials. Duplicate removal and record screening were performed in the systematic review program Rayyan.^{Reference Ouzzani, Hammady, Fedorowicz and Elmagarmid36} F.O. and S.H. independently screened titles and abstracts for full-text screening eligibility with hierarchical criteria (see Supplementary Table 4),^{Reference Polanin, Pigott, Espelage and Grotpeter37} before independently completing the full-text screening in a spreadsheet. Corresponding authors of trial registrations and missing data were contacted weekly over the period of 4 weeks to request missing information. For disagreements during screenings, a third reviewer (K.B.) was consulted for discussion. Interrater reliability in the metric of Cohen’s κ was calculated using the DeltaMAN package for both screening stages.^{Reference Cohen38,Reference Maldonado, Marzo and Andrés39}

Inclusion criteria

To be included, studies had to be published or unpublished, randomised controlled trials or controlled trials, investigating the efficacy of AVATAR or equivalent therapy in treating samples of participants aged ≥16 years who reported AVHs and were diagnosed with a schizophrenia spectrum disorder in >50% of cases, compared to any active or passive control group. Finally, studies had to report one of the following outcomes of interest.

The primary outcome of this study was the between-group difference of severity of voice symptoms at post-treatment measured with the Psychotic Symptom Rating Scales for Auditory Hallucinations (PSYRATS-AH^{Reference Haddock, McCarron, Tarrier and Faragher40}). Secondary outcomes included further voice-related (frequency and distress in the PSYRATS-AH), clinical (positive, negative, total psychotic, depressive and anxiety symptoms) and functional outcomes (social functioning and quality of life), as well as their follow-up assessments. We also indexed acceptability by examining within- and between-group drop-out rates, and tolerability from reports of treatment-related adverse events and symptom exacerbations. For multiple non-AVH measurements per study and outcome, the more primary measurement was included into the quantitative analysis (per study definition or earlier reference as an outcome). Follow-up measurements were categorised as short (12–23 weeks), medium (24–51 weeks) and long term (≥52 weeks) as a pragmatic approach reflecting common follow-up durations selected in meta-analyses of psychological intervention trials in schizophrenia spectrum disorders.^{Reference Guaiana, Abbatecola, Aali, Tarantino, Ebuenyi and Lucarini41,Reference Mayer, Corcoran, Kennedy, Leucht and Bighelli42}

We examined both treatment and study drop-out. Treatment drop-out was identified by any participant who did not complete the treatment post-randomisation.^{Reference Cooper and Conklin43} The criteria for having finished a treatment followed that of each study. In contrast, study drop-out was characterised by participants lost to post-treatment assessments regardless of reason.

Data extraction, synthesis and effect sizes

Data was independently extracted by F.O. and S.H., and cross-checked by D.A. The data extraction started on 2 April 2025. Meta-analysis was performed when at least two effect sizes could be pooled per analysis. Although AVATAR therapy approaches are similar, we anticipated differences in control groups, leading us to the use of random-effects models, which allow for heterogeneous treatment effects.^{Reference Riley, Higgins and Deeks44}

The metric of Hedges’ g was used to report the standardised mean difference for continuous outcomes and aggregate outcomes of overlapping constructs using different scales.^{Reference Hedges45} Negative effect sizes described outcomes with a lower mean score in the AVATAR groups relative to the control groups, with positive values indicating the opposite. Values of 0.2, 0.5 and 0.8 were considered the thresholds for low, medium and large effect sizes, respectively.^{Reference Cohen46} Proportion and risk ratio were calculated for within- and between-group drop-out effect sizes. Drop-out proportions were calculated with generalised linear-mixed models.^{Reference Lin and Chu47} Risk ratios >1 portrayed an increased risk in AVATAR relative to control groups. To adjust for zero-case studies, 0.5 cases of drop-out were added to each study’s risk table containing at least one instance of zero drop-outs in either group.^{Reference Weber, Knapp, Ickstadt, Kundt and Glass48} For three-arm studies, the sample sizes of the control groups were halved to include both AVATAR groups, per the suggestions in the Cochrane Handbook.^{Reference Higgins, Eldridge, Li, Higgins, Thomas, Chandler, Cumpston, Li and Page49}

The presence of heterogeneity was tested with Cochran’s Q-test and interpreted as Higgins’ I ² -statistic. ^{Reference Cochran50,Reference Higgins51} Percentages of 25%, 50% and 75% were considered low, moderate and high levels of heterogeneity, respectively. All analyses were performed in R version 2025.05.1+513 in RStudio for macOS, using the metafor and Tidyverse packages, and were visualised with forest plots.^{52–Reference Wickham, Averick and Bryan54} To exploratively assess the robustness of effects on AVH symptoms, subgroup analyses of studies with low and high risks of bias were performed. Additionally, for AVH symptoms, analyses by control group (treatment as usual versus treatment as usual plus waitlist versus active control group) were performed with subgroup analyses and mixed-effects meta-regression. Results with significant heterogeneity were examined with jackknife analyses.^{Reference Viechtbauer and Cheung55} An α-level of 0.05 was considered for all analyses.

Bias assessments, publication bias and quality of evidence

F.O. and S.H. independently assessed risk of bias with the Revised Cochrane Risk-of-Bias Tool for Randomized Trials (RoB-2^{Reference Sterne, Savović, Page, Elbers, Blencowe and Boutron56}), and differences were discussed with a third reviewer, K.B. Bias was rated under consideration of the primary outcome of AVH severity. The risk of bias was visualised using the robvis app.^{Reference McGuinness and Higgins57} Publication bias may occur when articles remain unpublished because of unwanted study results.^{Reference Dickersin and Min58} For main analyses, funnel plots were plotted for its assessment and asymmetry was tested with Egger’s tests.^{Reference Egger, Davey Smith, Schneider and Minder59} Asymmetric analyses were eligible to be corrected with the trim-and-fill procedure.^{Reference Duval and Tweedie60} F.O. characterised the certainty of meta-analytic evidence according to the Grading of Recommendations, Assessment, Development and Evaluation (GRADE) criteria, using GRADEpro online software.^{Reference Guyatt, Oxman and Vist61,62}

Results

Study selection

The literature search identified 556 records (see the PRISMA flow diagram in Supplementary Fig. 1). After duplicate removal and dual screenings, a total of seven peer-reviewed articles and one letter to the editor were included. We contacted authors of trial registrations and conference abstracts, although no ongoing trials were able to provide includable information concerning relevant outcomes for this study. Abstract and full-text screening exclusion reasons can be viewed in Supplementary Tables 4 and 5. The original AVATAR trial included a small number of participants under 16 years of age,^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25} formally conflicting with our inclusion criteria. Given this small proportion and comparable mean age and standard deviation to other included samples, the study was retained. The title and abstract as well as full-text screening interrater reliabilities revealed Cohen’s κ = 0.95 and 0.93, respectively, which can be considered satisfactory.^{Reference McHugh63}

Study and sample characteristics

The eight randomised and controlled studies included nine relevant comparisons (Garety et al^{Reference Garety, Edwards and Jafari64} included two AVATAR groups) and n = 554 participants in the AVATAR therapy groups and n = 424 in the control groups. Of note is that three studies were partial crossover trials,^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} with only the period before crossover included in the analyses. The post-treatment measurement varied from 7 to 16 weeks (mean 9.88 weeks). Each study that performed a follow-up assessment did so at 12–13-week follow-up, and one study additionally assessed at 24- and 52-week follow-ups.^{Reference Dellazizzo, Potvin, Phraxayavong and Dumais67} All participants received treatment as usual, typically including antipsychotic medication. Control groups consisted of treatment as usual alone,^{Reference Garety, Edwards and Jafari64} treatment as usual plus waitlist,^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} or treatment as usual with an additional active control condition (CBT,^{Reference Dellazizzo, Potvin, Phraxayavong and Dumais67,Reference Liang, Li and Guo68} supportive counselling,^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26} enhanced treatment as usual with supportive counselling sessions^{Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69}). The studies with an additional active control condition were categorised as active control groups in the subgroup analysis. The range of treatment durations was 6 to 9 weeks or sessions except for AVATAR-EXT,^{Reference Garety, Edwards and Jafari64} an extension of AVATAR in 12 weekly sessions. Samples consisted of persons with schizophrenia spectrum disorders reporting treatment-resistant, chronic or persisting AVHs, although three studies included small subsamples of other mental disorders.^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Garety, Edwards and Jafari64,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} Demographic and clinical outcomes, as well as specific study characteristics, can be found in Table 1. Additional information (protocols, criteria, settings, diagnosis distributions, participant ages and funding) is provided in Supplementary Table 6, whereas Supplementary Table 7 describes additional information received from study authors.

Table 1 Study characteristics

N, sample size; T0, baseline; AVATAR, Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations; TAU, treatment as usual; AVH, auditory verbal hallucinations; T1, post-treatment; T2, follow-up; RCT, randomised controlled trial; RPCT, randomised partial crossover trial.

Sample sizes given at baseline.

AVATAR therapy protocols

The treatment protocols showed differences. Treatment began with the creation of a digital avatar, except in Stefaniak et al,^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66} which used a standardised avatar. Although some protocols enabled viewing the avatar via a virtual reality headset,^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Dellazizzo, Potvin, Phraxayavong and Dumais67–Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} others presented the avatar on a computer screen,^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Garety, Edwards and Jafari64,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} and one used full-face and body motion capture of the therapist to increase the feeling of presence and immersion.^{Reference Liang, Li and Guo68} The participants usually interacted with an avatar voiced by the therapist in real time, although in Stefaniak et al,^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66} a pre-programmed avatar was used, which the therapist and participant both interacted with.

Post-creation sessions included a therapeutic session preparation phase, followed by exposure to distressing utterances of the avatar, during which the participant was encouraged respond assertively.^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26} As the treatment progressed, the avatar’s verbalisations gradually became less abusive and more supportive to reflect the participants’ increase in control. Later sessions were designated for future outlook and relapse prevention. As an extension of this framework, AVATAR-EXT provided six additional sessions, where participants discussed the trauma, marginalisation and social exclusion background of their AVHs with the therapist before avatar exposure.^{Reference Garety, Edwards and Jafari64} Half of the studies provided recordings of the sessions, which could be listened to as homework to increase transferral to daily life.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Garety, Edwards and Jafari64,Reference Liang, Li and Guo68}

Risk of bias

The risk-of-bias assessments are displayed in Fig. 1, and the results of the bias assessment questions can be found in Supplementary Table 8. Overall, two studies were rated with low risk of bias,^{Reference Garety, Edwards and Jafari64,Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} and the remaining six studies high overall risk of bias.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65–Reference Liang, Li and Guo68} Importantly, the two studies rated as having low risk of bias accounted for the majority (63%) of participants.

Fig. 1 Risk-of-bias assessments. Ext., extended; Brf., brief.

Meta-analysis and systematic review results

Although control groups varied across studies, a main analysis of all studies was performed. Treatments for schizophrenia spectrum disorders differ depending on the setting, availability and severity of symptoms.^{Reference Burgess-Barr, Nicholas, Venus, Singh, Nethercott and Taylor70–Reference McDonagh, Dana, Selph, Devine, Cantor and Bougatsos72} Consequently, this approach provided greater external validity, but may have induced heterogeneity. Subgroup analyses by specific control group type were conducted to complement this approach. Forest plots of all secondary outcomes can be found in Supplementary Figs 2–6, the assessments of the certainty of evidence according to GRADE criteria can be found in Supplementary Table 9 and information on which scales were included can be found in Supplementary Table 10.

Primary and secondary AVH outcomes: AVH severity, frequency and distress

Table 2 presents the meta-analytic results for voice-related outcomes alongside the corresponding certainty of evidence, and Table 3 shows subgroup and moderation analyses by subgroup and bias. Each study assessed AVH severity, frequency and distress at pre- and post-treatment. Short-term 12- to 13-week follow-up assessments were performed in all but three studies.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} The pooled effect size on AVH severity at post-treatment was small, homogeneous and favoured AVATAR (Hedges’ g = −0.40, 95% CI −0.54 to −0.25, p < 0.001). This effect was moderated by control group (p = 0.01), although each subgroup favoured AVATAR. At short-term follow-up, the effect size remained small (Hedges’ g = −0.25, 95% CI −0.40 to −0.10, p < 0.001) and homogeneous. This effect was not moderated by control group (p = 0.82), but only the active control subgroup significantly favoured AVATAR. Medium- and long-term effect sizes were derived from only one study, and were small and non-significant. Studies rated with a low risk of bias revealed similar results at both post-treatment and short-term follow-up. Forest plots for AVH severity are displayed in Fig. 2. The certainty of evidence was rated moderate for post-treatment and low for the short-term-follow-up, whereas medium- and long-term follow-ups were rated very low.

Table 2 Meta-analytic results for auditory verbal hallucination-related outcomes

N, sample size; I ², heterogeneity statistic; GRADE, Grading of Recommendations, Assessment, Development and Evaluation; AVH, auditory verbal hallucinations; T1, post-treatment; T2, follow-up at 12 weeks; T3, follow-up at 24 weeks; T4, follow-up at 52 weeks.

Negative effect sizes described lower scores in intervention groups.

a. GRADE ratings⁷³: High: a lot of confidence that the true effect lies close to that of the estimated effect. Moderate: moderate confidence in the estimated effect. The true effect is likely to be close to the estimated effect, but there is a possibility that it is substantially different. Low: limited confidence in the estimated effect. The true effect might be substantially different from the estimated effect. Very low: very little confidence in the estimated effect. The true effect is likely to be substantially different from the estimated effect.

*p < 0.05; **p < 0.01; ***p < 0.001.

Table 3 Meta-analytic subgroup and moderator results for auditory verbal hallucination-related outcomes

N , sample size; I ², heterogeneity statistic; AVH, auditory verbal hallucinations; T1, post-treatment; TAU, treatment as usual; T2, follow-up at 12–23 weeks; T3, follow-up at 24–51 weeks; T4, follow-up at 52 weeks.

Negative effect sizes described lower scores in intervention groups.

*p < 0.05; **p < 0.01; ***p < 0.001.

Fig. 2 Forest plot showing meta-analytic results for AVH severity at post-treatment and follow-up. Negative SMDs portray smaller means in AVATAR compared with control groups. ⊕, Low bias; ⊖, high bias; AVATAR, Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations; AVH, auditory verbal hallucination; Brf., brief; CBT, cognitive–behavioural therapy; Ext., extended; I ², Higgins’ heterogeneity statistic; k, number of comparisons; n, sample size; Q, Cochran’s Q-statistic; Qp, Cochran’s Q p-value; SMD, standardised mean difference in Hedges’ g; TAU, treatment as usual.^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66}

For AVH frequency, the pooled effect size at post-treatment was small, favoured AVATAR (Hedges’ g = −0.38, 95% CI −0.52 to −0.24, p < 0.001) and was homogeneous. Of the control group subgroup analyses, all but the AVATAR versus the treatment as usual plus waitlist group favoured AVATAR. The effect size at short-term follow-up remained small (Hedges’ g = −0.34, 95% CI −0.49 to −0.19, p < 0.001), significantly favoured AVATAR across subgroups and was homogeneous. Medium- and long-term follow-up effect sizes were derived from one study and were small and non-significant. Studies rated with a low risk of bias revealed similar results at both time points. The certainty of evidence was rated high for post-treatment and low for short term follow-up, whereas medium- and long-term follow-ups were rated very low.

For AVH distress, the pooled effect size at post-treatment was medium, significant (Hedges’ g = −0.32, 95% CI −0.46 to –0.18, p < 0.001) and homogeneous. This effect was moderated by control group (p = 0.02), but each of the subgroups significantly favoured AVATAR. The pooled small effect size at short-term follow-up favoured AVATAR (Hedges’ g = −0.20, 95% CI −0.35 to −0.06, p = 0.007) and was homogeneous. Only the active control subgroup favoured AVATAR at short-term follow-up. Medium- and long-term follow-up effect sizes were derived from one study and were negligible to small and non-significant. Studies rated with a low risk of bias revealed similar results at post-treatment, but were non-significant at short-term follow-up. The certainty of evidence was rated low for post-treatment and very low for short-term follow-up, whereas medium- and long-term follow-ups were rated very low.

Secondary outcomes: clinical and functional outcomes

Table 4 presents the results of meta-analyses performed for secondary clinical outcomes. PSYRATS subscales measured by one study^{Reference Garety, Edwards and Jafari64} were aggregated according to the recommendations of Borenstein,^{Reference Borenstein, Hedges, Higgins and Rothstein74} assuming a correlation of 0.34 between delusions and hallucinations.^{Reference Smith, Mar and Turoff75} For positive symptoms, small and negligible effect sizes significantly favoured AVATAR at post-treatment and short-term follow-up, respectively. In contrast, effect sizes for total psychotic and negative symptoms did not significantly favour either group at either time point. Small effect sizes significantly favoured AVATAR for anxiety and depressive symptoms at post-treatment, but remained significant only for anxiety symptoms at short-term follow-up. No significant between-group effects were observed for quality of life at either time point. Social functioning was measured by only one study,^{Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} and was therefore not meta-analytically aggregated. The removal of a singular outlier^{Reference Dellazizzo, Potvin, Phraxayavong and Dumais67} at post-treatment, specifically for quality of life, reduced heterogeneity without altering the conclusion of the statistical tests. Medium- and long-term follow-up assessments were derived from one study and were non-significant for each outcome.

Table 4 Meta-analytic results for clinical and functional outcomes

I ² , heterogeneity statistic; T1, post-treatment; T2, follow-up at 12 weeks; T3, follow-up at 24 weeks; T4, follow-up at 52 weeks; AVATAR, Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations.

Negative effect sizes described lower means in AVATAR groups.

a. Decimals are because of aggregation of subscales.

*p < 0.05; **p < 0.01; ***p < 0.001.

Secondary outcomes: tolerability and acceptability

Each study reported tolerability aspects concerning treatment-related adverse events or exacerbations, except for Leff et al.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25} Adverse events and exacerbations were non-standardised, making narrative review necessary. In Percie du Sert et al,^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65} 1 participant out of 12 received additional counselling during early treatment, because of transient symptom exacerbations. Additionally, each of the four participants who dropped out did so because of anxiety symptom exacerbations in early virtual reality sessions. Three studies stated that any adverse events were not attributable to AVATAR or the control group.^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Dellazizzo, Potvin, Phraxayavong and Dumais67,Reference Liang, Li and Guo68} In Garety et al,^{Reference Garety, Edwards and Jafari64} 1 case of hospital admission in AVATAR (out of 116) and 4 cases in AVATAR-EXT (out of 114) could not be ruled out from possibly being attributable to the treatments. In Smith et al,^{Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} of 140 participants, five cases of hospital admission and one case of self-harm were considered potentially related to the AVATAR treatment because of worsening of AVHs. AVH symptom increases occurred in 52 participants during early exposure, which gradually declined over the course of the therapy. Furthermore, 40 participants reported needing additional time to manage anxiety during virtual reality immersion. Finally, Stefaniak et al^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66} reported one case of hospital admission among 14 in the AVATAR group, although causal connections to the treatment were not reported.

Each study reported on treatment drop-outs in AVATAR and five reported rates in control groups. As can be seen in Table 5, the overall aggregated proportion of treatment drop-out in AVATAR was 24% with moderate heterogeneity. In control groups, the aggregated proportion of treatment drop-out was 18%, with moderate heterogeneity. No single outlier was responsible for the observed heterogeneity. In terms of study drop-out, both intervention and control groups showed aggregated proportions of 16%. Comparative analyses showed no significant difference in risk between AVATAR and control groups for both treatment and study drop-out (risk ratio of 1.01 in both cases).

Table 5 Meta-analytic drop-out results

N, sample size; I ² , heterogeneity statistic. Negative effect sizes described lower scores in intervention groups whereas risk ratios above 1 describe an increased risk in intervention groups.

**p ≤ 0.01; ***p ≤ 0.001.

Publication bias

The Egger’s test for AVH severity at post-treatment was significant (p = 0.009), indicating an asymmetric funnel plot.^{Reference Egger, Davey Smith, Schneider and Minder59} Examination of the corresponding funnel plot revealed the notable impact of the large effect size in Stefaniak et al.^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66} Trim-and-fill procedures revealed a corrected small effect size favouring AVATAR (Hedges’ g _corrected = −0.35, 95% CI −0.49 to −0.22, p < 0.001), with two studies estimated to be missing. The Egger’s test of AVH distress (p = 0.02) at post-treatment was significant. Examination of the corresponding funnel plot revealed the impact of the large effect sizes in Stefaniak et al^{Reference Stefaniak, Sorokosz, Janicki and Wciórka66} and Percie du Sert et al.^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65} Trim-and-fill procedures revealed a corrected small effect size favouring AVATAR (Hedges’ g _corrected = −0.28, 95% CI −0.41 to −0.14, p < 0.001), with two studies estimated to be missing. Funnel plots and Egger’s tests statistics can be found in Supplementary Figs 7–10 and Supplementary Table 11, respectively.

Discussion

The present study employed a systematic review and meta-analysis framework to investigate the efficacy, tolerability and acceptability of AVATAR for AVHs. Analyses showed preliminary evidence of the efficacy of AVATAR in decreasing AVH severity at post-treatment and short-term follow-up. This was robust for the effect of bias, and after correction for potential publication bias, although the effect was moderated by control group. Notably, the analyses were performed in persons with treatment-resistant and persistent AVHs, suggesting that AVATAR may address a critical treatment gap for previously refractory symptoms. The effect size observed (Hedges’ g = –0.40) was comparable to that of guideline-recommended psychological interventions, such as CBT (Hedges’ g = –0.34); however, this was in populations not specifically characterised as treatment-resistant.^{Reference Turner, Burger, Smit, Valmaggia and Gaag21} This illustrates the compelling potential of a theory-based intervention that harnesses digital technologies in a novel and effective way to treat AVHs. Medium- and long-term follow-ups (non-significant small effect sizes) were measured by only one study, limiting interpretability and certainty of evidence.

Analyses of effects on AVH frequency and distress showed significant small effects at post-treatment. They remained small and significant at short-term follow-up, and robust for the effect of bias (exception: distress at short-term follow-up) and potential publication bias. From a theoretical standpoint, the effects may align with potential mechanisms underlying reductions in AVH severity. The exposure to distressing avatar utterances and subsequent assertive strategies seem to have reduced both the frequency and distress of AVHs. Qualitative evidence has shown that although those affected by AVHs prioritise a decrease in the frequency of AVHs (i.e. total cessation), a decrease in AVH distress and disruption is more prioritised by service providers.^{Reference Longden, Branitsky, Sheaves, Chauhan and Morrison76} These findings indicate that AVATAR may support shared goals between patients and caregivers, with the potential for mutually beneficial outcomes. Of note is that the duration of AVATAR generally consisted of fewer sessions (7–12) than a typical minimal dose of 16 sessions of a psychological intervention such as CBT,^{Reference Kuipers, Yesufu-Udechuku, Taylor and Kendall9} emphasising the efficiency of AVATAR. Nevertheless, AVATAR is comparatively resource-intensive and may be demanding to implement at scale, because of the requirements of software, hardware and training.^{Reference Garety, Edwards and Jafari64}

Some concerns of publication bias were present in the AVH symptom analyses at post-treatment for AVH severity and distress. Funnel plots revealed the strong influence two small outlier studies with large effect sizes.^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Stefaniak, Sorokosz, Janicki and Wciórka66} This may point less toward the potential for missing studies and may instead reflect true heterogeneity of small studies. These are often able to direct more resources into treatment intensity, but are also more likely to be afflicted by methodological effects of bias.^{Reference Sterne, Sutton, Ioannidis, Terrin, Jones and Lau77} Furthermore, the outlier studies had waitlist control groups, which a recent meta-analysis has shown to have the smallest within-group symptom reductions in treatment-resistant schizophrenia,^{Reference Schütz, Salahuddin, Priller, Bighelli and Leucht78} underlining the potential for inflated effects in these designs.

AVATAR therapy was generally well tolerated. Treatment-related adverse events and lasting symptom exacerbations were not common. However, two studies reported instances of anxiety occurring during early exposure to avatars,^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65,Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} and a few cases requiring hospital admission attributable to treatment-related AVH symptom exacerbations.^{Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} As with other exposure-based therapies, initial increases in anxiety are not unusual and typically diminish as the treatment progresses.^{Reference Heinig, Knappe and Hoyer79} Consistent with this, two included studies found that within-session anxiety significantly decreased over the course of the treatment.^{Reference Craig, Rus-Calafell, Ward, Leff, Huckvale and Howarth26,Reference Rus-Calafell, Ward, Zhang, Edwards, Garety and Craig27,Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65} Additionally, small effect sizes favoured AVATAR in the reduction of anxiety symptoms at post-treatment and short-term follow-up. This further suggests that anxiety exacerbation is likely transient for the majority of participants. Studies also employed techniques to mitigate acute distress, such as offering a panic button or showing images of a calming beach.^{Reference Leff, Williams, Huckvale, Arbuthnot and Leff25,Reference Smith, Vernal, Mariegaard, Christensen, Jansen and Schytte69} However, expanding upon these techniques to avoid overwhelming participants may be beneficial in AVATAR to minimise adverse events and attrition.

Treatment non-completion was observed in approximately a quarter of participants receiving AVATAR, which did not differ significantly in drop-out risk to control groups, supporting the general acceptability of the intervention. Notably, other exposure-based interventions report similar treatment drop-out rates (e.g. 28% for prolonged exposure in post-traumatic stress disorder^{Reference Varker, Jones, Arjmand, Hinton, Hiles and Freijah80}). The aggregated study drop-out rate of 16% across intervention and control conditions closely aligns with drop-out rates of CBT in schizophrenia spectrum disorders (14%^{Reference Cuijpers, Harrer, Miguel, Ciharova, Papola and Basic81}). This suggests that AVATAR is comparable to current recommended evidence-based treatments in terms of retention.

Positive symptoms improved compared to control groups, which is likely associated with the improvement in AVH symptoms. Effect sizes were comparable to those reported for CBT in treatment-resistant schizophrenia compared with TAU and supportive counselling (Hedges’ g = –0.31 and –0.19, respectively^{Reference Salahuddin, Schütz and Pitschel-Walz82}). Total psychotic and negative symptoms did not show significant improvement, which may reflect the more specific focus on AVHs. In contrast, the small effect on anxiety symptoms at post-treatment and short-term follow-up makes the anxiolytic effect of exposure to distressing AVHs apparent. Significant results were not observed for social functioning and quality of life. Contrary to common impairment in schizophrenia spectrum disorders,^{Reference Laws, Darlington, Kondel, McKenna and Jauhar83} functional outcomes were measured in few of the included studies, reflecting a common oversight in schizophrenia spectrum disorder research.^{Reference Bighelli, Wallis, Reitmeir, Schwermann, Salahuddin and Leucht84} Future trials should aim to consistently include functional outcome measures.

Limitations

The results of this study should be considered alongside several limitations. As a meta-analysis of under ten studies, robustness of results is not assured, requiring additional high-quality, randomised controlled trials. Likewise, publication bias tests and meta-regression tests may have been underpowered, potentially missing significant effects.^{Reference Sterne, Sutton, Ioannidis, Terrin, Jones and Lau77,Reference Deeks, Higgins, Altman, Higgins, Thomas, Chandler, Cumpston, Li and Page85} Another limiting factor concerns the length of measurement: only one study measured follow-ups later than 3 months post-intervention. Future studies are encouraged to perform longer follow-ups to assess the retention of effects. Although statistical heterogeneity was non-significant in the majority of analyses and often controlled by the removal of outliers, differences in treatment and follow-up durations, inclusion criteria, study designs, outcome measures and immersion were observed, potentially leading to clinical heterogeneity not assessed in these analyses. Future reviews should plan further subgroup and meta-regression analyses. Similarly, the question remains as to the comparative efficacy and tolerability of immersive virtual reality-based versus less immersive screen-based approaches, which could not be answered in this review and may be assessed with future direct comparison trials. Many included samples were small, had heterogeneous outcomes and were monocentric, with one study enrolling only 19 participants,^{Reference Percie du Sert, Potvin, Lipp, Dellazizzo, Laurelli and Breton65} potentially introducing bias into effect sizes.^{Reference Lin86} Future trials should be designed as large, multicentric trials with standardised outcome measures to allow for greater comparability between studies. Finally, trials were overwhelmingly conducted in Western, Educated, Industrial, Rich and Democratic (WEIRD) countries,^{Reference Muthukrishna, Bell and Henrich87} limiting the generalisability of results for all populations reporting AVHs, especially considering the strong cultural aspects apparent in AVHs.^{Reference Khaled, Brederoo, Yehya, Alabdulla, Woodruff and Sommer88,Reference Larøi, Luhrmann, Bell, Christian, Deshpande and Fernyhough89}

In conclusion, AVATAR therapy showed efficacious and robust results for our primary outcome of the severity of AVH at post treatment, with moderate certainty of evidence. Effects were maintained into short-term follow-up, and AVH dimensions of both frequency and distress showed similar results. Efficacy profiles for other clinical and functional outcomes were mixed. These findings support the efficacy of AVATAR as a focused treatment for AVHs, although heterogeneity was apparent, and additional medium- and long-term evidence is required to assess the retention of effects.

Supplementary material

The supplementary material is available online at https://doi.org/10.1192/bjo.2026.11014

Data availability

Additional material is available in the online Supplementary Material. Data and scripts can be supplied upon reasonable request.

Acknowledgements

We would like to thank Professor Alex Leff for providing support and additional information on the original AVATAR trial. Additionally, we thank Dr Daniel Schulze and Dr Lars Schulze for answering early statistical questions.

Author contributions

F.O. contributed to study conceptualisation, data curation, formal analysis, methodology, project administration, visualisation and software, and wrote the original draft of the manuscript. S.H. contributed to study conceptualisation, validation and data curation, wrote the original draft of the manuscript and reviewed and edited the manuscript. P.W., L.F., M.S., N.T., C.-S.L. and B.S. reviewed and edited the manuscript. D.A. contributed to validation and reviewed and edited the manuscript. O.P.d.S., I.S. and L.B.G. contributed to data curation and reviewed and edited the manuscript. K.B. contributed to study conceptualisation, methodology, project administration, resources and supervision, and reviewed and edited the manuscript.

Funding

This research received no specific grant from any funding agency, commercial or not-for-profit sectors.

Declaration of interest

L.B.G., O.P.d.S., N.T. and I.S. were directly involved with included AVATAR trials. These authors played purely advisory roles and had no influence over methodology, or rating of bias and certainty of evidence. B.S. is supported by a National Institute for Health and Care Research Advanced Fellowship (grant number NIHR301206). B.S. is on the editorial board of Nature Exercise Science and Health, The Journal of Physical Activity and Health, Ageing Research Reviews, Mental Health and Physical Activity, The Journal of Evidence Based Medicine and The Brazilian Journal of Psychiatry. B.S. has received honorarium from a co-edited book on exercise and mental illness (Elsevier), an education course and unrelated advisory work from ASICS and FitXR Ltd. L.B.G. has received honoraria from Heka-VR for clinician training in the Challenge-virtual reality-assisted therapy intervention in Denmark and internationally, is engaged in other research collaborations with Heka-VR, and has received research funding for related virtual reality-based studies. N.T. has received funding from the Australian National Health and Medical Research Council for research into AVATAR therapy. K.B. is a co-founder of the two healthcare start-ups Kiso GmbH and Mental Hub. He has also received consultancy fees and/or honoraria for lectures and/or educational materials from Boehringer Ingelheim and Angelini, as well as from publishers and training institutes for workshops, books and lectures on psychotherapy. F.O., S.H., P.W., D.A., L.F., M.S. and C.-S.L. report no competing interests.

References

Beck, AT, Rector, NA. A cognitive model of hallucinations. Cogn Ther Res 2003; 27: 19–52.10.1023/A:1022534613005CrossRef Google Scholar

Maijer, K, Begemann, MJH, Palmen, SJMC, Leucht, S, Sommer, IEC. Auditory hallucinations across the lifespan: a systematic review and meta-analysis. Psychol Med 2018; 48: 879–88.10.1017/S0033291717002367CrossRef Google Scholar PubMed

McCarthy-Jones, S, Trauer, T, Mackinnon, A, Sims, E, Thomas, N, Copolov, DL. A new phenomenological survey of auditory hallucinations: evidence for subtypes and implications for theory and practice. Schizophr Bull 2014; 40: 231–5.10.1093/schbul/sbs156CrossRef Google Scholar PubMed

Woods, A, Jones, N, Alderson-Day, B, Callard, F, Fernyhough, C. Experiences of hearing voices: analysis of a novel phenomenological survey. Lancet Psychiatry 2015; 2: 323–31.10.1016/S2215-0366(15)00006-1CrossRef Google Scholar PubMed

Larøi, F, Bless, JJ, Laloyaux, J, Kråkvik, B, Vedul-Kjelsås, E, Kalhovde, AM, et al. An epidemiological study on the prevalence of hallucinations in a general-population sample: effects of age and sensory modality. Psychiatry Res 2019; 272: 707–14.10.1016/j.psychres.2019.01.003CrossRef Google Scholar

Toh, WL, Thomas, N, Hollander, Y, Rossell, SL. On the phenomenology of auditory verbal hallucinations in affective and non-affective psychosis. Psychiatry Res 2020; 290: 113147.10.1016/j.psychres.2020.113147CrossRef Google Scholar PubMed

Hasan, A, Falkai, P, Lehmann, I. Die aktualisierte S3-Leitlinie Schizophrenie: Entwicklungsprozess und ausgewählte Empfehlungen [Revised S3 guidelines on schizophrenia: developmental process and selected recommendations]. Nervenarzt 2020; 91: 26–33.10.1007/s00115-019-00813-yCrossRef Google Scholar

Keepers, GA, Fochtmann, LJ, Anzia, JM. The American Psychiatric Association Practice Guideline for the treatment of patients with schizophrenia. Am J Psychiatry 2020; 177: 868–72.10.1176/appi.ajp.2020.177901CrossRef Google Scholar PubMed

Kuipers, E, Yesufu-Udechuku, A, Taylor, C, Kendall, T. Management of psychosis and schizophrenia in adults: summary of updated NICE guidance. BMJ 2014; 348: g2234.10.1136/bmj.g1173CrossRef Google Scholar PubMed

Samara, MT, Nikolakopoulou, A, Salanti, G, Leucht, S. How many patients with schizophrenia do not respond to antipsychotic drugs in the short term? An analysis based on individual patient data from randomized controlled trials. Schizophr Bull 2019; 45: 639–46.10.1093/schbul/sby095CrossRef Google Scholar

Diniz, E, Fonseca, L, Rocha, D. Treatment resistance in schizophrenia: a meta-analysis of prevalence and correlates. Braz J Psychiatry 2023; 45: 448–58.Google Scholar PubMed

Mørup, MF, Kymes, SM, Oudin Åström, D. A modelling approach to estimate the prevalence of treatment-resistant schizophrenia in the United States. PLOS One 2020; 15: e0234121.10.1371/journal.pone.0234121CrossRef Google Scholar PubMed

Siskind, D, Orr, S, Sinha, S. Rates of treatment-resistant schizophrenia from first-episode cohorts: systematic review and meta-analysis. Br J Psychiatry 2022; 220: 115–20.10.1192/bjp.2021.61CrossRef Google Scholar PubMed

Bogers, JPAM, Hambarian, G, Walburgh Schmidt, N, Vermeulen, JM, Haan, L. Risk factors for psychotic relapse after dose reduction or discontinuation of antipsychotics in patients with chronic schizophrenia. A meta-analysis of randomized controlled trials. Schizophr Bull 2023; 49: 11–23.10.1093/schbul/sbac138CrossRef Google Scholar PubMed

Zipursky, RB, Menezes, NM, Streiner, DL. Risk of symptom recurrence with medication discontinuation in first-episode psychosis: a systematic review. Schizophr Res 2014; 152: 408–14.10.1016/j.schres.2013.08.001CrossRef Google Scholar PubMed

Leucht, S, Priller, J, Davis, JM. Antipsychotic drugs: a concise review of history, classification, indications, mechanism, efficacy, side effects, dosing, and clinical application. Am J Psychiatry 2024; 181: 865–78.10.1176/appi.ajp.20240738CrossRef Google Scholar PubMed

Goghari, VM, Harrow, M, Grossman, LS, Rosen, CA. 20-year multi-follow-up of hallucinations in schizophrenia, other psychotic, and mood disorders. Psychol Med 2013; 43: 1151–60.10.1017/S0033291712002206CrossRef Google Scholar PubMed

Heilbronner, U, Samara, M, Leucht, S, Falkai, P, Schulze, TG. The longitudinal course of schizophrenia across the lifespan: clinical, cognitive, and neurobiological aspects. Harv Rev Psychiatry 2016; 24: 118–28.10.1097/HRP.0000000000000092CrossRef Google Scholar PubMed

Nathou, C, Etard, O, Dollfus, S. Auditory verbal hallucinations in schizophrenia: current perspectives in brain stimulation treatments. Neuropsychiatr Treat 2019; 15: 2105–17.10.2147/NDT.S168801CrossRef Google Scholar PubMed

Bighelli, I, Salanti, G, Huhn, M, Schneider-Thoma, J, Krause, M, Reitmeir, C, et al. Psychological interventions to reduce positive symptoms in schizophrenia: systematic review and network meta-analysis. World Psychiatry 2018; 17: 316–29.10.1002/wps.20577CrossRef Google Scholar PubMed

Turner, DT, Burger, S, Smit, F, Valmaggia, LR, Gaag, M. What constitutes sufficient evidence for case formulation-driven CBT for psychosis? Cumulative meta-analysis of the effect on hallucinations and delusions. Eur Psychiatry 2020; 46: 1072–85.Google Scholar PubMed

Lincoln, TM, Jung, E, Wiesjahn, M, Schlier, B. What is the minimal dose of cognitive behavior therapy for psychosis? An approximation using repeated assessments over 45 sessions. Eur Psychiatry J Assoc Eur Psychiatry 2016; 38: 31–9.10.1016/j.eurpsy.2016.05.004CrossRef Google Scholar

Freeman, D. Persecutory delusions: a cognitive perspective on understanding and treatment. Lancet Psychiatry 2016; 3: 685–92.10.1016/S2215-0366(16)00066-3CrossRef Google Scholar PubMed

Smailes, D, Alderson-Day, B, Fernyhough, C, McCarthy-Jones, S, Dodgson, G. Tailoring cognitive behavioral therapy to subtypes of voice-hearing. Front Psychol 2015; 6: 1933.10.3389/fpsyg.2015.01933CrossRef Google Scholar PubMed

Leff, J, Williams, G, Huckvale, MA, Arbuthnot, M, Leff, AP. Computer-assisted therapy for medication-resistant auditory hallucinations: proof-of-concept study. Br J Psychiatry 2013; 202: 428–33.10.1192/bjp.bp.112.124883CrossRef Google Scholar PubMed

Craig, TKJ, Rus-Calafell, M, Ward, T, Leff, JP, Huckvale, M, Howarth, E, et al. AVATAR therapy for auditory verbal hallucinations in people with psychosis: a single-blind, randomised controlled trial. Lancet Psychiatry 2018; 5: 31–40.10.1016/S2215-0366(17)30427-3CrossRef Google Scholar PubMed

Rus-Calafell, M, Ward, T, Zhang, XC, Edwards, CJ, Garety, P, Craig, T. The role of sense of voice presence and anxiety reduction in AVATAR therapy. J Clin Med 2020; 9: 2748.10.3390/jcm9092748CrossRef Google Scholar PubMed

Rus-Calafell, M, Ehrbar, N, Ward, T, Edwards, C, Huckvale, M, Walke, J, et al. Participants’ experiences of AVATAR therapy for distressing voices: a thematic qualitative evaluation. BMC Psychiatry 2022; 22: 356.10.1186/s12888-022-04010-1CrossRef Google Scholar PubMed

Tsang, A, Bucci, S, Branitsky, A. The relationship between appraisals of voices (auditory verbal hallucinations) and distress in voice-hearers with schizophrenia-spectrum diagnoses: a meta-analytic review. Schizophr Res 2021; 230: 38–47.10.1016/j.schres.2021.02.013CrossRef Google Scholar PubMed

Spark, J, Pot-Kolder, R, Dzafic, I, Nelson, B, Byrne, LK, Lum, JAG. Virtual reality for the treatment of positive symptoms of psychosis: a meta-analysis of trials. Curr Treat Options Psychiatry 2025; 12: 10.10.1007/s40501-025-00350-3CrossRef Google Scholar

Zeka, F, Clemmensen, L, Valmaggia, L, Veling, W, Hjorthøj, C, Glenthøj, LB. The effectiveness of immersive virtual reality-based treatment for mental disorders: a systematic review with meta-analysis. Acta Psychiatr Scand 2025; 151: 210–30.10.1111/acps.13777CrossRef Google Scholar PubMed

Aali, G, Kariotis, T, Shokraneh, F. AVATAR therapy for people with schizophrenia or related disorders. Cochrane Database Syst Rev 2020; 5: CD011898.Google Scholar PubMed

Hsu, T-W, Tseng, P-T, Hsu, C-W, Yang, F-C, Changchien, T-C, Lin, Y-H, et al. AVATAR therapy for medication-resistant auditory hallucination in patients with psychosis: a systematic review and meta-analysis. Schizophrenia 2025; 12: 1.10.1038/s41537-025-00671-5CrossRef Google Scholar PubMed

Page, MJ, McKenzie, JE, Bossuyt, PM, Boutron, I, Hoffmann, TC, Mulrow, CD, et al. The PRISMA. 2020 statement: an updated guideline for reporting systematic reviews. BMJ 2021; 372: n71.10.1136/bmj.n71CrossRef Google Scholar PubMed

Bramer, WM, De Jonge, GB, Rethlefsen, ML, Mast, F, Kleijnen, J. A systematic approach to searching: an efficient and complete method to develop literature searches. J Med Libr Assoc 2018; 106: 531–41.10.5195/jmla.2018.283CrossRef Google Scholar PubMed

Ouzzani, M, Hammady, H, Fedorowicz, Z, Elmagarmid, A. Rayyan – a web and mobile app for systematic reviews. Syst Rev 2016; 5: 210.10.1186/s13643-016-0384-4CrossRef Google Scholar

Polanin, JR, Pigott, TD, Espelage, DL, Grotpeter, JK. Best practice guidelines for abstract screening large-evidence systematic reviews and meta-analyses. Res Synth Methods 2019; 10: 330.10.1002/jrsm.1354CrossRef Google Scholar

Cohen, J. A coefficient of agreement for nominal scales. Educ Psychol Meas 1960; 20: 37–46.10.1177/001316446002000104CrossRef Google Scholar

Maldonado, AD, Marzo, PF, Andrés, AM. DeltaMAN: Delta Measurement of Agreement for Nominal Data. CRAN R-Project, 2022 (https://cran.r-project.org/web/packages/DeltaMAN/index.html)10.32614/CRAN.package.DeltaMANCrossRef Google Scholar

Haddock, G, McCarron, J, Tarrier, N, Faragher, EB. Scales to measure dimensions of hallucinations and delusions: the psychotic symptom rating scales (PSYRATS). Psychol Med 1999; 29: 879–89.10.1017/S0033291799008661CrossRef Google Scholar PubMed

Guaiana, G, Abbatecola, M, Aali, G, Tarantino, F, Ebuenyi, ID, Lucarini, V, et al. Cognitive behavioural therapy (group) for schizophrenia. Cochrane Database Syst Rev 2022; 7: CD009608.Google Scholar

Mayer, SF, Corcoran, C, Kennedy, L, Leucht, S, Bighelli, I. Cognitive behavioural therapy added to standard care for first-episode and recent-onset psychosis. Cochrane Database Syst Rev 2024; 3: CD015331.Google Scholar PubMed

Cooper, AA, Conklin, LR. Dropout from individual psychotherapy for major depression: a meta-analysis of randomized clinical trials. Clin Psychol Rev 2015; 40: 57–65.10.1016/j.cpr.2015.05.001CrossRef Google Scholar PubMed

Riley, RD, Higgins, JPT, Deeks, JJ. Interpretation of random effects meta-analyses. BMJ 2011; 342: d549.10.1136/bmj.d549CrossRef Google Scholar PubMed

Hedges, LV. Distribution theory for glass’s estimator of effect size and related estimators. J Educ Stat 1981; 6: 107–28.10.3102/10769986006002107CrossRef Google Scholar

Cohen, J. Statistical Power Analysis for the Behavioral Sciences 2nd ed. Routledge, 1988.Google Scholar

Lin, L, Chu, H. Meta-analysis of proportions using generalized linear mixed models. Epidemiology 2020; 31: 713–7.10.1097/EDE.0000000000001232CrossRef Google Scholar PubMed

Weber, F, Knapp, G, Ickstadt, K, Kundt, G, Glass, Ä. Zero-cell corrections in random-effects meta-analyses. Res Synth Methods 2020; 11: 913–9.10.1002/jrsm.1460CrossRef Google Scholar PubMed

Higgins, JP, Eldridge, S, Li, T. Including variants on randomized trials. In Cochrane Handbook for Systematic Reviews of Interventions 1st ed. (eds Higgins, JPT, Thomas, J, Chandler, J, Cumpston, M, Li, T, Page, MJ, et al.): 569–93. Wiley, 2019.10.1002/9781119536604.ch23CrossRef Google Scholar

Cochran, WG. The combination of estimates from different experiments. Biometrics 1954; 10: 101–29.10.2307/3001666CrossRef Google Scholar

Higgins, JPT. Measuring inconsistency in meta-analyses. BMJ 2003; 327: 557–60.10.1136/bmj.327.7414.557CrossRef Google Scholar PubMed

Posit Team. RStudio: Integrated Development Environment for R. Posit Team, 2025 (https://posit.co/products/open-source/rstudio/?sid=1).Google Scholar

Viechtbauer, W. Conducting meta-analyses in R with the metafor package. J Stat Softw 2010; 36: 1–48.10.18637/jss.v036.i03CrossRef Google Scholar

Wickham, H, Averick, M, Bryan, J. Welcome to the tidyverse. J Open Source Softw 2019; 4: 1686.10.21105/joss.01686CrossRef Google Scholar

Viechtbauer, W, Cheung, MW. Outlier and influence diagnostics for meta-analysis. Res Synth Methods 2010; 1: 112–25.10.1002/jrsm.11CrossRef Google Scholar PubMed

Sterne, JAC, Savović, J, Page, MJ, Elbers, RG, Blencowe, NS, Boutron, I, et al. RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 2019; 366: l4898.10.1136/bmj.l4898CrossRef Google Scholar PubMed

McGuinness, LA, Higgins, JPT. Risk-of-bias VISualization (robvis): an R package and Shiny web app for visualizing risk-of-bias assessments. Res Synth Methods 2021; 12: 55–61.10.1002/jrsm.1411CrossRef Google Scholar

Dickersin, K, Min, Y-I. Publication bias: the problem that won’t go away. Ann N Y Acad Sci 1993; 703: 135–48.10.1111/j.1749-6632.1993.tb26343.xCrossRef Google Scholar PubMed

Egger, M, Davey Smith, G, Schneider, M, Minder, C. Bias in meta-analysis detected by a simple, graphical test. BMJ 1997; 315: 629–34.10.1136/bmj.315.7109.629CrossRef Google Scholar PubMed

Duval, S, Tweedie, R. Trim and fill: a simple funnel-plot-based method of testing and adjusting for publication bias in meta-analysis. Biometrics 2000; 56: 455–63.10.1111/j.0006-341X.2000.00455.xCrossRef Google Scholar PubMed

Guyatt, GH, Oxman, AD, Vist, GE. GRADE: an emerging consensus on rating quality of evidence and strength of recommendations. BMJ 2008; 336: 924–6.10.1136/bmj.39489.470347.ADCrossRef Google Scholar PubMed

McMaster University, Evidence Prime. GRADEpro Guideline Development Tool. Evidence Prime, 2025 (https://www.gradepro.org).Google Scholar

McHugh, ML. Interrater reliability: the kappa statistic. Biochem Medica 2012; 22: 276.10.11613/BM.2012.031CrossRef Google Scholar PubMed

Garety, PA, Edwards, CJ, Jafari, H. Digital AVATAR therapy for distressing voices in psychosis: the phase 2/3 AVATAR2 trial. Nat Med 2024; 30: 3658–68.10.1038/s41591-024-03252-8CrossRef Google Scholar PubMed

Percie du Sert, O, Potvin, S, Lipp, O, Dellazizzo, L, Laurelli, M, Breton, R, et al. Virtual reality therapy for refractory auditory verbal hallucinations in schizophrenia: a pilot clinical trial. Schizophr Res 2018; 197: 176–81.10.1016/j.schres.2018.02.031CrossRef Google Scholar PubMed

Stefaniak, I, Sorokosz, K, Janicki, A, Wciórka, J. Therapy based on avatar-therapist synergy for patients with chronic auditory hallucinations: a pilot study. Schizophr Res 2019; 211: 115–7.10.1016/j.schres.2019.05.036CrossRef Google Scholar PubMed

Dellazizzo, L, Potvin, S, Phraxayavong, K, Dumais, A. One-year randomized trial comparing virtual reality-assisted therapy to cognitive–behavioral therapy for patients with treatment-resistant schizophrenia. NPJ Schizophr 2021; 7: 9.10.1038/s41537-021-00139-2CrossRef Google Scholar PubMed

Liang, N, Li, X, Guo, X. Visual P300 as a neurophysiological correlate of symptomatic improvement by a virtual reality-based computer AT system in patients with auditory verbal hallucinations: a pilot study. J Psychiatr Res 2022; 151: 261–71.10.1016/j.jpsychires.2022.04.027CrossRef Google Scholar PubMed

Smith, LC, Vernal, DL, Mariegaard, LS, Christensen, AG, Jansen, JE, Schytte, G, et al. Immersive virtual reality-assisted therapy targeting persistent auditory verbal hallucinations in patients diagnosed with schizophrenia spectrum disorders in Denmark: the challenge assessor masked, randomized clinical trial. Lancet Psychiatry 2025; 12: 557–67.10.1016/S2215-0366(25)00161-0CrossRef Google Scholar

Burgess-Barr, S, Nicholas, E, Venus, B, Singh, N, Nethercott, A, Taylor, G, et al. International rates of receipt of psychological therapy for psychosis and schizophrenia: systematic review and meta-analysis. Int J Ment Health Syst 2023; 17: 8.10.1186/s13033-023-00576-9CrossRef Google Scholar PubMed

Correll, CU, Arango, C, Fagiolini, A, Giordano, GM, Leucht, S, Salazar de Pablo, G. Finding the right setting for the right treatment during the acute treatment of individuals with schizophrenia: a narrative review and clinical practice guideline. Neuropsychiatr Dis Treat 2024; 20: 1293–307.10.2147/NDT.S459450CrossRef Google Scholar PubMed

McDonagh, MS, Dana, T, Selph, S, Devine, EB, Cantor, A, Bougatsos, C, et al. Treatments for Schizophrenia in Adults: A Systematic Review. Agency for Healthcare Research and Quality, 2017 (https://www.ncbi.nlm.nih.gov/books/NBK487628/).10.23970/AHRQEPCCER198CrossRef Google Scholar PubMed

The GRADE Working Group. GRADE Handbook for Grading Quality of Evidence and Strength of Recommendations. The GRADE Working Group, 2013 (https://gdt.gradepro.org/app/handbook/handbook.html).Google Scholar

Borenstein, M, Hedges, LV, Higgins, JPT, Rothstein, HR. Introduction to Meta-Analysis 1st ed. Wiley, 2009.10.1002/9780470743386CrossRef Google Scholar

Smith, DA, Mar, CM, Turoff, BK. The structure of schizophrenic symptoms: a meta-analytic confirmatory factor analysis. Schizophr Res 1998; 31: 57–70.10.1016/S0920-9964(98)00009-7CrossRef Google Scholar PubMed

Longden, E, Branitsky, A, Sheaves, B, Chauhan, N, Morrison, AP. Preferred treatment outcomes in psychological therapy for voices: a comparison of staff and service-user perspectives. Psychosis 2024; 16: 107–17.10.1080/17522439.2023.2215298CrossRef Google Scholar

Sterne, JAC, Sutton, AJ, Ioannidis, JPA, Terrin, N, Jones, DR, Lau, J, et al. Recommendations for examining and interpreting funnel plot asymmetry in meta-analyses of randomised controlled trials. BMJ 2011; 343: d4002.10.1136/bmj.d4002CrossRef Google Scholar PubMed

Schütz, A, Salahuddin, NH, Priller, J, Bighelli, I, Leucht, S. The role of control groups in non-pharmacological randomised controlled trials of treatment-resistant schizophrenia: a systematic review and meta-analysis. Psychiatry Res 2024; 339: 116069.10.1016/j.psychres.2024.116069CrossRef Google Scholar PubMed

Heinig, I, Knappe, S, Hoyer, J. Effective – and tolerable: acceptance and side effects of intensified exposure for anxiety disorders. Behav Ther 2023; 54: 427–43.10.1016/j.beth.2022.11.001CrossRef Google Scholar PubMed

Varker, T, Jones, KA, Arjmand, H-A, Hinton, M, Hiles, SA, Freijah, I, et al. Dropout from guideline-recommended psychological treatments for posttraumatic stress disorder: a systematic review and meta-analysis. J Affect Disord Rep 2021; 4: 100093.Google Scholar

Cuijpers, P, Harrer, M, Miguel, C, Ciharova, M, Papola, D, Basic, D, et al. Cognitive behavior therapy for mental disorders in adults: a unified series of meta-analyses. JAMA Psychiatry 2025; 82: 563.10.1001/jamapsychiatry.2025.0482CrossRef Google Scholar PubMed

Salahuddin, NH, Schütz, A, Pitschel-Walz, G. Psychological and psychosocial interventions for treatment-resistant schizophrenia: a systematic review and network meta-analysis. Lancet Psychiatry 2024; 11: 545–53.10.1016/S2215-0366(24)00136-6CrossRef Google Scholar PubMed

Laws, KR, Darlington, N, Kondel, TK, McKenna, PJ, Jauhar, S. Cognitive behavioural therapy for schizophrenia – outcomes for functioning, distress and quality of life: a meta-analysis. BMC Psychol 2018; 6: 32.10.1186/s40359-018-0243-2CrossRef Google Scholar PubMed

Bighelli, I, Wallis, S, Reitmeir, C, Schwermann, F, Salahuddin, NH, Leucht, S. Effects of psychological treatments on functioning in people with schizophrenia: a systematic review and meta-analysis of randomized controlled trials. Eur Arch Psychiatry Clin Neurosci 2023; 273: 779–810.10.1007/s00406-022-01526-1CrossRef Google Scholar PubMed

Deeks, JJ, Higgins, JP, Altman, DG. Analysing data and undertaking meta-analyses. In Cochrane Handbook for Systematic Reviews of Interventions 1st ed. (eds Higgins, JPT, Thomas, J, Chandler, J, Cumpston, M, Li, T, Page, MJ, et al): Chapter 10. Wiley, 2019.Google Scholar

Lin, L. Bias caused by sampling error in meta-analysis with small sample sizes. PLOS One 2018; 13: e0204056.10.1371/journal.pone.0204056CrossRef Google Scholar PubMed

Muthukrishna, M, Bell, AV, Henrich, J. Beyond western, educated, industrial, rich, and democratic (WEIRD) psychology: measuring and mapping scales of cultural and psychological distance. Psychol Sci 2020; 31: 678–701.10.1177/0956797620916782CrossRef Google Scholar PubMed

Khaled, SM, Brederoo, SG, Yehya, A, Alabdulla, M, Woodruff, PW, Sommer, IEC. Cross-cultural differences in hallucinations: a comparison between middle eastern and European community-based samples. Schizophr Bull 2023; 49: S13.10.1093/schbul/sbac086CrossRef Google Scholar PubMed

Larøi, F, Luhrmann, TM, Bell, V, Christian, WA Jr, Deshpande, S, Fernyhough, C, et al. Culture and hallucinations: overview and future directions. Schizophr Bull 2014; 40: S213–20.10.1093/schbul/sbu012CrossRef Google Scholar PubMed

Table 1 Study characteristics

Fig. 1 Risk-of-bias assessments. Ext., extended; Brf., brief.

Table 2 Meta-analytic results for auditory verbal hallucination-related outcomes

Table 3 Meta-analytic subgroup and moderator results for auditory verbal hallucination-related outcomes

Fig. 2 Forest plot showing meta-analytic results for AVH severity at post-treatment and follow-up. Negative SMDs portray smaller means in AVATAR compared with control groups. ⊕, Low bias; ⊖, high bias; AVATAR, Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations; AVH, auditory verbal hallucination; Brf., brief; CBT, cognitive–behavioural therapy; Ext., extended; I2, Higgins’ heterogeneity statistic; k, number of comparisons; n, sample size; Q, Cochran’s Q-statistic; Qp, Cochran’s Q p-value; SMD, standardised mean difference in Hedges’ g; TAU, treatment as usual.66

Table 4 Meta-analytic results for clinical and functional outcomes

Table 5 Meta-analytic drop-out results

Opper et al. supplementary material

DOI: https://doi.org/10.1192/bjo.2026.11014.sm001

File 5.7 MB

Submit a response

eLetters

No eLetters have been published for this article.

Article contents

Audio Visual Assisted Therapy Aid for Refractory Auditory Hallucinations (AVATAR) therapy for voice hearers: systematic review and meta-analysis

Abstract

Keywords

Information

Method

Search strategy

Inclusion criteria

Data extraction, synthesis and effect sizes

Bias assessments, publication bias and quality of evidence

Results

Study selection

Study and sample characteristics

AVATAR therapy protocols

Risk of bias

Meta-analysis and systematic review results

Primary and secondary AVH outcomes: AVH severity, frequency and distress

Secondary outcomes: clinical and functional outcomes

Secondary outcomes: tolerability and acceptability

Publication bias

Discussion

Limitations

Supplementary material

Data availability

Acknowledgements

Author contributions

Funding

Declaration of interest

References

Opper et al. supplementary material

eLetters

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests