Impact statement
A co-designed, app-supported version of the World Health Organization Thinking Healthy Programme (THP-TAP) achieved perinatal depression outcomes comparable to standard delivery, supporting scalable and equitable access. Mediation analysis indicated that perceived social support, from significant others, family and friends, was the sole validated pathway to symptom improvement, with a larger indirect effect in the digital THP-TAP than in face-to-face delivery. Neither greater session attendance nor additional cognitive-behavioural therapy skill content conferred extra benefits.
Therefore, digital enhancements should prioritise mobilising and sustaining real-world support networks (family engagement, peer connectors and community prompts) rather than increasing platform “dose” or expanding curricula.
Background
Perinatal depression is a global health priority because it compromises maternal well-being, infant neuro-development and long-term human capital, especially in low- and middle-income countries (LMICs) (Gelaye et al., Reference Gelaye, Rondon, Araya and Williams2016; Atif et al., Reference Atif, Halaki, Raynes-Greenow and Chow2021). Pakistan illustrates the magnitude of the challenge with community prevalence estimates of 25–37%, far exceeding regional averages, yet specialist mental-health resources are scarce and primary-care services are already overstretched (Gelaye et al., Reference Gelaye, Rondon, Araya and Williams2016). Consequently, the task-sharing strategies employing lay health workers to deliver evidence-based psychosocial care have become central to maternal-health policy in LMICs.
The Thinking Healthy Programme (THP) is the most extensively tested task-shared intervention for perinatal depression in LMICs (Rahman et al., Reference Rahman, Malik, Sikander, Roberts and Creed2008). It integrates brief cognitive-behavioural therapy (CBT) techniques (behavioural activation, thought challenging and problem solving) with strategies to mobilise family support. Early cluster-randomised trials in Pakistan demonstrated large effect sizes when the standard World Health Organisation THP (WHO-THP) was delivered face-to-face by educated Lady Health Workers (LHWs), a government cadre with 15 months of formal training (Rahman et al., Reference Rahman, Malik, Sikander, Roberts and Creed2008). However, scaling up the LHW-delivered THP proved challenging, as LHWs are already responsible for family planning, vaccination and child health services (Hafeez et al., Reference Hafeez, Mohamud, Shiekh, Shah and Jooma2011). Attempts to scale THP through peer volunteers, however, revealed recurrent implementation hazards (Zafar et al., Reference Zafar, Sikander, Hamdani, Atif, Akhtar, Nazir, Maselko and Rahman2016; Atif et al., Reference Atif, Krishna, Sikander, Lazarus, Nisar, Ahmad, Raman, Fuhr, Patel and Rahman2017; Rahman et al., Reference Rahman, Akhtar, Hamdani, Atif, Nazir, Uddin, Nisar, Huma, Maselko, Sikander and Zafar2019; Sikander et al., Reference Sikander, Ahmad, Atif, Zaidi, Vanobberghen, Weiss, Nisar, Tabana, Ain, Bibi, Bilal, Bibi, Liaqat, Sharif, Zulfiqar, Fuhr, Price, Patel and Rahman2019; Rahman et al., Reference Rahman, Waqas, Nisar, Nazir, Sikander and Atif2021; Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023).
To preserve fidelity while easing workforce pressure, our multidisciplinary team co-produced with peers with lived experience, a digital adaptation of the WHO-THP in which animated, culturally tailored storylines present the CBT content on an Android application that functions offline (Atif et al., Reference Atif, Nazir, Sultan, Rauf, Waqas, Malik, Sikander and Rahman2022). Crucially, the delivery agents are peers with lived maternal depression experience rather than LHWs. Automation serves three system-level purposes: (i) standardised multimedia modules insulate the active CBT ingredients from programme drift; (ii) interactive decision rules, pause prompts and competence checks provide “in-session” supervision, mitigating voltage drop when external oversight is scarce; and (iii) offloading psycho-education to the device shortens initial training and reduces the specialist supervisory burden.
From a health-systems perspective, this redesign realigned task-sharing with operational reality. Economic modelling indicated that, once tablets are amortised, mean delivery costs per treated woman fall below those of the LHW-only model because peer stipends are modest and ongoing specialist supervision is largely virtual (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023). Yet health-policy endorsement requires clear evidence that THP-TAP is at least as clinically effective as traditional WHO-THP delivered by educated LHWs. A single-blind, cluster-randomised non-inferiority trial in 70 rural villages addressed this question in a large non-inferiority cluster randomised controlled trial (cRCT) (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023) demonstrated that digital THP achieved 3-month post-partum remission rates of 92.3% in the technology-assisted peer arm versus 83.5% in the standard LHW arm. The digital THP simultaneously reduced supervisory demands, freeing scarce professional time for other public health priorities.
The primary policy question of effectiveness in real-world settings thus appears resolved, but the scientific question of how the two delivery models achieve their effects remains open. Competing theoretical perspectives attribute change to specific CBT skill acquisition to common therapeutic factors (empathy and hope) or to augmented social support (Huibers et al., Reference Huibers, Lorenzo-Luaces, Cuijpers and Kazantzis2021). Clarifying which mechanisms dominate in each modality is not merely academic. It has real-world implications for refining the digital THP – for instance, if automated CBT dosage is the principal mediator, future iterations should prioritise multimedia refinement; if peer warmth and network activation drive outcomes, investment should focus on recruiting and retaining high-empathy counsellors (Cuijpers, Reference Cuijpers2019; Huibers et al., Reference Huibers, Lorenzo-Luaces, Cuijpers and Kazantzis2021; Waqas et al., Reference Waqas, Zafar, Akhtar, Naveed and Rahman2023; Waqas et al., Reference Waqas, Sikander, Malik, Atif and Rahman2024; Liaquat et al., Reference Liaquat, Waqas, Qadeer, Malik, Atif, Sikander, Wang and Rahman2025; Seward et al., Reference Seward, Garman, Davies, Hanlon, MacBeth, Stewart, Waqas, Araya and Lund2025).
Accordingly, our study aims to elucidate the mechanisms of change in THP-TAP by testing a causal mediation model that traces symptom reduction through both CBT-specific factors (acquisition and enactment of skills – behavioural activation, problem-solving and cognitive restructuring – measured via session-level adherence ratings and patient-reported activation) and common factors (peer–client empathic alliance and mobilisation of social support). Elucidating these pathways will move the field beyond “black-box” evaluations (Mulder et al., Reference Mulder, Murray and Rucklidge2017), inform optimisation of both digital peer-facilitated and face-to-face LHW delivery platforms and generate actionable knowledge about the active ingredients of low-intensity CBT for perinatal depression in LMIC health system settings.
Methods
Study design and rationale
This mediation analysis builds upon a completed cluster RCT carried out in three subdistricts of Rawalpindi District, Punjab, Pakistan – Kallar Syeddan, Gujar Khan and Rawalpindi. It utilised a non-inferiority, stratified design comparing two delivery models of the WHO-THP: the standard THP delivered by LHWs and the THP-TAP delivered by trained community peers (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023). Given previous evidence confirming that LHW-delivered WHO-THP significantly reduces perinatal depression (Rahman et al., Reference Rahman, Malik, Sikander, Roberts and Creed2008), this study aimed to determine whether the peer-delivered technology-assisted THP was at least equally effective (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023). Details regarding the study design can be read in the protocol published elsewhere (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023).
Participant recruitment and eligibility
Participants for this mediation analysis were women screened in the parent trial. Pregnant women in their second or third trimester identified through LHW registers were screened using the Urdu-translated Structured Clinical Interview for DSM-IV (SCID) to diagnose major depressive episodes (MDEs), a method extensively validated in previous studies in Pakistan (Rahman et al., Reference Rahman, Malik, Sikander, Roberts and Creed2008). Eligible participants were women aged 18 years or older, diagnosed with a current Diagnostic and Statistical Manual of Mental Disorders, 4th Edition (DSM-IV) MDE and planning to remain in the study area for at least 1 year. Women requiring inpatient psychiatric or medical care, unable to communicate in Urdu or having severe medical or psychiatric comorbidities necessitating specialised treatment were excluded. Participation was voluntary, and informed written consent was obtained after participants had a 24-h period to review study details.
Ethical considerations
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2013. Ethical approval was secured from the Research Ethics Committees of the Human Development Research Foundation (Pakistan), the University of Liverpool (ethical approval: UoL001668) and the National Bioethics Committee (Pakistan). Individual-level informed consent and village-level permission from the District Health Office were mandatory. Participant confidentiality and well-being were prioritised, with distress managed through empathetic communication and, when necessary, referrals to specialised mental health or social services. The University of Liverpool sponsored the trial and provided insurance coverage consistent with international clinical research standards. Confidentiality protocols ensured all personal information was securely stored, with no identifying details disclosed in any publications. Participants were informed that they could withdraw at any time without affecting their routine healthcare access.
Masking and baseline assessments
While participants and intervention providers could not be blinded to allocation, outcome assessors remained masked to participants’ treatment assignment. Participants were instructed not to disclose their assigned treatment arm during assessments, and any inadvertent disclosures were documented. Baseline assessments captured demographic data (age, education and socioeconomic status), obstetric history, depression severity, social support and domestic violence, conducted privately either in participants’ homes or at designated LHW Health Houses.
Intervention details
Interventions implemented in the parent trial included the standard WHO-THP, a manualized intervention based on CBT principles, delivered by LHWs through eight structured sessions from late pregnancy to 3 months postpartum (Rahman et al., Reference Rahman, Malik, Sikander, Roberts and Creed2008; Atif et al., Reference Atif, Nazir, Sultan, Rauf, Waqas, Malik, Sikander and Rahman2022; Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023). LHWs were trained over 5 days in communication skills, role-play exercises and the THP manual protocols. The THP-TAP was delivered by trained community peers via a multimedia Android application designed using Human-Centered Design methods. This app transformed WHO-THP content into culturally adapted animated modules. Peers, nominated from local communities, received a 3-day training covering communication techniques, CBT elements and operation of the tablet-based app. Animated modules included scripted scenarios and interactive “pause points,” enabling peer facilitators to contextualise content for participants’ everyday environments. Regular monthly supervision ensured intervention fidelity. Details regarding the development of the digital intervention are published elsewhere (Atif et al., Reference Atif, Nazir, Sultan, Rauf, Waqas, Malik, Sikander and Rahman2022).
Measurement of depression and mediators
Depressive symptoms were assessed using the Patient Health Questionnaire (PHQ-9), which has been validated in Pakistan with a cut-off score of ≥10 denoting probable depression (19). Administered at baseline, and 3 and 6 months post-partum, the PHQ-9 allowed us to capture continuous change in symptom severity and was preferred over a categorical SCID diagnosis because its greater variance enhances the statistical power of mediation analyses (Kroenke et al., Reference Kroenke, Spitzer and Williams2001).
Measures for mediators
Mediator selection was grounded in the cognitive-behavioural framework underpinning the WHO-THP, which posits that changes in cognition, behaviour and interpersonal support drive improvements in perinatal depressive symptoms (Beck, Reference Beck2020). Accordingly, mediators were selected based on their theoretical relevance to core CBT processes targeted by the intervention (e.g., behavioural activation, problem-solving and cognitive restructuring) and evidence on mechanisms of change in low-intensity psychosocial interventions.
A mediator was defined as a variable on the causal pathway between the initial exposure to the intervention and the outcome (mean PHQ-9 score). It is therefore a variable that is causally influenced by the intervention and, in turn, causally influences the outcome (Lee et al., Reference Lee, Cashin, Lamb, Hopewell, Vansteelandt, VanderWeele, MacKinnon, Mansell, Collins, Golub and McAuley2021). The selection of mediators was grounded in the theoretical framework underpinning the THP-TAP intervention and prior empirical evidence regarding key mechanisms of change in low-intensity psychosocial interventions (Table 1). Mediators were included in the final model if they were influenced by the intervention, influenced the outcomes or were associated with another mediator (p < 0.05). Using the above criteria for our two types of mediators, we can capture any potential mediator that is influenced by the intervention and influences either another mediator or the improvement in depressive symptoms.
Table 1. Proposed mediators of THP-TAP

Details of the mediators are given below:
Number of sessions attended (M1): Participants who agreed to participate in the trial did not necessarily attend all sessions. We anticipated that the “dose” of the intervention would not only influence the extent to which symptoms of depression were reduced, but also influence other mediators on the pathway to reduced symptoms of depression. To capture this effect, we created a mediator to reflect the number of sessions (M1: 0–8 sessions) attended in both the intervention and the control arms
Patient activation (M2): Empowering individuals to take an active role in managing their mental health is central to behavioural activation approaches. The THP-TAP intervention was designed to enhance motivation and engagement through structured, peer-supported sessions. We used the PREMIUM Abbreviated Activation Scale, a five-item measure adapted from the longer Behavioural Activation for Depression Scale (Manos et al., Reference Manos, Kanter and Luo2011; Singla et al., Reference Singla, MacKinnon, Fuhr, Sikander, Rahman and Patel2021). This instrument has been validated in similar low-resource settings and captures the degree to which individuals feel capable of engaging in activities that support their well-being.
Perceived social support (M3): Low levels of perceived social support have consistently been associated with increased vulnerability to perinatal depression (Beck, Reference Beck2001; Dennis et al., Reference Dennis, Janssen and Singer2004; Norhayati et al., Reference Norhayati, Hazlina, Asrenee and Emilin2015; Gelaye et al., Reference Gelaye, Rondon, Araya and Williams2016). In contexts of social adversity, such as those experienced by many women in Pakistan, stigma, poverty and limited family support may exacerbate feelings of isolation. The THP-TAP intervention encouraged peer-led engagement and emotional sharing, potentially enhancing perceived support. We assessed perceived social support using the culturally adapted version of the Multidimensional Scale of Perceived Social Support (Zimet et al., Reference Zimet, Dahlem, Zimet and Farley1988). This 12-item scale evaluates support across three domains – family, friends and a significant other – with items rated on a seven-point Likert scale. Higher scores indicate stronger perceived support.
Problem-solving confidence (M4a): Effective problem-solving is a core skill taught in the THP, enabling participants to break down overwhelming problems and generate actionable solutions. We assessed problem-solving confidence using the 11-item confidence subscale of the Problem-Solving Inventory (Heppner and Petersen, Reference Heppner and Petersen1982). This subscale reflects participants’ self-efficacy in applying learned strategies to real-life challenges, an important step in translating intervention content into behavioural change.
Cognitive restructuring skills (M4b): Cognitive restructuring involves identifying and challenging unhelpful thoughts, a key mechanism in CBT-based interventions. The THP-TA intervention integrates these strategies in a culturally relevant and simplified format, supported through the digital app and peer facilitation. This construct was measured using the cognitive restructuring subscale of the Frequency of Actions and Thoughts Scale (Terides et al., Reference Terides, Dear, Karin, Jones, Gandy, Fogliati, Kayrouz, Staples and Titov2016). This subscale includes three items evaluating the frequency with which participants engaged in reframing negative cognitions and adopting more adaptive thought patterns.
Empathy and compassion from peer therapists (M5): A strong therapeutic relationship, often marked by empathy and compassion, is a known facilitator of psychological improvement, particularly in task-shared interventions. Peer therapists trained in the THP-TA programme were equipped with skills to convey empathy and foster trust with participants. To assess this dimension, we used the Empathy Scale for Lay Therapists, a 12-item instrument developed and validated through expert consensus, including psychiatrists, psychologists and service user representatives (Liaquat et al., Reference Liaquat, Waqas, Qadeer, Malik, Atif, Sikander, Wang and Rahman2025). The scale captures participants’ perception of the emotional attunement and compassionate care provided by their peer therapist.
Mediator-outcome confounders: Due to the randomised nature of the exposure, it is not necessary to account for confounders of the association between exposure and outcome. However, for the current analyses, it was necessary to account for baseline confounders (unaffected by the intervention) that were associated with the mediator and/or the outcome. These mediator-outcome confounders can generate spurious correlations between the mediator and outcome when unadjusted for, potentially distorting these associations. We considered more than 30 potential baseline characteristics as potential confounders. Baseline values of all the mediators included in the models. Selection of our mediators was guided through DAGGITY, which help to remove colliders that can potentially bias our estimates (Shrier and Platt, Reference Shrier and Platt2008; Textor et al., Reference Textor, Hardt and Knuppel2011; Tennant et al., Reference Tennant, Murray, Arnold, Berrie, Fox, Gadd, Harrison, Keeble, Ranker, Textor, Tomova, Gilthorpe and Ellison2021).
Statistical analysis
General
To better understand how the intervention influenced the mediators to improve symptoms of depression (Figure 1), we compare mediators and mediator outcome confounders with exposure to the intervention (control vs. intervention arm). To facilitate the comparison between mediators and mediator outcome confounders with recovery from depression, we dichotomised the outcome of mean PHQ-9 scores to recover/not recover from depression (PHQ-9 > 10 = not recovered).

Figure 1. Causal mediation model demonstrating the proposed mediating pathways through which the ENHANCE intervention may improve symptoms of depression.
*M3: Perceived social support yielded statistically significant indirect mediaton effects: B = –0.072 (95% CI: –0.170, –0.018).
Mediation analyses
We aimed to investigate the extent to which symptoms of depression measured at 6 months by the PHQ-9 questionnaire were explained by the above-listed mediators (Figure 1). To achieve this, we used an interventional (in)direct effects approach to mediation analysis that was used to understand population-level effects (Vansteelandt and Daniel, Reference Vansteelandt and Daniel2017). This approach provides an advanced causal inference framework that overcomes limitations of other mediation approaches, such as Structural Equation Modelling, that assume all mediators operate along distinct pathways or that all relationships are linear (Loh et al., Reference Loh, Moerkerke, Loeys and Vansteelandt2020). Importantly, this approach also allowed us to test for moderated mediation, moderation by mediator-outcome confounders captured at baseline and exposure to the intervention on our outcome, as well as moderation by baseline variables and exposure to the intervention on our mediators. Findings are described following A Guideline for Reporting Mediation Analyses (AGReMA) guidelines for reporting mediation analyses (Lee et al., Reference Lee, Cashin, Lamb, Hopewell, Vansteelandt, VanderWeele, MacKinnon, Mansell, Collins, Golub and McAuley2021).
Estimation and model fit
Our analyses were conducted using a series of regression models for mediators and outcomes that included non-linear terms and interactions between treatment allocation (intervention arm/control arm), mediator-outcome confounders and mediators. Estimation of Interventional Indirect Effects was based on Monte Carlo integration using a 1,000-fold expanded dataset (Vansteelandt and Daniel, Reference Vansteelandt and Daniel2017). The expanded dataset was created in five steps (Supplementary Material). The bootstrap also accounted for clustering at the primary health clinics. Bias-corrected confidence intervals (CIs) were based on non-parametric bootstrapping with 1,000 resamples (Vansteelandt and Daniel, Reference Vansteelandt and Daniel2017).
Briefly, we decomposed the total effect of the THP-TAP intervention into interventional indirect effects via each of the posited mediators as well as via the direct effect (e.g., no mediation). Our regression models simultaneously included all mediators, mediator-outcome confounders and relevant interactions and non-linearities. The indirect effect via a particular mediator can be interpreted as the average change in the potential outcome (symptom of depression at 6 months) resulting from shifting the counterfactual distribution of that mediator from the exposed status (e.g., levels of social support in the intervention arm) to the unexposed status (levels of social support in the control arm), while setting each of the remaining mediators to random draws from either the exposed or unexposed group, depending on the specific decomposition.
Assumptions
Interventional Indirect Effects have important underlying assumptions that influence the validity of findings if violated: consistency, no interference and no unmeasured confounders (Vansteelandt and Daniel, Reference Vansteelandt and Daniel2017). Due to the randomised nature of the study, assumptions of no unmeasured confounders between treatment arm and each mediator, and between treatment arm and outcome, are fulfilled. However, there is a possibility of unmeasured mediator-outcome confounders. To address this, we constructed a Directed Acyclic Graph identifying all potential measured and unmeasured confounders. As the trial captured multiple potential confounders, we anticipated that the chance of unmeasured mediator-outcome confounders was small. The “no interference” assumption infers that one participant’s outcome does not influence another participant’s outcome – for example, in a group-psychological intervention for perinatal depression, a large reduction in symptoms for one woman may influence other women to also experience symptom reduction. Given that the parent trial was a clustered randomised trial, we assumed that contact between participants was minimal. The consistency assumption states that an individual’s potential outcome under their observed exposure history is precisely their observed outcome (Hernán and VanderWeele, Reference Hernán and VanderWeele2011; VanderWeele and Hernán, Reference VanderWeele and Hernán2013). Consistency is usually guaranteed in randomised experiments because the application of exposure to the participant is under the control of the investigators (Cole and Frangakis, Reference Cole and Frangakis2009). Given that the intervention was delivered via a digital tool will help to ensure the consistency assumption was upheld.
Missing data
At baseline, 980 women were recruited to the study. However, there were only 823 (84.3%) women with complete data for this analysis. Due to the non-random nature of missing data (e.g., stillbirth and neonatal death), this mediation analysis includes complete data only.
Please see Supplementary File for further details on the methods.
Results
General
Please refer to Supplementary Table 1 for a detailed overview of participants’ baseline characteristics and to Supplementary Figure 1 for the CONSORT flow diagram.
Table 2 below demonstrates that the mediators and mediator-outcome confounders were generally balanced between treatment arms. Perceived levels of social support appeared to be greater at baseline in the WHO-THP arm, shifting to be greater at 3 months in the THP-TAP arm. Women in the WHO-THP arm also had slightly higher levels of education, compared to women in the THP-TAP. Lastly, women in the THP-TAP arm had a higher mean income, compared to women in the WHO-THP arm.
Table 2. Unadjusted comparison between mediators and mediator-outcome confounders and allocation to the control and intervention arm using complete data

a The PREMIUM Abbreviated Activation Scale. All five items are assessed on a scale of 0 (“not at all”) to 5 (“yes, completely”) for a total continuous score of 25. Higher scores indicated increased levels of activation and avoidance (Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023).
b Multidimensional Scale of Perceived Social support (MSPSS). The MSPSS scores range from 12 to 84. Higher scores indicating increased levels of social support (Zimet et al., Reference Zimet, Dahlem, Zimet and Farley1988).
c Problem-solving confidence subscale of the Problem-solving Inventory (7-items, score range 7–28). Higher scores indicate higher levels of problem solving skills (Heppner and Petersen, Reference Heppner and Petersen1982).
d Cognitive restructure subscale of the Frequency of Actions and Thoughts Scale. Scores range from 0 to 12. Higher scores indicate higher levels of cognition (Terides et al., Reference Terides, Dear, Karin, Jones, Gandy, Fogliati, Kayrouz, Staples and Titov2016).
e Empathy Scale for Lay Therapists 12-items. Score range from 0 to 36. Higher scores indicate higher levels of empathy (Liaquat et al., Reference Liaquat, Waqas, Qadeer, Malik, Atif, Sikander, Wang and Rahman2025).
f Patient Health Questionnaire 9-items (PHQ-9). Each of 9 items scored 0–3 → total range = 0–27. Higher scores indicate more severe forms of depression (Kroenke et al., Reference Kroenke, Spitzer and Williams2001).
g Generalised Anxiety Disorder 7-Items (GAD-7). Total range = 0–21. Higher scores indicated increased levels of anxiety (Spitzer et al., Reference Spitzer, Kroenke, Williams and Löwe2006).
h WHO Disability Assessment Schedule 2.0 (WHO-DAS). Scores ranged from 0 to 52. Higher scores indicate increased disability (Ustün et al., Reference Ustün, Chatterji, Kostanjsek, Rehm, Kennedy, Epping-Jordan, Saxena, von Korff and Pull2010).
Table 3 below demonstrates differences between some mediators and mediator-outcome confounders in recovery from depression (PHQ-9 < 10) at the 6-month follow-up visit. Levels of activation and avoidance, perceived social support, cognitive restructuring and empathy from delivery agent were higher in women who recovered from depression. At baseline, women with higher levels of education, monthly family income and cohabiting with a partner/husband were more likely to recover from depression. At baseline, women with a family history of a psychiatric disorder, history of preterm deliveries, higher mean duration of pregnancy, levels of symptoms of depression and anxiety and levels of disability were less likely to recover from depression.
Table 3. Unadjusted comparison between mediators and mediator-outcome confounders with recovery from depression (PHQ-9 < 10) at the 6-month follow-up using complete data in the intervention arm only

a The PREMIUM Abbreviated Activation Scale. Higher scores indicated increased levels of activation and avoidance.
b Multidimensional Scale of Perceived Social support (MSPSS). Higher scores indicating increased levels of social support (Zimet et al., Reference Zimet, Dahlem, Zimet and Farley1988).
c Problem-solving confidence subscale of the Problem-solving Inventory. Higher scores indicate higher levels of problem solving skills (Heppner and Petersen, Reference Heppner and Petersen1982).
d Cognitive restructure subscale of the Frequency of Actions and Thoughts Scale. Higher scores indicate higher levels of cognition (Terides et al., Reference Terides, Dear, Karin, Jones, Gandy, Fogliati, Kayrouz, Staples and Titov2016).
e Empathy scale for Lay Therapists 12-items. Higher scores indicate higher levels of empathy (Liaquat et al., Reference Liaquat, Waqas, Qadeer, Malik, Atif, Sikander, Wang and Rahman2025).
f Patient Health Questionnaire 9-items (PHQ-9). Higher scores indicate more severe forms of depression (Kroenke et al., Reference Kroenke, Spitzer and Williams2001).
g Generalised Anxiety Disorder 7-Items (GAD-7). Higher scores indicated increased levels of anxiety (Spitzer et al., Reference Spitzer, Kroenke, Williams and Löwe2006).
h WHO Disability Assessment Schedule 2.0 (WHO-DAS). Higher scores indicate increased disability (Ustün et al., Reference Ustün, Chatterji, Kostanjsek, Rehm, Kennedy, Epping-Jordan, Saxena, von Korff and Pull2010).
Mediation analyses
There was some evidence of moderated mediation whereby women with more severe symptoms of perinatal depression at baseline and lower levels of perceived social support (M3) at the 3-month follow-up had more severe symptoms of depression at the 6-month follow-up (p = 0.050). Similar moderated mediation was also suggested for empathy (M5). Specifically, women with more severe symptoms of depression at baseline and lower levels of empathy (M5) at the 3-month follow-up had more severe symptoms of perinatal depression at the 6-month follow-up compared to women with fewer symptoms of depression (p = 0.05).
Table 4 describes the total causal effect, and the contribution of each of the indirect effects and the direct effect to the total causal effect. At 6 months, of the total mean difference in PHQ-9 scores between the intervention and control arm (adjusted mean difference in PHQ-9 scores: −0.214, bias-corrected 95% CI: −0.835, 0.437), 33% was mediated through perceived social support (M3 [adjusted mean difference in PHQ-9 scores attributable to perceived social support: −0.072, −0.170, −0.018], Table 4). In practice, this suggests that women receiving the digital intervention, THP-TAP, delivered by a peer had increased levels of social support compared to women who received the WHO-THP intervention by the LDWs, which mediated an improvement in symptoms of depression. There was no evidence that the ENHANCE intervention improved symptoms of depression through any of our other mediators. This suggests that the THP-TAP and the WHO-THP were similar with respect to their effects on the potential mediators.
Table 4. Total causal effect and interventional in(direct) effects for the ENHANCE programme at 6 months

a Estimates have been adjusted for mediator-outcome confounders captured at baseline including: PHQ-9 scores, levels of perceived social support, parity, income, education, living in a nuclear family, having a family history of mental illness, cohabiting with partner/husband, have a previous preterm birth, number of people living in the household, age, levels of anxiety and levels of disability.
b Interactions between mediator-outcome confounders includes perceived levels of social support (M3) and baseline PHQ-9 scores, empathy (M5) and baseline PHQ-9 scores.
c Estimation for the different effects was based on Monte Carlo integration using 1,000-fold expanded dataset.
d Bias-corrected confidence intervals were based on nonparametric bootstrap with 1,000 resamples adjusting for clustering.
Discussion
This study reports mediation analysis to determine whether THP-TAP shares the same CBT-specific and common-factor mechanisms as the standard established WHO-THP, or whether it achieves its effects via different pathways. Three main findings emerge. First, depressive symptoms at 6 months were statistically indistinguishable between arms, confirming the non-inferiority of the digital technology-assisted peer-delivered innovation for sustained mood outcomes. Second, exposure to the digital THP-TAP led to improvements in symptoms of depression mediated by perceived social support, compared to exposure to the WHP-THP. This underscores the technology-assisted format’s enhanced capacity to mobilise social networks in recovery. Third, there was no difference in symptom reduction for all other mediators, including the number of sessions attended, empathy and putative CBT-specific mediators (behavioural activation, problem-solving and cognitive restructuring).
Interpretation in relation to previous work
These mediation findings provide nuanced insights beyond our previous cRCT (Rahman et al., Reference Rahman, Malik, Nazir, Zaidi, Nisar, Waqas, Atif, Gibbs, Luo, Sikander and Wang2025), revealing that the THP-TAP achieves non-inferiority through mechanisms that differ subtly from those of the WHO-THP model. Whereas the standard THP relies on LHWs, THP-TAP employs peers embedded in the same villages. This proximity plausibly enhances credibility, trust and the peers’ capacity to mobilise husbands, mothers-in-law and neighbours to support behavioural activation homework and problem solving (Atif et al., Reference Atif, Krishna, Sikander, Lazarus, Nisar, Ahmad, Raman, Fuhr, Patel and Rahman2017; Rahman et al., Reference Rahman, Waqas, Nisar, Nazir, Sikander and Atif2021; Rahman et al., Reference Rahman, Malik, Atif, Nazir, Zaidi, Nisar, Waqas, Sharif, Chen, Wang and Sikander2023). The modest but significant perceived social support pathway is therefore theoretically coherent and concordant with prior qualitative work in Pakistan and India, where women emphasised “feeling understood” and “practical help from family” as critical to recovery (3, 6). These results align with earlier mediation analyses of the WHO-TAP and its peer-delivered adaptation Thinking Healthy Programme-Peer delivered (THPP), where patient activation and social support, but not mother–child attachment, were identified as key mediators of treatment effect in trials conducted in India and Pakistan (Singla et al., Reference Singla, MacKinnon, Fuhr, Sikander, Rahman and Patel2021; Elias et al., Reference Elias, Seward and Lund2024).
Our findings are also consistent with the growing evidence base supporting the delivery of psychotherapeutic interventions through digital and remote modalities for perinatal populations. For instance, the Scaling Up Maternal Mental healthcare by Increasing access to Treatment (SUMMIT) trial demonstrated that telemedicine-delivered behavioural activation was non-inferior to in-person psychotherapy for perinatal depression and anxiety across a range of outcomes, including depressive symptom reduction, therapeutic alliance and patient satisfaction (Singla et al., Reference Singla, Silver, Vigod, Schoueri-Mychasiw, Kim, La Porte, Ravitz, Schiller, Lawson, Kiss, Hollon, Dennis, Berenbaum, Krohn, Gibori, Charlebois, Clark, Dalfen, Davis, Gaynes, Leszcz, Katz, Murphy, Naslund, Reyes-Rodriguez, Stuebe, Zlobin, Mulsant, Patel and Meltzer-Brody2025). These results highlight that core relational mechanisms of therapy, such as empathy, alliance and perceived social support, can be effectively preserved within virtual care models. Importantly, SUMMIT also found comparable outcomes between specialist and non-specialist providers.
Our study builds upon and extends these insights to a different delivery model and implementation context. Whereas SUMMIT examined synchronous, therapist-led psychotherapy via telemedicine in high-income settings, our intervention evaluated a technology-assisted, peer-delivered model implemented in a low-resource community context. In such settings, digital tools not only serve as a conduit for care but also as a fidelity mechanism that standardises delivery, guides peer providers step-by-step and embeds therapeutic components that might otherwise depend on specialist expertise.
The focus on empathic engagement in our study provides an important complement to traditional measures of therapeutic alliance: It captures the affective and attitudinal aspects of relational quality that are central to effective psychosocial care, even when sessions are guided by structured digital content rather than conventional therapist–client interactions. Taken together, our results suggest that the relational foundations of psychotherapy – typically thought to rely on in-person human contact – can be maintained in digitally mediated, task-shared formats. This convergence with SUMMIT reinforces the notion that digital delivery does not necessarily compromise therapeutic quality. Rather, when supported by structured training, supervision and evidence-based digital tools, non-specialists and peers can achieve meaningful engagement and therapeutic outcomes that parallel those of specialist- or telemedicine-delivered interventions.
The absence of detectable mediation via CBT skill acquisition warrants reflection. One interpretation is measurement insensitivity: Session-level adherence checklists may capture the delivery of CBT elements but not the depth of internalisation by participants. Alternatively, THP-TAP’s animated storylines may have standardised CBT delivery so effectively that between-arm variance in skill acquisition (including cognitive restructuring, problem solving and behavioural activation) was too small to mediate outcome differences. Under either scenario, the data suggest that fidelity to specific CBT techniques was successfully maintained across platforms, supporting the premise that automation can shield core content from “programme drift” even when supervision intensity is reduced (4).
Mechanistic and practical implications
Taken together, the findings imply a division of labour between technology and human agents. The app guarantees consistent CBT dosage and sequencing, while peers with lived experience provide empathic resonance and activate existing social networks. This distinction is particularly important: Social support was perceived by perinatal women as less accessible in the LHW-delivered WHO-THP, where competing public health responsibilities constrain opportunities for relationship building. For health systems grappling with LHW overload, the technology-assisted peer-delivered intervention offers two advantages: (i) It preserves therapeutic potency without further taxing professional cadres, and (ii) it yields incremental gains in social connectedness, a protective factor with benefits beyond depression per se. From an implementation standpoint, the modest magnitude of the social-support pathway should explicitly cultivate community-engagement skills, and digital content could be updated to include structured prompts that encourage involvement of key relatives. Because cost-effectiveness models already indicate lower per-patient costs for THP-TAP once equipment is amortised (6), even small additional gains in social functioning are likely to improve value for money.
Strengths and limitations
This mediation study possesses several methodological strengths that enhance the credibility and relevance of its findings. First, it is embedded within a large, single-blind, pragmatic cluster-randomised trial spanning 70 villages. By utilising existing LHW infrastructure and community-nominated peers, the research is conducted in real-world health system conditions, which contributes to its external validity. Moreover, the use of robust causal mediation analysis and exploration of mediator interdependencies improves the robustness of results. Finally, linguistic validation of mediation instruments, as well as co-production with service users and engagement of district health authorities, enhanced the psychometric soundness of interviews.
However, this study also has several limitations. Although cluster randomisation reduced contamination, social ties across neighbouring villages may still have diluted between-arm contrasts. Outcomes and mediators were captured by self-report – inviting shared-method and social-desirability bias – and the final mediator assessment coincided with the primary outcome, so temporal ordering is inferred rather than observed. The parent trial was powered for non-inferiority, and not mediation, limiting sensitivity to detect small indirect effects. Additionally, findings may not generalise to urban LMIC settings with different workforce and digital infrastructure profiles. Another limitation is that the assessment of key behavioural outcomes relied solely on self-reported activation levels rather than incorporating objective measures such as those obtainable through wearable devices.
Future research
The findings endorse technology-assisted, peer-delivered THP-TAP as a service-ready alternative to the face-to-face LHW model. Because the App standardises CBT content and the peers are drawn from the same communities, districts can redeploy scarce supervisory resources towards recruitment, brief tablet-based training and light-touch quality monitoring rather than continuous skills coaching. The additional benefit in perceived social support suggests that embedding lived experience peers not only preserves therapeutic potency but also addresses a key social risk factor in the perinatal period. Ministries of Health and Non-governmental organizations (NGOs) implementing maternal-child-health packages could therefore integrate THP-TAP into routine antenatal and postnatal visits, use peer facilitators to engage husbands and mothers-in-law and leverage existing tablet roll-outs for immunisation or community surveillance to keep marginal costs low.
For researchers, the modest but significant social support pathway invites a sharper focus on relational mechanisms in low-intensity CBT. Future trials should incorporate objective and ecological measures of network engagement, experimentally manipulate “dosages” of digital content versus peer contact and test whether augmenting social support modules yields incremental benefit. Longitudinal follow-ups beyond the first post-partum year are needed to examine whether early gains translate into sustained maternal functioning and improved child developmental trajectories. Finally, adaptive platform trials that embed mechanism-driven optimisation cycles could refine both digital assets and peer-training curricula, moving the field towards precision task-sharing in global mental health care.
Conclusions
Digital THP delivered by peers with lived experience is clinically equivalent to, and on some social dimensions superior to, standard WHO-THP delivered by trained community health workers (CHWs). Mediation analysis indicates that increased perceived social support, rather than differential CBT skill mastery, constitutes an effective pathway of additional benefit. These findings bolster the case for integrating technology-assisted, peer-delivered psychosocial interventions into overstretched maternal health services and highlight community-embedded support as a modifiable target for future optimisation.
Furthermore, these findings underscore that technology-enabled interventions should be designed to strengthen, rather than replace, human connection. In this study, the digital platform functioned as a facilitator of supportive peer–mother relationships, highlighting the importance of maintaining interpersonal processes at the core of scalable psychosocial care.
Open peer review
To view the open peer review materials for this article, please visit http://doi.org/10.1017/gmh.2025.10109.
Supplementary material
The supplementary material for this article can be found at http://doi.org/10.1017/gmh.2025.10109.
Author contribution
AW, AR and SS designed the study. NA and AR designed the intervention. NS conducted the data analysis. NS and AW interpreted the study results and wrote the initial draft of the manuscript. AM and HN co-developed the intervention and oversaw the field work. DW and AN provided statistical oversight. All authors critically reviewed the manuscript and approved the final manuscript for submission.
Competing interests
The authors declare none.
Ethical statement
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2013. Ethical approval was secured from the Research Ethics Committees of the Human Development Research Foundation (Pakistan), the University of Liverpool (ethical approval: UoL001668) and the National Bioethics Committee (Pakistan).





Comments
Dear Editors,
Please consider the enclosed manuscript, “Comparing standard and technology-assisted Peer-delivered CBT for Perinatal Depression: A causal mediation study”, for publication in Global Mental Health.
Perinatal depression is a leading, yet under-addressed, contributor to global disability. Although the World Health Organization’s task-shared Thinking Healthy Programme (WHO-THP) is effective, scale-up is constrained by human-resource demands. The ENHANCE cluster-randomised non-inferiority trial showed that our co-designed digital adaptation (THP-TAP) achieves comparable clinical benefit. What has not been examined is how each intervention works. In this paper we apply contemporary counterfactual causal-mediation methods to the ENHANCE data set (n = 823) to identify the therapeutic processes that drive symptom change.
Our principal finding is that perceived social support uniquely mediates the benefit of THP-TAP (adjusted indirect effect = –0.072 PHQ-9 points; bias-corrected 95 % CI –0.170 to –0.018). We observed no between-arm differences in behavioural activation, cognitive restructuring, problem-solving, session “dose,” or peer empathy; consequently, none of these factors differentially mediated the impact of face-to-face versus digitally delivered CBT on depressive symptoms. Accordingly, both the digital and traditional formats deliver these CBT-specific components with comparable fidelity and strength.
The study advances the field by:
• providing the first mechanism-focused comparison of a digital and face-to-face, peer-delivered CBT in a low-resource context;
• offering actionable design guidance for next-generation digital psychosocial interventions.
These contributions align well with Global Mental Health’s scope in global mental health, psychotherapy mechanisms, and digital psychiatry.
Thank you for considering our submission. We believe the manuscript will be of interest to your readership and advance understanding of how to optimise scalable treatments for perinatal depression. We would be delighted to address any queries you or the reviewers may have.
Yours sincerely,
Dr Ahmed Waqas, PhD
Corresponding author