Comparison of capability and health-related quality of life instruments in capturing aspects of mental well-being in people with schizophrenia and depression

Timea Mariann Helter; Joanna Coast; Agata Łaszewska; Tanja Stamm; Judit Simon

doi:10.1192/bjo.2022.514

Comparison of capability and health-related quality of life instruments in capturing aspects of mental well-being in people with schizophrenia and depression

Published online by Cambridge University Press: 27 June 2022

Tanja Stamm and

Timea Mariann Helter*: Affiliation:
Department of Health Economics, Center for Public Health, Medical University of Vienna, Austria
Joanna Coast: Affiliation:
Health Economics Bristol, Population Health Sciences, Bristol Medical School, University of Bristol, UK
Agata Łaszewska: Affiliation:
Department of Health Economics, Center for Public Health, Medical University of Vienna, Austria
Tanja Stamm: Affiliation:
Section for Outcomes Research, Center for Medical Statistics, Informatics, and Intelligent Systems, Medical University of Vienna, Austria; and Ludwig Boltzmann Institute for Arthritis and Rehabilitation, Vienna, Austria
Judit Simon: Affiliation:
Department of Health Economics, Center for Public Health, Medical University of Vienna, Austria; and Department of Psychiatry, University of Oxford, Warneford Hospital, Oxford, UK
*: Correspondence: Timea Mariann Helter. Email: timea.helter@muv.ac.at

Article contents

Abstract
Background
Aims
Method
Results
Conclusions
Method
Results
Discussion
Footnotes
References

Rights & Permissions

Abstract

Background

There is increasing evidence that assessing outcomes in terms of capability provides information beyond that of health-related quality of life (HRQoL) for outcome evaluation in mental health research and clinical practice.

Aims

To assess similarities and differences in the measurement properties of the ICECAP-A capability measure and Oxford Capabilities Questionnaire for Mental Health (OxCAP-MH) in people with schizophrenia experiencing depression, and compare these measurement properties with those of (a) the EuroQol EQ-5D-5L and EuroQol Visual Analogue Scale (EQ-VAS) and (b) mental health-specific (disease-specific) measures.

Method

Using data for 100 patients from the UK, measurement properties were compared using correlation analyses, Bland–Altman plots and exploratory factor analysis. Responsiveness was assessed by defining groups who worsened, improved or remained unchanged, based on whether there was a clinically meaningful change in the instrument scores between baseline and 9-month follow-up assessments.

Results

The two capability instruments had stronger convergent validity with each other (Spearman's rho = 0.677) than with the HRQoL (rho = 0.354–0.431) or the mental health-specific (rho = 0.481–0.718) instruments. The OxCAP-MH tended to have stronger correlations with mental health-specific instruments than the ICECAP-A, whereas the ICECAP-A had slightly stronger correlation with the EQ-VAS. Change scores on the capability instruments correlated weakly with change scores on the HRQoL scales (rho = 0.131–0.269), but moderately with those on mental health-specific instruments for the ICECAP-A (rho = 0.355–0.451) and moderately/strongly on the OxCAP-MH (rho = 0.437–0.557).

Conclusions

Assessing outcomes in terms of capabilities for people with schizophrenia and depression provided more relevant, mental health-specific information than the EQ-5D-5L or the EQ-VAS. The ICECAP-A and the OxCAP-MH demonstrated similar psychometric properties, but the OxCAP-MH was more correlated with disease-specific instruments.

Keywords

Capability approach EQ-5D-5L ICECAP-A OxCAP-MH schizophrenia

Information

Type: Papers
Information: BJPsych Open , Volume 8 , Issue 4 , July 2022 , e117

DOI: https://doi.org/10.1192/bjo.2022.514 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution licence (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
Copyright: Copyright © The Author(s), 2022. Published by Cambridge University Press on behalf of the Royal College of Psychiatrists

The capability approach is a theoretical framework with a range of potential applications. It has been highlighted as a way of helping people with mental health difficulties to engage with their values and priorities, for instance through influencing the design and delivery of mental health services.^{Reference Sen1} The capability approach was developed by Amartya Sen with a core focus on what individuals are free and able to do and be (i.e. are capable of).^{Reference Sen1} This approach places emphasis on promoting well-being through enabling people to realise their capabilities and engage in behaviours that they value.^{Reference Sen1}

Economic evaluations are comparative analyses of alternative courses of action based on the costs and consequences of the alternatives being considered.^{Reference Drummond, Sculpher, Torrance, O'Brien and Stoddart2} There is increasing interest in the use of the capability approach for the economic evaluation of health-related interventions, including those in mental healthcare.^{Reference Helter, Coast, Laszewska, Stamm and Simon3} One reason for this is the wider evaluative space this approach offers for the measurement of well-being in comparison to the commonly used methods of assessing effects in terms of health-related quality of life (HRQoL).^{Reference Simon, Paul, Gray, Rugkasa, Yeeles and Burns4} Currently the EuroQol EQ-5D instruments (including a self-rated health assessment recorded on a vertical visual analogue scale, the EQ-VAS) are the most commonly recommended and used HRQoL instruments for economic evaluations in healthcare.⁵ However, HRQoL instruments such as the EQ-5D may not capture important effects of interventions when these go beyond health change and therefore underestimate their full effect on welfare.^{Reference Francis and Byford6} Mental healthcare interventions and services usually target both health and social impairments because many people with severe and enduring mental illness experience significant functional and social challenges which may also spill over into effects on education, employment, justice and families.^{Reference Simon, Paul, Gray, Rugkasa, Yeeles and Burns4} There is evidence of these impacts beyond HRQoL in mental illness, and particularly in the case of schizophrenia.^{Reference Halling Hastrup, Nordentoft, Hjorthøj and Gyrd-Hansen7} People with schizophrenia are more likely to be homeless, unemployed or living in poverty compared with the general population; moreover, their disease is progressive and causes increasing disability, dependence on care and need for assistance from others in carrying out activities of daily life.^{Reference Wander8} The prevalence of depressive disorder in schizophrenia has been reported to be around 50%.^{Reference Buckley, Miller, Lehrer and Castle9} Evidence suggests that depression is linked to poorer outcomes in schizophrenia, as well as to particularly high levels of healthcare use and suicide.^{Reference Palmer, Pankratz and Bostwick10}

A recent literature review of capability instruments in economic evaluations of health-related interventions identified 14 such instruments, which differ substantially in their development methods, items, item levels, target populations and the interventions that were evaluated using them.^{Reference Helter, Coast, Laszewska, Stamm and Simon3} Two of the most commonly used and validated instruments, particularly for economic evaluations among adults with mental health problems,^{Reference Helter, Coast, Laszewska, Stamm and Simon3} are the ICECAP capability measure for adults (ICECAP-A) and the Oxford Capabilities Questionnaire for Mental Health (OxCAP-MH). Although both instruments are grounded in the capability approach, their conceptual approaches differ. The ICECAP-A belongs to a broader group of ICECAP capability instruments, each focusing on important capabilities according to a particular life stage.^{Reference Coast11} It was developed in the UK, using bottom-up participatory methods as recommended by Sen^{Reference Al-Janabi, Flynn and Coast12} to generate contextual capability attributes for the whole adult population. The OxCAP-MH was also developed in the UK, originally for capability well-being measurement in mental health, using an alternative top-down approach.^{Reference Francis and Byford6} This approach, which is rooted in Nussbaum's central human capabilities, is deemed less limited by geographical and cultural contexts.^{Reference Nussbaum13}

Empirical comparisons between HRQoL and capability instruments have been conducted in a variety of disease areas, but there is limited information available for mental health conditions.^{Reference Helter, Coast, Laszewska, Stamm and Simon3} Furthermore, comparative studies of alternative capability instruments in the area of mental health are lacking, including empirical studies that focus on the use of the ICECAP-A and OxCAP-MH instruments for the same patient population. So far, suggestions for choosing between these measures tended to rest only on the underlying properties of the descriptive and valuation systems.^{Reference Helter, Coast, Laszewska, Stamm and Simon3} Therefore, questions remain about how far different applications of the same broad theoretical concept of the capability evaluative space result in different measurement properties, and whether this influences the capability instruments’ properties when compared with HRQoL instruments for optimised research design in mental health. This study aimed to compare the measurement properties of two most commonly used capability instruments in the mental health context, the ICECAP-A and OxCAP-MH,^{Reference Helter, Coast, Laszewska, Stamm and Simon3} and compare them to the EQ-5D-5L and the EQ-VAS alongside some mental health-specific instruments in people with schizophrenia experiencing depression.

Method

Data

The analysis in this paper was based on data from a study that investigated the impact of positive memory training (PoMeT) on depression symptoms in people with schizophrenia (n = 100) in the UK between 2014 and 2016.^{Reference Steel, Korrelboom, Fazil Baksh, Kingdon, Simon and Wykes14} Individuals were eligible for inclusion if they were 18–65 years of age, had a DSM-5 diagnosis of schizophrenia or schizoaffective disorder, and had at least a mild level of depression as measured by scoring 14 or more on the Beck Depression Inventory-II (BDI-II).^{Reference Beck, Ward, Mendelson, Mock and Erbaugh15} Participants were identified by trial research assistants who worked in collaboration with care coordinators based in community mental health teams. Participants were assessed at four time points through the 9-month study period: baseline, 3 months, 6 months and 9 months. The assessors, who were masked to treatment allocation, administered the questionnaires to the participants. The PoMeT study was performed in accordance with the Helsinki Declaration and was approved by Berkshire Research Ethics Committee (REC ref. 13/SC/0634). All participants provided written informed consent. More details about the PoMeT trial can be found in Steel et al.^{Reference Steel, Korrelboom, Fazil Baksh, Kingdon, Simon and Wykes14}

Instruments

Two capability measures (ICECAP-A and OxCAP-MH), a generic HRQoL instrument (EQ-5D-5L, including the EQ-VAS), and four mental health-specific instruments (Beck Depression Inventory II (BDI-II), Generalized Anxiety Disorder 7 questionnaire (GAD-7), Rosenberg Self-Esteem Scale (RSES) and the Warwick–Edinburgh Mental Wellbeing Scale (WEMWBS)) were used in the study. The focus of this paper is on the comparison of the capability and generic HRQoL instruments with the mental health-specific instruments used as anchors.

Capability instruments

ICECAP-A

The ICECAP-A is a brief self-reported measure for the general adult population with five items, each of which can take one of four levels, ranging from full capability to no capability. The items include: stability (being able to feel settled and secure), attachment (being able to have love, friendship and support), autonomy (being able to be independent), achievement (being able to achieve and progress) and enjoyment (being able to have enjoyment and pleasure). The ICECAP-A value set used in this study was derived from the UK general population and ranges from 0 (no capability) to 1 (full capability).^{Reference Flynn, Huynh, Peters, Al-Janabi, Clemens and Moody16}

The instrument has shown validity,^{Reference Al-Janabi, Peters, Brazier, Bryan, Flynn and Clemens17} reliability,^{Reference Khan and Richardson18} responsiveness^{Reference Keeley, Al-Janabi, Nicholls, Foster, Jowett and Coast19} and feasibility^{Reference Keeley, Al-Janabi, Lorgelly and Coast20} in different populations, including depression,^{Reference Mitchell, Al-Janabi, Byford, Kuyken, Richardson and Iezzi21} and it is increasingly used in economic evaluations.^{Reference Afentou and Kinghorn22}

OxCAP-MH

The OxCAP-MH is a 16-item self-report instrument developed in the context of mental health outcome measurement. The 16 items cover a broad range of individual capability well-being aspects including: overall health; enjoying social and recreational activities; losing sleep over worry; friendship and support; having suitable accommodation; feeling safe; likelihood of discrimination and assault; freedom of personal and artistic expression; appreciation of nature; self-determination; and access to interesting activities or employment.^{Reference Simon, Paul, Gray, Rugkasa, Yeeles and Burns4} The OxCAP-MH items are rated on a 1–5 Likert scale where each question provides an equal contribution to the overall score. The standardised score ranges from 0 to 100, with higher scores reflecting higher levels of capabilities.^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23} The OxCAP-MH has shown validity,^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} responsiveness^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} and feasibility^{Reference Simon, Paul, Gray, Rugkasa, Yeeles and Burns4} in several settings and areas of mental illness, including schizophrenia and depression.^{Reference Simon, Mayer, Łaszewska, Rugkåsa, Yeeles and Burns25}

HRQoL instruments

EQ-5D-5L, including EQ-VAS

The EQ-5D instruments are one of the most commonly used self-reported generic health status measures, and their validity and reliability have been reported for various health conditions and populations.^{Reference Payakachat, Ali and Tilford26} The more recent EQ-5D-5L version comprises five dimensions: mobility, self-care, usual activities, pain/discomfort and anxiety/depression, with 5-level answer options and with preference-based value sets developed in multiple countries.²⁷ This study used the UK crosswalk mapping value set developed for the EQ-5D-5L,^{Reference van Hout, Janssen, Feng, Kohlmann, Busschbach and Golicki28} following recommendations from the National Institute for Health and Care Excellence.⁵ Scores range between the worst (−0.594) and best (1.000) imaginable health state; zero refers to death. As part of the EQ-5D-5L, respondents’ self-rated health is also recorded on a vertical visual analogue scale (EQ-VAS), where scores range between 0 and 100, referring to worst imaginable health state and best imaginable health state respectively.

Mental health-specific instruments

Four mental health-specific instruments used in the PoMeT trial were applied as anchors in the correlation and responsiveness analyses. This was due to their ability to capture the clinically relevant status of the patients together with any changes in status, thereby enabling the responsiveness assessment of the OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS instruments.

BDI-II

The BDI-II is a self-reported measure of depressive symptoms and their severity in adolescents and adults according DSM diagnostic criteria.^{Reference Beck, Ward, Mendelson, Mock and Erbaugh15} It has 21-items, each scored on 4-point polytomous response scale ranging from 0 to 3. Total scores range between 0 and 63, with higher scores representing more severe depression.^{Reference Beck, Ward, Mendelson, Mock and Erbaugh15}

GAD-7

The GAD-7 is a self-reported measure of anxiety symptoms over the past 2 weeks. It consists of seven items scored on a 0–3 scale, with higher scores indicating more severe symptoms (range from 0 to 21). The cut-off scores of 5, 10 and 15 reflect mild, moderate and severe anxiety symptoms respectively.^{Reference Spitzer, Kroenke, Williams and Löwe29}

RSES

The RSES is a 10-item self-report instrument that measures global self-worth by measuring both positive and negative feelings about the self. Items are answered using a 4-point polytomous response scale ranging from strongly agree to strongly disagree. Items 2, 5, 6, 8, 9 are reverse scored. The scale ranges from 0 to 30, with 30 indicating the highest score possible.^{Reference Rosenberg30}

WEMWBS

The WEMWBS was developed in the UK to assess mental well-being, including affective-emotional aspects, cognitive-evaluative dimensions and psychological functioning. It is a 14-item self-report instrument with 5 response categories (‘none of the time’, ‘rarely’, ‘some of the time’, ‘often’, ‘all of the time’), with a total score ranging from 14 to 70. A higher score indicates a higher level of mental well-being.^{Reference Tennant, Hiller, Fishwick, Platt, Joseph and Weich31}

Analyses

We conducted analyses to assess the strength and statistical significance of correlations between instruments using statistical and graphical methods: exploratory factor analysis (EFA) to assess the underlying construct of the measures; assessment of responsiveness, based on Spearman's rank correlations (Spearman's rho) between change scores on the instruments; and agreement analysis using Bland–Altman plots. All analyses were performed using the general population value sets for the ICECAP-A and EQ-5D-5L instruments, with the summed ranges for other instruments. Sensitivity analysis was performed using level sum scores (results are presented in supplementary Appendix 1, available at https://doi.org/10.1192/bjo.2022.514).

For all analyses, the level of significance was set at P < 0.05, unless stated otherwise. EFA was conducted using FACTOR 12.01.02 software for Windows (Rovira i Virgili University, Tarragona, Spain; see https://psico.fcep.urv.cat/utilitats/factor/Download.html) and STATA Version 16 for Windows was used for all other analyses. Analysis was conducted on complete cases, excluding missing items at the relevant time point, unless stated otherwise.

Convergent validity

We explored convergent validity through correlations across the instruments to test the expectation that capability instruments would be more highly correlated (converge) with each other than with the HRQoL instruments. Convergent validity indicates the degree to which two measures of constructs which are theoretically related are in fact related. Convergence between the instruments was tested through exploring the correlation (Spearman's rank) between their baseline scores and assessed based on Cohen's effect size classification, namely <0.3 is small, 0.3 to <0.5 is moderate and ≥0.5 is large.^{Reference Cohen32} Group comparisons of mean baseline scores were conducted using the Wilcoxon rank-sum test^{Reference Wilcoxon, Kotz and Johnson33} for two-group comparisons and Kruskal–Wallis one-way analysis of variance (ANOVA) for multiple group comparisons.^{Reference Kruskal and Wallis34}

Exploratory factor analysis

We used EFA on the baseline scores of the ICECAP-A, OxCAP-MH and EQ-5D-5L to explore the instruments' factor structure. We conducted EFA for all instruments on the same sample to determine potential overlaps of factors. Making the factor structure explicit contributes to the validity of an instrument.^{Reference Thompson and Daniel35} The factor solution was chosen according to the Kaiser criterion based on a scree plot and the eigenvalues, as described in supplementary Appendix 2, together with further details on the methods of the EFA.

Responsiveness

The responsiveness of OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS was assessed using external anchor instruments. The process started with the definition of two instruments which could be used as autonomous anchors because they identify change that is unlikely to have arisen by chance.^{Reference Halling Hastrup, Nordentoft, Hjorthøj and Gyrd-Hansen7} The level of responsiveness was evaluated by defining groups who worsened, improved or remained unchanged, based on whether a clinically meaningful change in instrument scores between baseline and 9-month follow-up assessments was measured for individuals by both anchor instruments. The standard error of measurement (SEM) was calculated based on the difference between the baseline and 9-month values using the following formula:^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Wyrwich, Tierney and Wolinsky36}

$${\rm SE}{\rm M}_{{\rm diff}} = \sqrt {( {{\rm SEM}_1^2 + {\rm SEM}_2^2 } ) } $$

There is no consensus on how many SEMs an individual's score must change by for that change to be considered clinically meaningful. This paper used the threshold of one SEM, which is known frequently to correspond with a minimally important difference.^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Wyrwich, Tierney and Wolinsky36} Next, the percentages of the study respondents who improved, worsened or showed no change according to the capability, HRQoL and anchor questionnaires were calculated to explore the detected congruence in changes at the individual patient level. An additional analysis of responsiveness in terms of correlation between change scores (baseline to end-point) of the instruments can be found in supplementary Appendix 5.

Agreement analysis

The pattern and extent of agreement between ICECAP-A, OxCAP-MH, EQ-5D-5L and EQ-VAS scores were plotted on Bland–Altman diagrams because they are competitor measures for economic evaluations.^{Reference Martin Bland and Altman37} The difference between the instruments is shown on the vertical axis against the mean of the pair on the horizontal axis. For the calculation of the values for the Bland–Altman plots, OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS scores were standardised to 0–1 intervals where necessary.

Results

Participant characteristics

The mean age of participants was 43 years and 75% were males. About half of the participants had received higher education and 86% were unemployed at the time of data collection. Schizophrenia was the primary diagnosis in 70% of the participants, and the rest suffered from schizoaffective disorder or psychosis not otherwise specified. Overall, 83% of participants reported previous hospital admission for psychiatric reasons, with a mean number of 4.5 admissions. The severity of depression was high for 59% of participants. There were significant differences between those with mild to moderate depression and those with depression of high severity in the baseline score for each instrument. Further participant characteristics are presented in Table 1.

Table 1 Patient characteristics and mean baseline OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS scores

OxCAP-MH, Oxford Capabilities Questionnaire for Mental Health; ICECAP-A, ICECAP capability measure for adults; EQ-VAS, EuroQol Visual Analogue Scale; NOS, not otherwise specified.

a. Wilcoxon rank-sum test for two-group comparison, Kruskal–Wallis one-way ANOVA for multiple group comparison.

Convergent validity

Mean baseline scores for all instruments and their correlations are presented in Table 2. Graphical presentation of baseline correlations is included in supplementary Appendix 3.

Table 2 Baseline scores of the relevant outcome measures used in the trial and the associated Spearman's rank correlations

rho, Spearman's rank correlation coefficient; OxCAP-MH, Oxford Capabilities Questionnaire for Mental Health; ICECAP-A, ICECAP capability measure for adults; EQ-VAS, EuroQol Visual Analogue Scale; BDI-II, Beck Depression Inventory II; GAD-7, General Anxiety Disorder 7; RSES, Rosenberg Self-Esteem Scale; WEMWBS, Warwick–Edinburgh Mental Wellbeing Scale.

a. Moderate correlations (0.3–0.5) in italic, strong correlations (≥0.5) in bold.

b. Wilcoxon matched pairs signed rank test.

Correlations between the capability and HRQoL measures (rho = 0.354–0.431) were lower than those between ICECAP-A and OxCAP-MH (rho = 0.677). The ICECAP-A was slightly more correlated with EQ-VAS (rho = 0.431) than with the EQ-5D-5L (rho = 0.363), whereas the OxCAP-MH was slightly more correlated with the EQ-5D-5L (rho = 0.389) than with the EQ-VAS (rho = 0.354). The baseline scores of both capability instruments were more highly correlated with the mental health-specific instruments (rho = 0.481–0.718) than with the generic HRQoL instruments (rho = 0.354–0.431).

Exploratory factor analysis

A four-factor solution was selected for EFA. It found that all items of the instruments had commonalities greater than 0.35, i.e. none of the items struggled to load significantly on any factor. Factor loadings are shown in Table 3 for any factor >0.35.

Table 3 Exploratory factor analysis of the OxCAP-MH, ICECAP-A and EQ-5D-5L items with four factors using promin rotation (n = 78)^a

OxCAP-MH, Oxford Capabilities Questionnaire for Mental Health; ICECAP-A, ICECAP capability measure for adults.

a. Loadings ≤0.35 were removed.

Factor one is an undoubtedly HRQoL-related factor, which consisted of the five EQ-5D-5L and some ICECAP-A and OxCAP-MH items. None of the EQ-5D-5L items loaded on factors two, three and four, thereby suggesting that these factors capture the broader capability-related concepts. Multiple items of the ICECAP-A and OxCAP-MH loaded on these factors, but factor four consisted of only two OxCAP-MH items (‘appreciation of nature’ and ‘respect for people around’), with remarkably high commonalities.

Responsiveness

The GAD-7 and WEMWBS measures were selected as suitable reference anchor instruments for the analysis of responsiveness since they had the highest correlation with the four scales under investigation in this paper. Table 4 shows the number of participants who improved, showed no change or deteriorated based on the capability, HRQoL and anchor instruments. The OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS measures identified different proportions of improved (48–52%, 48–52%, 29–35% and 39–52% respectively) and deteriorated (35–53%, 27–35%, 21–33% and 21–33%, respectively) participants in agreement with the anchor instruments. Out of the four scales, the OxCAP-MH identified the largest proportion of participants who had a clinically meaningful change in their capability status over the 9 months (44–54%), and it had the overall best congruency with the anchor measures, especially in terms of identifying deterioration in clinical status (53%), indicating better sensitivity to change than the rest of the measures.

Table 4 Number of patients improved, deteriorated or unchanged as defined by the investigated and anchor questionnaires (based on SEM) (n = 78)

GAD-7, General Anxiety Disorder 7; WEMWBS, Warwick–Edinburgh Mental Wellbeing Scale; OxCAP-MH, Oxford Capabilities Questionnaire for Mental Health; ICECAP-A, ICECAP capability measure for adults; EQ-VAS, EuroQol Visual Analogue Scale.

a. Changes in instrument scores between baseline and 9-month follow-up were categorised as improved, worsened or unchanged; definition of groups is based on the difference in standard error of measurement (SEM); numbers were rounded and do not always add up to 100%; values in agreement are in bold.

Agreement analysis

The Bland–Altman plots (Fig. 1(a)–(f)) showed that the ICECAP-A and OxCAP-MH have poorer agreement with the EQ-5D-5L than with each other or the EQ-VAS. More specifically, there was small average discrepancy between the four instruments; however, the limits of agreement were wider and therefore more ambiguous in the comparisons with the EQ-5D-5L descriptive system than in the direct comparison of the OxCAP-MH, ICECAP-A and EQ-VAS.

Fig. 1 (a) Bland–Altman plot of difference between OxCAP-MH and ICECAP-A change scores (n = 78). (b) Bland–Altman plot of difference between OxCAP-MH and EQ-5D-5L descriptive system change scores (n = 79). (c) Bland–Altman plot of difference between OxCAP-MH and EQ-VAS change scores (n = 79). (d) Bland–Altman plot of difference between ICECAP-A and EQ-5D-5L descriptive system change scores (n = 88). (e) Bland–Altman plot of difference between ICECAP-A and EQ-VAS change scores (n = 88). (f) Bland-Altman plot of difference of EQ-5D-5L descriptive system and EQ-VAS change scores (n = 90).

OxCAP-MH, Oxford Capabilities Questionnaire for Mental Health; ICECAP-A, ICECAP capability measure for adults; EQ-VAS, EuroQol Visual Analogue Scale.

Discussion

Summary and interpretation of the findings

This paper aimed to contribute to the utilisation of the capability approach in the context of mental health by assessing how far alternative capability instruments capture broader outcome information in people with schizophrenia and depression compared with the HRQoL and in relation to each other, with some mental health-specific measures serving as anchors. At baseline, the two capability instruments were strongly correlated with each other, but moderately correlated with the EQ-5D-5L and EQ-VAS. Both capability measures correlated with mental health-specific instruments more strongly than the two HRQoL measures and proved superior to the HRQoL scales in terms of responsiveness. Together with the results of the EFA and the Bland–Altman plots, our findings confirm that the capability evaluative space captures aspects of what might be important for individuals with severe mental health conditions beyond the aspects that commonly used HRQoL scales are able to measure. This is also suggestive of their ability to measure important effects of mental health interventions beyond health. In general, our research shows that capability instruments may be seen as complementary to HRQoL instruments in their measurement properties in this context.

This paper also compared the psychometric properties of the two capability instruments with each other in the context of the given severe mental health condition. The study empirically demonstrated that these two instruments show somewhat different psychometric properties when deployed on the same patient cohort despite their common theoretical framework. The ICECAP-A showed a slightly weaker anchor-based responsiveness to depression, anxiety and mental well-being as measured by the GAD-7 and WEMWBS scales than the OxCAP-MH. Furthermore, the EFA and responsiveness analyses suggested a somewhat broader evaluative space and better sensitivity to change for the OxCAP-MH in comparison with the ICECAP-A. These differences may be explained by: (a) their different approaches to development, (b) the number of items in the measures (5 items in the ICECAP-A, 16 items in the OxCAP-MH) and (c) the more mental health-specific nature of the OxCAP-MH compared with the fully generic nature of the ICECAP-A.

Relationship between the findings and the existing literature

This analysis found slightly weaker correlations between the capability instruments and the EQ-5D-5L than some of the previous studies conducted in the area of mental health. The OxCAP-MH was compared with the EQ-5D-3L and EQ-5D-5L in mixed mental health populations and found correlation coefficients between 0.45 and 0.66.^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} Similar correlations were observed between the ICECAP-A and the EQ-5D when they were compared for opiate-dependent individuals: one study found that the ICECAP-A and EQ-5D-5L have similar construct validity when compared with other clinical measures.^{Reference Goranitis, Coast, Day, Copello, Freemantle and Seddon38} The slightly different results of the current study seem confirmatory of the previously identified weaknesses of the EQ-5D-5L instrument in measuring HRQoL in people with severe or complex mental illness.^{Reference Brazier39} Our results are also similar to the findings of a study comparing the ICECAP-A and EQ-5D-5L in the area of depression, which concluded that instruments designed specifically to measure depression and mental health explained a greater proportion of the variation in the ICECAP-A than in the EQ-5D-5L.^{Reference Mitchell, Al-Janabi, Byford, Kuyken, Richardson and Iezzi21} The slightly weaker correlations between the ICECAP-A and EQ-5D-5L could be explained by the specific area of depression, where evidence has shown that this disease is associated with a higher level of disability on the item of interpersonal activities.^{Reference Zlotnick, Kohn, Keitner and Della Grotta40} Moreover, previous factor analyses comparing the ICECAP-A with the items of the EQ-5D-5L and EQ-5D-3L found that these instruments measure two different constructs and therefore provide potentially different information. A recent systematic literature review found inconsistencies between the ICECAP-A and EQ-5D instruments, suggesting that the ICECAP-A is most appropriately regarded as a complement to and not a substitute for the EQ-5D-3L and EQ-5D-5L.^{Reference Wyrwich, Tierney and Wolinsky36}

For the OxCAP-MH, a recent study found that the capability measure had strong correlations with the EQ-5D-5L, EQ-VAS and mental health-specific instruments.^{Reference Vergunst, Jenkinson, Burns, Anand, Gray and Rugkåsa23,Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} This indicates that the OxCAP-MH indeed measures some aspects of HRQoL and can be considered enhanced rather than fully complementary to the EQ-5D-5L.^{Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} The same study deployed a two-factor EFA and found that all EQ-5D-5L items and seven OxCAP-MH items loaded on one factor, while nine remaining OxCAP-MH items loaded on a separate factor, also suggestive of the enhanced nature of the instrument when compared with the EQ-5D-5L. The results of the responsiveness analyses from the current study and those from Łaszewska et al^{Reference Łaszewska, Schwab, Leutner, Oberrauter, Spiel and Simon24} are also in line, both concluding that the OxCAP-MH is responsive to the changes in individuals’ health states over time measured by anchor questionnaires, and it discriminates between defined patient groups with high sensitivity.

Strengths and limitations

The study benefited from a data-set that enabled comparison between the two most commonly used capability instruments in the area of mental health and generic HRQoL and mental health-specific scales, and comparison between the two capability instruments themselves. The scope of the paper included various aspects of mental well-being and provided the opportunity for a comprehensive investigation of the capability instruments’ measurement properties in the same context and under the same conditions.

Limitations of this research included a restricted number of data points compared with the number of items, which might affect the robustness of the EFA and the responsiveness analyses, the two methods most sensitive to sample size problems. The study used value sets for the ICECAP-A and EQ-5D-5L but not for the OxCAP-MH, which can introduce an exogenous source of variance into statistical inference. Nevertheless, the sensitivity analysis reported in supplementary Appendix 1 confirmed that this had an insignificant effect on the findings of this study. Finally, the study conducted the comparison and established the psychometric properties of the instruments in a specific context; hence, the results should be generalised with caution to other diseases.

Implications for practice/policy

To our knowledge, this is the first study to empirically compare the two most commonly used capability instruments in the area of mental health and compare them simultaneously to HRQoL and mental health-specific scales. The findings of this study provide an in-depth analysis and understanding of how the method of operationalising the capability approach and the items included in the instruments may eventually influence the interpretation of an economic evaluation based on these instruments.

Our findings confirmed that capability instruments capture complementary information compared with HRQoL instruments. Therefore, consideration needs to be given to the inclusion of a valid capability instrument in future research studies in the area of (severe) mental illness. This is particularly relevant for economic evaluations that are conducted from a societal perspective, where outcomes beyond health play an important role. Currently, a context-specific choice between the ICECAP-A and OxCAP-MH should be based on the trade-off between the higher responsiveness and broader evaluative space of the OxCAP-MH against the somewhat lower participant burden and the currently available value set for the ICECAP-A.

Implications for future research

Establishing the psychometric properties of an instrument is context specific; therefore, these conclusions must be strengthened in other disease areas. Furthermore, comparisons of the two investigated capability instruments with other, newer well-being ones developed for the area of mental health, such as the Achieved Capabilities Questionnaire for Community Mental Health (ACQ-CMH) or Recovering Quality of Life (e.g. ReQol), would further contribute to our understanding of their comparative measurement characteristics and potentially differing outcome results if measured simultaneously. Future research should explore further the effect of value-based scoring once this also becomes available for the OxCAP-MH instrument.

Supplementary material

Supplementary material is available online at https://doi.org/10.1192/bjo.2022.514.

Data availability

The data that support the findings of this study are available on request from the corresponding author.

Acknowledgements

We thank all contributors and participants of the PoMeT trial. We also thank Georg Heinze for his comments on the statistical analyses used in this paper and David Mott, who discussed a previous version of this paper at the Winter 2020 meeting of the Health Economics Study Group in Newcastle, UK.

Author contributions

T.H. and J.S. conceived the research idea and developed the conceptual framework of this research; J.S. provided resources for the study; T.H. conducted the analysis, supervised by J.S. and with input on different aspects of the study from J.C., A.L. and T.S.; T.H. took the lead in writing the manuscript in close consultation with J.C., A.L., T.S. and J.S. All authors provided critical feedback and helped shape the research, analysis and manuscript. All authors approved the final manuscript.

Funding

The PoMeT trial was funded by the National Institute for Health Research (NIHR) under its Research for Patient Benefit (RfPB) Programme (grant reference PB-PG-0712-28021). J.C. is supported by the Wellcome Trust (ref. 205384/Z/16/Z).

Declaration of interest

J.C. has led the development of the ICECAP measures. J.S. has led the development of the OxCAP-MH measure. J.S. is a member of the BJPsych Open editorial board and did not take part in the review or decision-making process of this paper.

Footnotes

A previous version of this paper was presented at the Health Economics Study Group Winter Meeting, 6–8 January 2020 in Newcastle, UK; see https://www.researchsquare.com/article/rs-44326/v4.

References

Sen, A. Commodities and Capabilities. North-Holland, 1985.Google Scholar

Drummond, MF, Sculpher, MJ, Torrance, GW, O'Brien, BJ, Stoddart, GL. Methods for the Economic Evaluation of Health Care Programmes (3rd edn). Oxford University Press, 2005.Google Scholar

Helter, TM, Coast, J, Laszewska, A, Stamm, T, Simon, J. Capability instruments in economic evaluations of health-related interventions – a comparative review of the literature. Qual Life Res 2020; 29: 1433–64.CrossRef Google Scholar

Simon, JA, Paul, A, Gray, A, Rugkasa, J, Yeeles, K, Burns, T. Operationalising the capability approach for outcome measurement in mental health research. Soc Sci Med 2013; 98: 187–96.CrossRef Google Scholar PubMed

National Institute for Health and Care Excellence. Incorporating economic evaluation. In Developing NICE Guidelines: The Manual (April 2017 update) (Process and Methods PMG20): ch 7. NICE, 2017.Google Scholar

Francis, J, Byford, S. SCIE's Approach to Economic Evaluation in Social Care. Social Care Institute for Excellence, 2011.Google Scholar

Halling Hastrup, L, Nordentoft, M, Hjorthøj, C, Gyrd-Hansen, D. Does the EQ-5D measure quality of life in schizophrenia? J Ment Health Policy Econ 2011; 14: 187–96.Google Scholar

Wander, C. Schizophrenia: opportunities to improve outcomes and reduce economic burden through managed care. Am J Manag Care 2020; 26(suppl 3): s62–8.Google Scholar PubMed

Buckley, PF, Miller, BJ, Lehrer, DS, Castle, DJ. Psychiatric comorbidities and schizophrenia. Schizophr Bull 2009; 35: 383–402.CrossRef Google Scholar

Palmer, BA, Pankratz, VS, Bostwick, JM. The lifetime risk of suicide in schizophrenia: a reexamination. Arch Gen Psychiatry 2005; 62: 247–53.CrossRef Google Scholar PubMed

Coast, J. Assessing capability in economic evaluation: a life course approach? Eur J Health Econ 2019; 20: 779–84.CrossRef Google Scholar PubMed

Al-Janabi, H, Flynn, TN, Coast, J. Development of a self-report measure of capability wellbeing for adults: the ICECAP-A. Qual Life Res 2012; 21: 167–76.CrossRef Google Scholar PubMed

Nussbaum, M. Capabilities as fundamental entitlements: Sen and social justice. Fem Econ 2003; 9: 33–59.CrossRef Google Scholar

Steel, C, Korrelboom, K, Fazil Baksh, M, Kingdon, D, Simon, J, Wykes, T, et al. Positive memory training for the treatment of depression in schizophrenia: a randomised controlled trial. Behav Res Ther 2020; 135: 103734.CrossRef Google Scholar PubMed

Beck, AT, Ward, CH, Mendelson, M, Mock, J, Erbaugh, J. An inventory for measuring depression. Arch Gen Psychiatry 1961; 4: 561–71.CrossRef Google Scholar PubMed

Flynn, TN, Huynh, E, Peters, TJ, Al-Janabi, H, Clemens, S, Moody, A, et al. Scoring the ICECAP-Aa capability instrument: estimation of a UK general population tariff. Health Econ 2015; 24: 258–69.CrossRef Google Scholar PubMed

Al-Janabi, H, Peters, TJ, Brazier, J, Bryan, S, Flynn, TN, Clemens, S, et al. An investigation of the construct validity of the ICECAP-A capability measure. Qual Life Res 2013; 22: 1831–40.CrossRef Google Scholar PubMed

Khan, MA, Richardson, J. Variation in the apparent importance of health-related problems with the instrument used to measure patient welfare. Qual Life Res 2018; 27: 2885–96.CrossRef Google Scholar PubMed

Keeley, T, Al-Janabi, H, Nicholls, E, Foster, NE, Jowett, S, Coast, J. A longitudinal assessment of the responsiveness of the ICECAP-A in a randomised controlled trial of a knee pain intervention. Qual Life Res 2015; 24: 2319–31.CrossRef Google Scholar

Keeley, T, Al-Janabi, H, Lorgelly, P, Coast, J. A qualitative assessment of the content validity of the ICECAP-A and EQ-5D-5L and their appropriateness for use in health research. PLoS ONE 2013; 8(12): e85287.CrossRef Google Scholar PubMed

Mitchell, PM, Al-Janabi, H, Byford, S, Kuyken, W, Richardson, J, Iezzi, A, et al. Assessing the validity of the ICECAP-A capability measure for adults with depression. BMC Psychiatry 2017; 17(1): 46.CrossRef Google Scholar PubMed

Afentou, N, Kinghorn, P. A systematic review of the feasibility and psychometric properties of the ICEpop CAPability measure for adults and its use so far in economic evaluation. Value Health 2020; 23: 515–26.CrossRef Google Scholar PubMed

Vergunst, F, Jenkinson, C, Burns, T, Anand, P, Gray, A, Rugkåsa, J, et al. Psychometric validation of a multi-dimensional capability instrument for outcome measurement in mental health research (OxCAP-MH). Health Qual Life Outcomes 2017; 15(1): 250.CrossRef Google Scholar

Łaszewska, A, Schwab, M, Leutner, E, Oberrauter, M, Spiel, G, Simon, J. Measuring broader wellbeing in mental health services: validity of the German language OxCAP-MH capability instrument. Qual Life Res 2019; 28: 2311–23.CrossRef Google Scholar PubMed

Simon, J, Mayer, S, Łaszewska, A, Rugkåsa, J, Yeeles, K, Burns, T, et al. Cost and quality-of-life impacts of community treatment orders (CTOs) for patients with psychosis: economic evaluation of the OCTET trial. Soc Psychiatry Psychiatr Epidemiol 2021; 56: 85–95.CrossRef Google Scholar PubMed

Payakachat, N, Ali, MM, Tilford, JM. Can the EQ-5D detect meaningful change? A systematic review. PharmacoEconomics 2015; 33: 1137–54.CrossRef Google Scholar PubMed

EuroQol Research Foundation. EQ-5D-5L. Valuation: Standard Value Sets. EuroQol Research Foundation, 2019 (https://euroqol.org/eq-5d-instruments/eq-5d-5l-about/valuation-standard-value-sets/).Google Scholar

van Hout, B, Janssen, MF, Feng, YS, Kohlmann, T, Busschbach, J, Golicki, D, et al. Interim scoring for the EQ-5D-5L: mapping the EQ-5D-5L to EQ-5D-3L value sets. Value Health 2012; 15: 708–15.CrossRef Google Scholar PubMed

Spitzer, RL, Kroenke, K, Williams, JB, Löwe, B. A brief measure for assessing generalized anxiety disorder: the GAD-7. Arch Intern Med 2006; 166: 1092–7.CrossRef Google Scholar PubMed

Rosenberg, M. Society and the Adolescent Self-Image. Princeton University Press, 1965.CrossRef Google Scholar

Tennant, R, Hiller, L, Fishwick, R, Platt, S, Joseph, S, Weich, S, et al. The Warwick-Edinburgh Mental Well-being Scale (WEMWBS): development and UK validation. Health Qual Life Outcomes 2007; 5: 63.CrossRef Google Scholar PubMed

Cohen, J. Statistical Power Analysis for the Behavioral Sciences (2nd edn). Routledge, 1988.Google Scholar

Wilcoxon, F. Individual comparisons by ranking methods. In Breakthroughs in Statistics (eds Kotz, S, Johnson, NL): 196–202. Springer, 1992.CrossRef Google Scholar

Kruskal, WH, Wallis, WA. Use of ranks in one-criterion variance analysis. J Am Stat Assoc 1952; 47: 583–621.CrossRef Google Scholar

Thompson, B, Daniel, LG. Factor analytic evidence for the construct validity of scores: a historical overview and some guidelines. Educ Psychol Meas 1996; 56: 197–208.CrossRef Google Scholar

Wyrwich, KW, Tierney, WM, Wolinsky, FD. Further evidence supporting an SEM-based criterion for identifying meaningful intra-individual changes in health-related quality of life. J Clin Epidemiol 1999; 52: 861–73.CrossRef Google Scholar PubMed

Martin Bland, J, Altman, D. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986; 327: 307–10.CrossRef Google Scholar

Goranitis, I, Coast, J, Day, E, Copello, A, Freemantle, N, Seddon, J, et al. Measuring health and broader well-being benefits in the context of opiate dependence: the psychometric performance of the ICECAP-A and the EQ-5D-5L. Value Health 2016; 19(6): 820–8.CrossRef Google Scholar PubMed

Brazier, J. Is the EQ–5D fit for purpose in mental health? Br J Psychiatry 2010; 197: 348–9.CrossRef Google Scholar PubMed

Zlotnick, C, Kohn, R, Keitner, G, Della Grotta, SA. The relationship between quality of interpersonal relationships and major depressive disorder: findings from the National Comorbidity Survey. J Affect Disord 2000; 59: 205–15.CrossRef Google Scholar PubMed

Table 1 Patient characteristics and mean baseline OxCAP-MH, ICECAP-A, EQ-5D-5L and EQ-VAS scores

Table 2 Baseline scores of the relevant outcome measures used in the trial and the associated Spearman's rank correlations

Table 3 Exploratory factor analysis of the OxCAP-MH, ICECAP-A and EQ-5D-5L items with four factors using promin rotation (n = 78)a

Table 4 Number of patients improved, deteriorated or unchanged as defined by the investigated and anchor questionnaires (based on SEM) (n = 78)

Helter et al. supplementary material

PDF 1.3 MB

Submit a response

eLetters

No eLetters have been published for this article.

Article contents

Comparison of capability and health-related quality of life instruments in capturing aspects of mental well-being in people with schizophrenia and depression

Abstract

Keywords

Information

Method

Data

Instruments

Capability instruments

ICECAP-A

OxCAP-MH

HRQoL instruments

EQ-5D-5L, including EQ-VAS

Mental health-specific instruments

BDI-II

GAD-7

RSES

WEMWBS

Analyses

Convergent validity

Exploratory factor analysis

Responsiveness

Agreement analysis

Results

Participant characteristics

Convergent validity

Exploratory factor analysis

Responsiveness

Agreement analysis

Discussion

Summary and interpretation of the findings

Relationship between the findings and the existing literature

Strengths and limitations

Implications for practice/policy

Implications for future research

Supplementary material

Data availability

Acknowledgements

Author contributions

Funding

Declaration of interest

Footnotes

References

Helter et al. supplementary material

eLetters

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests