Integrating brain imaging features and genomic profiles for the subtyping of major depression

Liangying Yin; Yuping Lin; Jinghong Qiu; Yong Xiang; Ming Li; Xiao Xiao; Simon Sai-Yu Lui; Hon-Cheong So

doi:10.1017/S0033291725001096

Integrating brain imaging features and genomic profiles for the subtyping of major depression

Published online by Cambridge University Press: 22 May 2025

Liangying Yin

Yuping Lin ,

Jinghong Qiu ,

Yong Xiang ,

Ming Li ,

Xiao Xiao ,

Simon Sai-Yu Lui and

Hon-Cheong So

Show author details

Liangying Yin: Affiliation:
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
Yuping Lin: Affiliation:
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
Jinghong Qiu: Affiliation:
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
Yong Xiang: Affiliation:
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong
Ming Li: Affiliation:
KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Institute of Zoology and The Chinese University of Hong Kong, Hong Kong SAR, China State Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China Yunnan Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
Xiao Xiao: Affiliation:
KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Institute of Zoology and The Chinese University of Hong Kong, Hong Kong SAR, China State Key Laboratory of Genetic Evolution & Animal Models, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China Yunnan Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China Kunming College of Life Science, University of Chinese Academy of Sciences, Kunming, Yunnan, China
Simon Sai-Yu Lui: Affiliation:
Department of Psychiatry, The University of Hong Kong, Hong Kong, China Castle Peak Hospital, Hong Kong, China
Hon-Cheong So*: Affiliation:
School of Biomedical Sciences, The Chinese University of Hong Kong, Shatin, Hong Kong KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Institute of Zoology and The Chinese University of Hong Kong, Hong Kong SAR, China Department of Psychiatry, The Chinese University of Hong Kong, Hong Kong SAR, China CUHK Shenzhen Research Institute, Shenzhen, China Margaret K.L. Cheung Research Centre for Management of Parkinsonism, The Chinese University of Hong Kong, Hong Kong SAR, China Brain and Mind Institute, The Chinese University of Hong Kong, Hong Kong SAR, China Hong Kong Branch of the Chinese Academy of Sciences Center for Excellence in Animal Evolution and Genetics, The Chinese University of Hong Kong, Hong Kong SAR, China
*: Corresponding author: Hon-Cheong So; Email: hcso@cuhk.edu.hk

Article contents

Abstract
Background
Methods
Results
Conclusions
Introduction
Method
Results
Sex-adjusted clustering analysis
Discussion
Conclusions
Competing interests
References

Rights & Permissions

Abstract

Background

Precise stratification of patients into homogeneous disease subgroups could address the heterogeneity of phenotypes and enhance understanding of the pathophysiology underlying specific subtypes. Existing literature on subtyping patients with major depressive disorder (MDD) mainly utilized clinical features only. Genomic and imaging data may improve subtyping, but advanced methods are required due to the high dimensionality of features.

Methods

We propose a novel disease subtyping framework for MDD by integrating brain structural features, genotype-predicted expression levels in brain tissues, and clinical features. Using a multi-view biclustering approach, we classify patients into clinically and biologically homogeneous subgroups. Additionally, we propose approaches to identify causally relevant genes for clustering.

Results

We verified the reliability of the subtyping model by internal and external validation. High prediction strengths (PS) (average PS: 0.896, minimum: 0.854), a measure of generalizability of the derived clusters in independent datasets, support the validity of our approach. External validation using patient outcome variables (treatment response and hospitalization risks) confirmed the clinical relevance of the identified subgroups. Furthermore, subtype-defining genes overlapped with known susceptibility genes for MDD and were involved in relevant biological pathways. In addition, drug repositioning analysis based on these genes prioritized promising candidates for subtype-specific treatments.

Conclusions

Our approach successfully stratified MDD patients into subgroups with distinct clinical prognoses. The identification of biologically and clinically meaningful subtypes may enable more personalized treatment strategies. This study also provides a framework for disease subtyping that can be extended to other complex disorders.

Keywords

MDD subtyping multi-view biclustering brain structural features genotype-predicted gene expression brain tissues

Information

Type: Original Article
Information: Psychological Medicine , Volume 55 , 2025 , e158

DOI: https://doi.org/10.1017/S0033291725001096 [Opens in a new window]
Creative Commons: This is an Open Access article, distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives licence (http://creativecommons.org/licenses/by-nc-nd/4.0), which permits non-commercial re-use, distribution, and reproduction in any medium, provided that no alterations are made and the original article is properly cited. The written permission of Cambridge University Press must be obtained prior to any commercial use and/or adaptation of the article.
Copyright: © The Author(s), 2025. Published by Cambridge University Press

Introduction

According to a recent report (Vos et al., Reference Vos, Lim, Abbafati, Abbas, Abbasi, Abbasifard and Abdelalim2020), major depressive disorder (MDD) was ranked among the top 10 most disabling conditions across various age groups. Besides, the life expectancy for patients with depressive disorders is usually shorter than the general population due to both unnatural and natural causes of mortality (Kessler & Bromet, Reference Kessler and Bromet2013; Korhonen et al., Reference Korhonen, Moustgaard, Tarkiainen, Östergren, Costa, Urhoj and Martikainen2021; Vos et al., Reference Vos, Lim, Abbafati, Abbas, Abbasi, Abbasifard and Abdelalim2020). In recent years, genome-wide association studies (GWAS) and neuroimaging studies have advanced our understanding of the pathogenesis of MDD, although most advances have yet to be translated to clinical practice.

Precise classification of patients into more clinically and genetically homogeneous subgroups could facilitate our understanding of the underlying biological mechanisms and address the issue of phenotype heterogeneity (Yin et al., Reference Yin, Cheung, Chen, Wong, Sham and So2018; Yin, Chau, Sham, & So, Reference Yin, Chau, Sham and So2019). Importantly, it will also facilitate the discovery of subtype-specific treatments and promote individualized preventive and intervention strategies. In the extant literature, subtyping of patients with MDD mainly utilized clinical features only. The resultant MDD subtypes identified using clinical subtyping may not have distinct biological mechanisms. On the other hand, with the incorporation of genomic information and sophisticated subtyping methods, we may be able to stratify patients into more biologically homogenous subgroups. Nevertheless, limited research has applied genomic data for stratification of MDD patients.

Over the past two decades, substantial efforts have been made to dissect the genetic architecture of MDD (Howard et al., Reference Howard, Adams, Clarke, Hafferty, Gibson, Shirali and Wigmore2019; Major Depressive Disorder Working Group of the Psychiatric GWAS Consortium, 2013). Despite the substantial number of susceptibility loci identified from GWAS, translational applications remain scarce. One challenge is that the identified loci often reside in non-coding regions and may have only modest effects. To address this problem, here we employed a gene-based approach based on genotype-predicted expression changes, which is likely more biologically relevant and interpretable (Yin et al., Reference Yin, Chau, Sham and So2019) than SNP-based analysis. Of note, since the number of genes is large, preselection of relevant genes is common in many omics studies. However, most studies employed a univariate screening approach, yet a single gene may be associated with the outcome due to confounding (e.g. by other genes). Here we propose to employ a causal selection algorithm (PC-simple) (Bühlmann, Kalisch, & Maathuis, Reference Bühlmann, Kalisch and Maathuis2010) for gene selection, which may be considered a simplified version of the well-known PC algorithm (Kalisch & Bühlman, Reference Kalisch and Bühlman2007) for inferring causal features. We also propose to incorporate polygenic risk scores (PRS) of related disorders/traits for subtyping. PRS is a useful representation of the overall genetic predisposition of subjects to a disease by aggregating the effects of multiple variants (Murray et al., Reference Murray, Lin, Austin, McGrath, Hickie and Wray2021; Torkamani, Wineinger, & Topol, Reference Torkamani, Wineinger and Topol2018). While PRS has been widely applied in risk prediction studies, there have been very few applications in disease subtyping (Murray et al., Reference Murray, Lin, Austin, McGrath, Hickie and Wray2021; Torkamani et al., Reference Torkamani, Wineinger and Topol2018; Yin et al., Reference Yin, Chau, Sham and So2019).

Our main contributions are summarized below. Firstly, we present a new framework for neuropsychiatric disorder subtyping by integrating genotype-predicted expression profiles of relevant tissues, brain structural features, as well as other depression-related clinical features into a multi-view sparse bi-clustering model. The multi-view bi-clustering method is an unsupervised machine learning approach that allows multiple types of data to be considered simultaneously. Our work is unique in that we integrate brain imaging features (more specifically, grey matter volume in different brain regions) and whole-genome (imputed) transcriptome data for psychiatric disorder subtyping, which has not been attempted before. Compared with psychopathologies, neuroimaging characteristics are believed to be more directly reflecting the neuroanatomical basis of the disease. Another important novelty is that we incorporated a causal gene selection algorithm; owing to the causal selection approach, the identified subtype-defining gene-sets may be more functionally relevant to the underlying disease mechanisms.

Secondly, apart from methodological innovations, we applied our proposed framework to the study of MDD and revealed two subgroups of MDD with differing prognoses. Previous subtyping studies of MDD mostly focused on clinical features only and seldom integrated multiple sources of (high-dimensional) data for deriving patient subgroups (Van Loo et al., Reference Van Loo, De Jonge, Romeijn, Kessler and Schoevers2012). We identified two subgroups of MDD patients with divergent prognoses. The group with poorer prognosis is also characterized by reduced gray matter volume across multiple brain regions. In addition, we revealed genetic differences across the two subgroups and the pathways involved. We believe these findings are of both clinical interest and scientific importance.

Method

A novel disease subtyping/patient stratification model

In brief, our proposed framework comprised 4 stages, i.e. data imputation, feature selection, disease subtyping, and validation (as shown in Figure 1). Next, we shall describe each step in greater detail below.

Figure 1. The workflow for the proposed invention in identifying disease subtypes.

Data imputation

Given that clustering analysis could not accommodate missing data, imputation was performed. Different methods were used to impute missing clinical and genetic features. For clinical data, the R package ‘missForest’ (Stekhoven & Bühlmann, Reference Stekhoven and Bühlmann2012) was employed to impute the missing data by a random forest algorithm. As for the estimation of expression profiles from GWAS, we employed ‘PrediXcan’ (Gamazon et al., Reference Gamazon, Wheeler, Shah, Mozaffari, Aquino-Michaels, Carroll and Cox2015) developed by Gamazon et al. to impute expression levels of relevant tissues from genotype data. The algorithm first built elastic-net-based prediction models with expression levels as the outcome from an external reference dataset (GTEx), which contained both genotype and expression data. Then, the developed prediction model was applied to new genotype data to estimate the expression levels in different tissues.

Feature selection by a causal approach

In this study, we proposed to use a gene-phenotype causal network inference method to identify causally relevant genes for the disorder of interest. Figure 1 shows the feature selection process. Confounder adjustments were separately performed for the disorder of interest and imputed expression profiles of the corresponding subjects. Specifically, we regressed the confounders separately against each imputed gene and phenotype. The residuals from these regressions were then used as adjusted gene expression and phenotype profiles in subsequent analyses.

In our primary analysis, we chose not to adjust for sex during feature selection. The main consideration is that our primary objective is to identify biologically and clinically homogenous subgroups, and sex itself is one of the major drivers of the biological differences in MDD. As such, we allow sex differences to emerge naturally in the clustering results. However, we recognize that a sex-adjusted analysis could provide complementary insights. For instance, one might wish to identify subtypes that are independent of sex, where clustering is driven by other genetic or clinical factors. Therefore, we also performed a subsidiary clustering analysis adjusted for sex.

The PC-simple algorithm (Bühlmann et al., Reference Bühlmann, Kalisch and Maathuis2010; Kalisch et al., Reference Kalisch, Mächler, Colombo, Maathuis and Bühlmann2012) was employed to infer the causal relationships between genes and the outcome, based on imputed expression profiles. In brief, PC-Simple can be regarded as a generalization of correlation screening that utilizes the ordered independence screening algorithm to estimate the causal relationships between genes and phenotypes. Intuitively, for each gene of interest, all subsets of other genes are controlled for, and those genes that remain significant will be kept as the set of ‘causal’ genes. A more detailed description is presented below.

Let $ X=\left[{X}^1,{X}^2,\dots {X}^p\right] $ be a $ n\times p $ matrix of adjusted gene expression data for p genes, Y be a vector of the corresponding adjusted phenotype dataset for n subjects. Suppose Y is defined by a linear model of X, i.e.:

(1)

$$ Y=\sum \limits_{j=1}^p{\beta}^j{X}^j+\varepsilon $$

Where $ \varepsilon \sim N\left(0,\sum \right) $ denotes the noise item, and it is independent of $ {X}^j\left(j=1,2,\dots p\right) $ . For Equation (1), we believe most or some of the coefficients $ {\beta}^j $ are zero, while the remaining are nonzero for the studied phenotype. Our goal was to uncover the active gene set $ G=\left\{j=1,2,\dots, p;\hskip0.3em ;{\beta}^j\ne 0\right\} $ with non-zero coefficients. Under the partial faithfulness assumption, we have:

(2)

$$ \rho \left(Y,{X}^j|{X}^S\right)\ne 0\hskip0.3em for\hskip0.3em all\hskip0.3em S\subseteq {\left\{j\right\}}^C\; if\ and\ only\ if\;{\beta}^j\ne 0 $$

We could derive the active gene set through recursively performing partial correlation screening with increased order of conditional set based on (2) for all gene-phenotype pairs ( $ Y,{X}^j $ ). Specifically, we first set the conditional set to null ( $ S=\varnothing $ ) and obtained the first candidate gene set with non-zero correlations with our studied phenotype. Then we sequentially increased the order of the conditional set to eliminate irrelevant genes until the candidate gene set did not vary anymore. The partial correlations for each gene-phenotype pair can be estimated as follows:

(3)

$$ \begin{array}{l}\hat{\rho}\left(Y,{X}^j|{X}^S\right)\\ {}=\frac{\hat{\rho}\left(Y,{X}^j|{X}^S\backslash \left\{{X}^k\right\}\right)-\hat{\rho}\left(Y,{X}^k|{X}^S\backslash \left\{{X}^k\right\}\right)\hat{\rho}\left({X}^j,{X}^k|{X}^S\backslash \left\{{X}^k\right\}\right)}{{\left[\left\{1-\hat{\rho}{\left(Y,{X}^k|{X}^S\backslash \left\{{X}^k\right\}\right)}^2\right\}\left\{1-\hat{\rho}{\left({X}^j,{X}^k|{X}^S\operatorname{}\left\{{X}^k\right\}\right)}^2\right\}\right]}^{1/2}}\end{array} $$

We tested partial correlations by Fisher’s Z-transform, which can be expressed as follows:

(4)

$$ Z\left(Y,{X}^j|{X}^S\right)=\frac{1}{2}\left\{\frac{1+\hat{\rho}\left(Y,{X}^j|{X}^S\right)}{1-\hat{\rho}\left(Y,{X}^j|{X}^S\right)}\right\} $$

The null hypothesis ( $ \hat{\rho}\left(Y,{X}^j|{X}^S\right)=0 $ ) is rejected if the following holds (Bühlmann et al., Reference Bühlmann, Kalisch and Maathuis2010):

(5)

$$ {\left(n-\left|S\right|-3\right)}^{\frac{1}{2}}\left|Z\left(Y,{X}^j|{X}^S\right)\right|>{\varPhi}^{-1}\left(1-\frac{\alpha }{2}\right) $$

where $ \varPhi $ denotes the standard normal cumulative distribution function, $ \alpha $ denotes the significance level for the null hypothesis test. In this study, we set $ \alpha =0.05 $ , the default setting in the original paper (Bühlmann et al., Reference Bühlmann, Kalisch and Maathuis2010). To boost the computational efficiency, we set the maximum order for partial correlation screening to 3. All genes that survived the 3-order partial correlation screening were regarded as directly causal genes for the studied phenotype. After deriving the tissue-specific gene-phenotype causal links for the studied phenotype, we could distinguish the directly causal genes from other ones, which would be included for the subsequent disease subtyping process.

Disease subtyping

For disease subtype discovery, we employed an extension of the biclustering algorithm in (Sun, Lu, Xu, & Bi, Reference Sun, Lu, Xu and Bi2015). In brief, we performed biclustering by matrix decomposition. Suppose $ {X}^d $ is a $ n\times {m}_d $ data matrix from the clinical or genetic view of patients, where n is the sample size, $ d $ denotes the index of the ‘view’ to be modelled, and $ {m}_d $ is the number of features in the d ^th view. For example, if one models clinical and genotype-predicted expressions in one tissue, there will be two views. It is possible to extend the approach to more than two views, for example, based on expression in different tissues or using other (preferably gene-based) ‘omics’ profiles. It is worth emphasizing that due to the heterogeneity of patients, the pathophysiology (e.g. genetic pathways) underlying the disease may be different for different subgroups of patients. Using a biclustering algorithm, each bicluster can be characterized by different sets of gene features; in other words, we allow different genes to be involved for different subgroups of patients. This adds flexibility to our model and is an important advantage compared to ordinary clustering approaches.

Subgroups of patients can be simultaneously derived by performing a sparse rank one approximation on the original matrices $ {X}^d $ ( $ d=1,2,..D $ , indicating data matrices from different views that characterize the same set of patients), i.e.

(6)

$$ {X}^d\approx \mathit{\operatorname{diag}}(w){u}_d{v}_d^T $$

where $ w $ is a binary vector of size $ n $ , serving as a common factor that forces different views of data to agree on the same grouping of patients. $ \mathit{\operatorname{diag}}(w) $ is a diagonal matrix of size $ n\times n $ with diagonal entries equal to $ w. $ $ {u}_d $ of size $ n $ and $ {v}_d $ of size $ {m}_d $ are the rank-one approximations of $ {X}^d $ respectively. Rows in $ {X}^d $ corresponding to the non-zero entries of $ \mathit{\operatorname{diag}}(w) $ form the row subgroups, and columns in $ {v}_d $ form the column subgroups (a.k.a., sub-feature groups) in different views. Subgroups of patients based on different views of data can be derived by solving the following optimization problem:

(7)

$$ \underset{\boldsymbol{w},{\boldsymbol{u}}_{\boldsymbol{d}}{\boldsymbol{v}}_{\boldsymbol{d}},\boldsymbol{d}=\mathbf{1},\mathbf{2},..\boldsymbol{D}}{\mathbf{min}}\sum_{\boldsymbol{d}=\mathbf{1}}^{\boldsymbol{D}}{\left\Vert {\boldsymbol{X}}^{\boldsymbol{d}}-\boldsymbol{\operatorname{diag}}\left(\boldsymbol{w}\right){\boldsymbol{u}}_{\boldsymbol{d}}{\boldsymbol{v}}_{\boldsymbol{d}}^{\boldsymbol{T}}\right\Vert}_{\boldsymbol{F}}^{\mathbf{2}} $$

subject to $ {\left\Vert \boldsymbol{w}\right\Vert}_{\mathbf{0}}\mathbf{\le}{\boldsymbol{s}}_{\boldsymbol{w}},{\left\Vert {\boldsymbol{v}}_{\boldsymbol{d}}\right\Vert}_{\mathbf{0}}\mathbf{\le}{\boldsymbol{s}}_{{\boldsymbol{v}}_{\boldsymbol{d}}},\boldsymbol{d}\mathbf{\in}\left[\mathbf{1},\boldsymbol{D}\right],\boldsymbol{w}\mathbf{\in }{\mathbf{\mathcal{B}}}_{\boldsymbol{n}} $ where $ {s}_w $ and $ {s}_{v_d} $ ’s are hyper-parameters that need to be predetermined to enforce sparsity of $ w $ and $ {v}_d $ ’s, i.e. the number of patients $ {n}_{b_k} $ and number of selected features $ {n}_{v_k}^d $ in each subgroup of the corresponding data view. $ D $ is the number of data views incorporated for clustering and $ {B}_n $ is the set that contains all possible binary vectors of length $ n $ . To obtain subsequent subgroups, we need to first update the data matrices by excluding previously identified patients, and then solve Equation (7).

Our proposed approach is capable of selecting features during the clustering process; however, we need to predetermine the number of selected features in each data view. Here we follow the suggestions given by the original authors (Sun et al., Reference Sun, Lu, Xu and Bi2015). Specifically, the number of selected features ( $ {n}_{v_k}^d $ ) in each data view was set to the number where the accumulated variance after PCA of $ {X}^d $ was over 90%.

The algorithm also requires the number and size of subgroups to be specified beforehand. We consider a value range of 2–6 for the number of subgroups, and the minimum number of subjects in each subgroup ( $ \min \left({n}_{b_k}\right) $ ) was set to 20. This minimum subgroup size was chosen to balance between clinical utility, generalizability, and statistical separation. While smaller subgroups may show better mathematical separation, they are more prone to overfitting and may not be clinically meaningful or generalizable. Extremely small subgroups might represent outliers rather than true subtypes. Moreover, subgroups need to be large enough to allow for meaningful clinical characterization and potential validation in future studies. To assess the impact of this parameter, we also performed sensitivity analyses with smaller minimum subgroup sizes, as detailed below.

Suppose the number of subgroups is k, the size of each subgroup will be firstly set to a value roughly equal to $ n/k $ . Then all combinations by adding or subtracting $ \min \left({n}_{b_k}\right) $ in each subgroup will be tried. A grid search approach was employed to determine the optimal solution. An evaluation metric is required to find the optimal solution. One of the most used metrics is mean squared residue (MSR). However, it only assesses the homogeneity within each subgroup and does not consider the heterogeneity between different subgroups. For well-separated subgroups, patients within the same subgroups should be highly homogenous, while patients belonging to different subgroups should be highly heterogeneous. In view of this, we employed the sum of ratios of the between-bicluster and within-bicluster distance (BBD/WBD) (Yin et al., Reference Yin, Chau, Sham and So2019) proposed by Yin et al. as the evaluation metric to identify the optimal solution.

Validation

We employed external and internal validations (as shown in Figure 1) to verify our identified subtypes. Regarding external validation, this method was used when external data of disease outcomes were available (the disease outcome variables were not used for deriving the clusters). We evaluated whether disease outcomes were significantly different across the derived subgroups.

Regarding internal validation, here we utilized the extended ‘prediction strength’ (PS) method (Tibshirani & Walther, Reference Tibshirani and Walther2005) developed in our previous study (Yin et al., Reference Yin, Chau, Sham and So2019), which was designed for multi-view (bi-)clustering analysis to validate identified subgroups. Intuitively, this approach tests whether the derived clusters are generalizable to a new dataset. To calculate the PS, firstly we split the sample into a ‘training set’ and a ‘testing set’, and then evaluated whether the disease subtyping model derived from the training set could ‘predict’ the actual disease subgroups based on the testing set alone. In essence, it measured how well the ‘predicted’ co-memberships (based on the model derived from training set) matched with those in an independent testing set. In this study, we calculated both the ‘min PS’ and ‘ave PS’ to evaluate the performance of our proposed method. The ‘min PS’ and ‘ave PS’ respectively measured the lowest and average proportion of co-memberships among all identified subtypes, as follow:

(8)

$$ \mathrm{min}\hskip0.3em ps=c{v}_{ave}\;\left\{\;{\mathit{\min}}_{1\le j\le k}\frac{1}{n_j\left({n}_j-1\right)}\sum \limits_{i\ne {i}^{\prime}\in {A}_j}D{\left[C\left({X}_{tr},k\right),{X}_{te}\right]}_{i{i}^{\prime }}\right\} $$

(9)

$$ \mathrm{ave}\hskip0.3em ps=c{v}_{ave}\;\left\{\;{ave}_{1\le j\le k}\frac{1}{n_j\left({n}_j-1\right)}\sum \limits_{i\ne {i}^{\prime}\in {A}_j}D{\left[C\left({X}_{tr},k\right),{X}_{te}\right]}_{i{i}^{\prime }}\right\} $$

Here $ C\left({X}_{tr},k\right) $ indicates the clustering operation on the training set with $ k $ subgroups. $ D{\left[C\left({X}_{tr},k\right),{X}_{te}\right]}_{i{i}^{\prime }} $ denotes the co-membership for subjects $ i $ and $ {i}^{\prime } $ in subgroup $ {A}_j $ . $ {n}_j $ is the number of subjects in subgroup $ {A}_j $ . $ c{v}_{ave} $ refers to the average of all cross-validation folds. Tibshirani and Walther (Reference Tibshirani and Walther2005) suggested that a PS of 0.8 or 0.9 or above indicates a reasonably good prediction strength.

Permutation testing of prediction strength

To rigorously evaluate the possibility of a single-cluster solution, we conducted a permutation test to assess the statistical significance of the observed prediction strength (PS) for the identified clustering solution. Under the null hypothesis of a single, homogeneous cluster, we generated a null distribution by independently permuting the values within each feature column. This procedure disrupts any existing relationships between features and subjects, effectively simulating the null scenario. The permutation p-value was then calculated as the proportion of permutations yielding a PS value greater than or equal to the observed PS.

Evaluation of minimum subgroup size on clustering analysis

To evaluate the influence of the minimum subgroup size on the identified subgroups, we conducted additional sensitivity analyses by relaxing the minimum subgroup size to 10 and 5. Clustering performance was assessed using within- and between-cluster distances (specifically, BBD/WBD ratio). We hypothesize that while cluster separation (e.g. as measured by BBD/WBD ratio) may be better with smaller minimum subgroup sizes, this might be attributable to overfitting. Consequently, we predicted that the stability and generalizability of the clustering solution would be reduced with smaller minimum subgroup sizes. To test this, we computed prediction strength for solutions with minimum subgroup sizes of 10 and 5.

To further evaluate the minimum subgroup size, we also performed a bootstrap-based stability analysis, following the approach by Yu et al. (Reference Yu, Chapman, Di Florio, Eischen, Gotz, Jacob and Blair2019). The stability of different clustering solutions was assessed using the Adjusted Rand Index (ARI) (Zhang, Wong, & Shen, Reference Zhang, Wong and Shen2012), a widely used metric for comparing clustering solutions. We employed bootstrap resampling (100 iterations) to calculate the co-membership matrix for each clustering solution. The co-membership matrix represents the proportion of times each pair of subjects is clustered together across the bootstrap samples. The ARI was then used to quantify the similarity between the co-membership matrix derived from the bootstrapped data and that derived from the original clustering solution.

Further analyses to evaluate the relevance of the subtype-defining genes

To further validate the identified subgroups, we examined whether genes selected by our proposed approach were enriched for GWAS hits for depression. Specifically, the GWAS summary statistics for depression (Howard et al., Reference Howard, Adams, Clarke, Hafferty, Gibson, Shirali and Wigmore2019) were first converted to gene-based statistics by FASTBAT (Bakshi et al., Reference Bakshi, Zhu, Vinkhuyzen, Hill, McRae, Visscher and Yang2016), then we tested whether the genes selected by our clustering framework had lower p-values than the non-selected ones.

In addition, we extracted genes selected by our method to figure out the genetic underpinning of each identified depression subtype. We then investigated whether there were significant differences in the expression profiles for these selected genes across the subgroups. Specifically, we employed both the default ‘t.test’ function in R and the ‘eBayes’ function from the R package ‘limma’ (Smyth, Reference Smyth2004) to detect differentially expressed genes (DEGs). Pathway analyses (based on hypergeometric tests) were also conducted using ‘ConsensumPathDB’ (Kamburov et al., Reference Kamburov, Pentchev, Galicka, Wierling, Lehrach and Herwig2011; Kamburov, Stelzl, Lehrach, & Herwig, Reference Kamburov, Stelzl, Lehrach and Herwig2013) to further explore the biological mechanisms. Furthermore, we conducted drug enrichment analyses on ‘Enrichr’ (Kuleshov et al., Reference Kuleshov, Jones, Rouillard, Fernandez, Duan, Wang and Lachmann2016) to identify repositioning candidates for each subtype.

Application to depression patients

We applied our disease subtyping model to depression subjects in the UK BioBank (UKBB). Here depression is defined by a combination of ICD-10 coded and self-reported disease. Since high missing rates may affect the imputation accuracy, we only keep patients with a comparably lower missing rate. It has been suggested that ~10% is a reasonable missingness cutoff, below which satisfactory imputation accuracy can be expected (Dong & Peng, Reference Dong and Peng2013; Madley-Dowd, Hughes, Tilling, & Heron, Reference Madley-Dowd, Hughes, Tilling and Heron2019). Following this, we only keep patients with variable missing rates less than 10%. Specifically, we included a variety of clinical features, including demographic/socioeconomic factors, mental health factors (especially symptoms related to depression), behavioral factors, medical history, and current health status (in particular cardiometabolic abnormalities) (Murray et al., Reference Murray, Lin, Austin, McGrath, Hickie and Wray2021; Torkamani et al., Reference Torkamani, Wineinger and Topol2018), for the subtyping analysis. Please refer to Supplementary Table S1 for all features. ¹³Besides, we estimated the expression levels for the cortex, frontal cortex, nucleus accumbens (basal ganglia), and putamen (basal ganglia) by ‘PrediXcan’ (Gamazon et al., Reference Gamazon, Wheeler, Shah, Mozaffari, Aquino-Michaels, Carroll and Cox2015) for all subjects with available genotypes. Previous studies suggested involvement of these brain regions in the pathophysiology of depression (Hare & Duman, Reference Hare and Duman2020; Lacerda et al., Reference Lacerda, Nicoletti, Brambilla, Sassi, Mallinger, Frank and Soares2003; Pandya, Altinay, Malone, & Anand, Reference Pandya, Altinay, Malone and Anand2012; Pizzagalli et al., Reference Pizzagalli, Holmes, Dillon, Goetz, Birk, Bogdan and Fava2009; Stockmeier & Rajkowska, Reference Stockmeier and Rajkowska2022; Zhang et al., Reference Zhang, Peng, Sweeney, Jia and Gong2018).

Besides, we also calculated PRSs of related neuropsychiatric traits/disorders. The traits for constructing PRS included autism spectrum disorder (ASD; N = 46,350) (Grove et al., Reference Grove, Ripke, Als, Mattheisen, Walters, Won and Anney2019), attention deficit hyperactivity disorder (ADHD; N = 53,293) (Demontis et al., Reference Demontis, Walters, Martin, Mattheisen, Als, Agerbo and Bækvad-Hansen2019), schizophrenia (SCZ; N = 105,318) (Pardiñas et al., Reference Pardiñas, Holmans, Pocklington, Escott-Price, Ripke, Carrera and Hamshere2018), bipolar disorder (BP; N = 41,653) (Ruderfer et al., Reference Ruderfer, Ripke, McQuillin, Boocock, Stahl, Pavlides and Loohuis2018), MDD (N = 500,199) (Howard et al., Reference Howard, Adams, Clarke, Hafferty, Gibson, Shirali and Wigmore2019) and post-traumatic stress disorder (PTSD; N = 200,000) (Nievergelt et al., Reference Nievergelt, Maihofer, Klengel, Atkinson, Chen, Choi and Gelernter2019). GWAS summary statistics were downloaded from the Psychiatric Genomics Consortium (PGC) (https://www.med.unc.edu/pgc) and The Integrative Psychiatric Research project (iPSYCH).

Before the standard PRS analysis, LD-clumping was required. In this application, we performed LD-clumping at $ {r}^2=0.1 $ within a distance of 1000 Kb (Choi & O’Reilly, Reference Choi and O’Reilly2019). PRS was generated by PRsice with a P-value threshold of 0.1. We incorporated these 6 PRSs as clinical features. In total, we included 69 depression-related clinical features, 139 brain structural features (volume of grey matter of different brain regions) and 6 PRSs of related neuropsychiatric disorders as input features in the clinical view.

Results

Overview and identification of causal genes

We extracted 28,335 depression subjects as cases and 285,921 as controls from UKBB as the input for the causal inference-based feature selection. To avoid possible biases introduced by population structure, we adjusted the predicted tissue-specific expression profiles and phenotype data by the top 10 principal components (PCs) of the corresponding genotype dataset first. Then we used the corrected input data to identify causally relevant genes for depression in each tissue. We identified 108, 101, 94, and 76 genes for cortex, frontal cortex, nucleus accumbens basal ganglia, and putamen basal ganglia, respectively. These genes were included as input in the corresponding genetic views for the subtyping of depression patients. In the clustering analysis, we included 352 depression patients with available brain imaging data for subtyping.

Feature selection and data view composition

We incorporated 5 different views for the subtyping of depression patients: one clinical view with 139 brain structural features and PRS of 6 related disorders, and 4 genetic views corresponding to 4 brain regions with predicted expression profiles of causally relevant genes. As mentioned earlier, PCA was employed to determine the number of selected features in each data view. Table 1 lists the number of selected features in the corresponding data view.

Table 1. The number of selected features in each data view

Identification of two distinct depression subgroups

The best performance was achieved when the depression subjects were stratified into 2 different subgroups with 20 subjects in the first subgroup and 332 subjects in the second subgroup (Supplementary Figure S1). For clinical features, 63 out of 145 features were selected as subtype-defining features (see Table 1), and all of them were brain structural features (Supplementary Table S2). Table 2 summarizes the top 20 most significantly different (defined by the derived p-values from the t-test by comparing the brain volumes of the 2 subgroups) subtype-defining clinical features. We observed significant differences in selected features between the two identified subgroups (Figure 2 and Supplementary Table S2). Notably, among the selected subtype-defining brain structural features, most were shared in two subtypes. Significant differences in gene expression levels were observed on some of the identified subtyping-defining genes using limma (Table 3). Similar results were observed using a t-test for differential expression analysis (Supplementary Table S3).

Table 2. The 20 brain imaging features with the most significant differences (ranked by p-value) between the 2 discovered MDD subtypes

Figure 2. Comparison of selected brain imaging features for depression patients between 2 subtypes.

Table 3. Differentially expressed genes(DEGs) between the 2 depression subtypes, analyzed using limma (with gene expression in subgroup one as baseline)

Clinical validation: treatment resistance and psychiatric admissions

Using the UKBB, we gathered prognosis-related variables for these depression subjects. More specifically, we extracted the treatment resistance status and admission frequency of corresponding patients from the general practitioner (GP) records to evaluate our identified depression subtypes. In this study, treatment-resistance depression (TRD) was defined as MDD patients who tried at least two different antidepressant drugs for adequate durations (Fabbri et al., Reference Fabbri, Hagenaars, John, Williams, Shrine, Moles and Free2021). Following the definition in ref (Fabbri et al., Reference Fabbri, Hagenaars, John, Williams, Shrine, Moles and Free2021), the time interval between two drugs should be no longer than 14 weeks, and each drug should be prescribed for at least 6 weeks.

Since the medication records are not available for all subjects in the UKBB, we only gathered the treatment resistance status for 292 patients. When comparing the differences in TRD between the two derived subgroups, patients with missing values were excluded. Fisher’s exact test was performed to examine whether there exist significant differences between the two derived subgroups for the missing rate of drug records. Notably, we did not find any significant differences between the two subgroups in accessibility of patient records for TRD status (p = 0.356). We observed significant differences in TRD between the two derived subgroups using a regression model (higher rates of TRD in subgroup 2, p = 0.049) (Figure 3).

Figure 3. Comparison of TRD status by subgroups for depression patients.

Moreover, we compared the differences in psychiatric admission frequencies across the two derived subgroups using the Wilcoxon signed rank test. In line with TRD, subjects with missing values were excluded. The admission frequency records for 307 patients were available for further analysis. No significant difference was observed in the availability of patients’ records for psychiatric admissions across the two subgroups (Fisher exact test, p-value = 0.730). Again, we observed a significant difference in hospitalization frequencies between the two subgroups (p = 0.033) (Figure 4).

Figure 4. Comparison of admission frequencies by subgroups for depression patient.

Prediction strength

Furthermore, we computed the extended prediction strength (PS) of our identified 20–332 solution and observed a minimum PS of 0.854 and an average PS of 0.896 (Table 4). According to (Tibshirani & Walther, Reference Tibshirani and Walther2005), a PS of > = 0.8 or 0.9 suggests strong stability of the clustering model and good generalizability to new datasets. Therefore, we could conclude that our proposed method is reliable and stable in revealing depression subtypes. Moreover, we compared our method with clinical variable-based disease subtyping, which only relies on non-genetic features. Table 4 shows that our proposed method, which also integrates whole-genome (GWAS-imputed) transcriptome data, achieved higher ‘min PS’ and ‘average PS’ than the clinical variable-only subtyping method.

Table 4. Comparison of extended PS derived from different solutions

Beyond the overall PS, we performed additional analyses to validate our findings. First, the PS for the 20–332 solution was significantly higher than expected under a single, homogenous cluster (p = 0.010) based on permutation, providing strong evidence for the existence of distinct subgroups. Second, following the principles of PS and using the clustering model derived from the training set only, we tested for differences in treatment-resistant depression (TRD) across the identified subgroups in the test set. We observed significant differences in TRD (p = 0.042), further supporting the clinical relevance and generalizability of our identified subtypes.

Summary of evidence supporting distinct subgroups

To summarize, evidence strongly suggests that a two-cluster solution more accurately represents the data than a single-cluster model. This conclusion is supported by significant differences in clinical outcomes and biological markers (gene expression and imaging features) across the subgroups and permutation tests demonstrating that the prediction strength of the two-cluster solution significantly exceeds what would be expected under a single-cluster null hypothesis (p < 0.05).

GWAS enrichment analysis

We downloaded the GWAS summary statistic for depression from PGC (Howard et al., Reference Howard, Adams, Clarke, Hafferty, Gibson, Shirali and Wigmore2019) to examine whether our selected genes were enriched for GWAS hits. Table 5 shows the enrichment analysis results. As expected, the genes picked up by our proposed framework were indeed enriched for known genes for depression. More specifically, subtype-defining genes for nucleus accumbens basal ganglia and putamen basal ganglia were significantly enriched for GWAS hits of depression.

Table 5. Enrichment analysis results for GWAS hits of depression

Functional annotation and pathway enrichment

Supplementary Table S3 demonstrates the subtype-defining gene sets identified by our proposed framework. Many selected (subtype-defining) genes were well-known susceptibility genes for depression or involved in the related pathophysiological process. For example, DTNBP1, NFYC, and PRKCH were selected by our algorithm as subtype-defining genes and showed differential expression across subtypes. Arias et al. (Reference Arias, Serretti, Mandelli, Gastó, Catalan, De Ronchi and Fananas2009) highlighted a significant association between DTNBP1 and responsiveness to antidepressants in MDD patients. Kocabas et al. (Reference Kocabas, Antonijevic, Faghel, Forray, Kasper, Lecrubier and Noro2010) corroborated this finding, citing its involvement in synaptic signaling and plasticity in patients with MDD. In another study, Fabbri and Serretti (Reference Fabbri and Serretti2016) demonstrated that NFYC was predictive of the depressive episode frequency in bipolar patients. Wong, Dong, Maestre-Mesa, and Licinio (Reference Wong, Dong, Maestre-Mesa and Licinio2008) revealed that PRKCH can affect the treatment response of MDD patients through dysregulating T cell and hypothalamic–pituitary–adrenal axis functions. For more details, please refer to Supplementary Table S3.

Supplementary Table S4 lists the top enriched pathways that characterize the identified disease subtypes. Numerous enriched pathways were involved in depression or related pathophysiology. Again, some enriched pathways were shared among two identified subtypes, while others were subtype-specific. Here we highlighted a few pathways that may be of interest. Regulation of PTEN stability and activity and tyrosine metabolism were found to be significantly enriched for depression patients. As suggested by Wang et al. (Reference Wang, Zhang, Xia, Chen, Fang and Ding2021), regulation of PTEN stability and activity could lead to an increase in depression-related behaviors in mice. Tyrosine metabolism might be related to anhedonia, a core symptom of depression, according to a recent study by Bekhbat et al. (Reference Bekhbat, Treadway, Goldsmith, Woolwine, Haroon, Miller and Felger2020). They also suggested that this pathway defined a subtype of depression with high C-reactive protein (CRP) and anhedonia. A study by Matsuda et al. (Reference Matsuda, Ikeda, Murakami, Nakagawa, Tsuji and Kitagishi2019) suggested that the ‘PIP3 activates AKT signaling’ pathway plays a critical role in the survival of various neurons, which may be implicated in depression-related behaviors. Chu et al. (Reference Chu, Wei, Zhu, Shen and Xu2017) reported that prostaglandin (PG) synthesis and regulation could lead to the onset of depression through downregulating PG D2 levels in the plasma. For more details about other enriched pathways, please refer to Supplementary Table S4.

Drug enrichment analysis

Supplementary Table S5 summarizes the enriched drugs for each identified depression subtype. Many were supported by previous studies for reversing depression-related symptoms or behaviors. For example, apigenin, and kaempferol were ranked among the top 5 enriched drugs for subtype 1 and valproic acid was ranked among the top 5 for subtype 2 (based on imputed expression in the cortex). Weng et al. (Reference Weng, Guo, Li, Yang and Han2016) found that apigenin could reverse depression-like behaviors in mice. Silva dos Santos, Goncalves Cirino, de Oliveira Carvalho, and Ortega (Reference Silva dos Santos, Goncalves Cirino, de Oliveira Carvalho and Ortega2021) reported that kaempferol, a flavonoid antioxidant, has a multipotential neuroprotective effect on depression. Valproic acid is a standard treatment for bipolar disorder and has been shown to be effective for bipolar depression (Smith et al., Reference Smith, Cornelius, Azorin, Perugi, Vieta, Young and Bowden2010). For more details on the corresponding enriched drugs, please refer to Supplementary Table S5.

Evaluation of smaller minimum subgroup sizes

Several sensitivity analyses were conducted to evaluate the influence of the minimum subgroup size on the identified subgroups (Table 6).

Table 6. Comparison of clustering solutions with varied minimum subgroup sizes

Notes: Min is the abbreviation for minimum, Ave is the abbreviation for average, SD is the abbreviation for standard deviation. ARI ranges between (−1,1), with ARI = 1 indicates perfect agreement between two clustering, ARI = 0 indicates no better than random agreement, ARI < 0 indicates worse than random agreement, TRD is the abbreviation for treatment resistance depression. For the clustering analysis with minimum subgroup size of 10 and 5, we employed Fisher’s exact test to compare the differences in TRD between the derived subgroups.

Prediction strength

While better BBD/WBD ratios were achieved with a minimum subgroup size at 10 or 5, subtyping solutions derived with smaller minimum subgroup sizes exhibited inferior prediction strength (PS) (Table 6), suggesting reduced generalizability and possible overfitting. The 20–332 cluster solution represents the solution achieving the best separation (as indicated by the BBD/WBD ratio), while having an average PS > = 90% (rounded to the nearest integer).

Bootstrap-based stability analysis

The stability of different clustering solutions was assessed with bootstrap resampling and ARI. Compared to solutions with smaller minimum subgroup sizes, the original solution with a minimum subgroup size of 20 exhibited significantly greater model stability and generalizability, as evidenced by a high ARI (0.89) and low SD of ARI across the bootstrap samples.

Differences in clinical outcomes

For subgroups derived with minimum subgroup sizes of 10 or 5, we did not observe significant differences in clinical outcomes (all p > 0.05), including treatment resistance and admission frequencies.

In summary, the 20–332 solution demonstrated a balance of optimal clustering performance, model stability, and generalizability. External validation confirmed the clinical relevance of the identified subgroups, with significant differences observed in clinical outcomes.

Sex-adjusted clustering analysis

We performed an additional clustering analysis adjusting for sex as a covariate. We identified 331, 293, 346, and 271 genes in the cortex, frontal cortex, nucleus accumbens basal ganglia, and putamen basal ganglia, respectively, as relevant features. Similar to our primary analysis, the optimal solution stratified subjects into two subgroups (20 and 332 subjects). The concordance of cluster labels between the sex-adjusted and original solutions was high for the larger subgroup (97.0%), though lower for the smaller subgroup (50%). The sex-adjusted analysis identified more subgroup-defining genes, which may be due to potentially better statistical power in covariate-adjusted models (Jiang et al., Reference Jiang, Kulkarni, Mallinckrodt, Shurzinske, Molenberghs and Lipkovich2017).

The derived subgroups from the sex-adjusted analysis showed distinct clinical outcomes. Specifically, significant differences in treatment-resistant depression (TRD) (p = 0.034) and hospitalization frequencies (p = 0.046) were observed between the subgroups (Supplementary Figures S2 and S3), mirroring the findings of our primary analysis. Furthermore, no significant differences in TRD or hospitalization frequencies were observed between the subgroups derived from the original and the sex-adjusted clustering solutions. These results suggest that adjusting for sex does not fundamentally alter the identification of clinically relevant MDD subgroups in our dataset. Detailed results are provided in the Supplementary Information (see Supplementary Tables S3e–i).

Clustering with principal components as input

We also conducted clustering analysis using principal components (PCs) accounting for >90% variance in each view as input. While this approach captured more variance, the resulting subgroups did not show significant differences in clinical outcomes (Supplementary Table S6). Therefore, our original approach of using PCA to select features directly may be more effective for identifying clinically distinct MDD subgroups. Please refer to the supplementary information for details.

Discussion

In this proof-of-concept study, we proposed a novel framework to identify depression subtypes by incorporating both clinical (primarily brain structural phenotypes) and genetic information. To the best of our knowledge, it is the first study to integrate (1) brain structural (imaging) data; (2) whole-genome (GWAS-imputed) transcriptome data; (3) depression-related clinical features; and (4) a causal algorithm of gene selection into a multi-view clustering framework for subtyping complex diseases.

Key findings and biological relevance

We demonstrated the validity of our proposed framework by applying it to MDD subjects extracted from the UKBB. Two different depression subtypes with significant differences in treatment resistance status and hospitalization frequencies were identified. More precisely, we found one subgroup with a good prognosis and another subgroup with a relatively poorer prognosis. Notably, we found that subgroup 2, the group with poorer prognosis, is characterized by reduced gray matter volumes across multiple brain regions. This finding corroborates previous studies (Ancelin et al., Reference Ancelin, Carrière, Artero, Maller, Meslin, Ritchie and Chaudieu2019; Hellewell et al., Reference Hellewell, Welton, Maller, Lyon, Korgaonkar, Koslow and Grieve2019; Wise et al., Reference Wise, Radua, Via, Cardoner, Abe, Adams and de Azevedo Marques Périco2017) also showing decreased gray matter volume in patients with depression or severe depression. For example, Hellewell et al. (Reference Hellewell, Welton, Maller, Lyon, Korgaonkar, Koslow and Grieve2019) showed replicable gray matter volume losses in various brain regions (such as cingulate gyrus, middle frontal gyrus) for MDD patients compared to normal subjects.

Internal validation (by PS) also supported the stability of our proposed method in revealing depression subtypes and satisfactory generalizability to new datasets. Furthermore, the subtype-defining genes were significantly enriched for GWAS hits for depression. We believe this proof-of-concept study has the potential to be extended to larger study samples in the future.

Encouragingly, many identified subtyping-defining overlaps with known susceptibility genes for MDD and were involved in relevant pathophysiological processes. For instance, as the top genes that were differentially expressed between 2 MDD subgroups, DTNBP1 and PRKCH have been proven to be associated with treatment response in MDD patients. Besides, the top enriched pathways that differentiate the two subgroups were also implicated in depression in previous studies.

Limitations related to polygenic risk scores and clinical features

On the other hand, we did not observe significant differences in the included PRSs between the two subgroups. This suggests that the information contained in PRS may have been captured by other genetic and/or non-genetic variables included in our analysis. The lack of consistent association between PRS and depression severity may also explain why PRS failed to show up as a subgroup-defining feature (Fusar-Poli et al., Reference Fusar-Poli, Rutten, van Os, Aguglia and Guloksuz2022). It remains to be studied whether PRS may improve disease subtyping in larger samples.

In a similar vein, it is worth noting that most clinical features (listed in Supplementary Table S1) were not selected by the clustering ML algorithm. There are several possible reasons. For example, the depression features included in UKBB were not based on validated depression scales and primarily came from self-reports only. In addition, the symptoms were not necessarily assessed during depressive episodes, and many patients may have been in remission when they answered the questionnaire. Longitudinal measures were also missing. We also speculate some ‘healthy volunteer bias’ could be present (Glanville et al., Reference Glanville, Coleman, Howard, Pain, Hanscombe, Jermy and O’reilly2021) as patients who were having an active depressive episode might be less likely to respond to questionnaires. Taken together, the depression symptoms recorded in UKBB may not be very informative. Although we also included various other features (e.g. behavioral factors, cardiometabolic measures etc.), these may suffer from similar limitations.

Overall speaking, our results may suggest that symptom-based features alone, especially if routinely collected, may not be sufficient for disease subtyping into biologically homogenous subtypes. Brain imaging features may be relatively more stable (Hao et al., Reference Hao, Talati, Shankman, Liu, Kayser, Tenke and Weissman2017). Genetic features are also fixed over time. They also have the advantage of being objectively measured and may reflect the biological basis of the disease better. It is encouraging that neuroimaging and genetic features together identify two subgroups with differing prognoses in our study. On the other hand, inclusion of more detailed clinical phenotypes and validated depression scales remains useful in future studies.

Strengths of the study

This study has several strengths. This study is, to our knowledge, the first to classify depression patients into biologically homogeneous subgroups by integrating neuroimaging, genome-wide (imputed) expression data, and other clinical features using a multi-view biclustering approach. Secondly, to our knowledge, we were the first to employ a causal inference approach to identify causally relevant genes to inform the feature selection process for clustering. Of note, compared to our previous work (Yin et al., Reference Yin, Chau, Sham and So2019), our proposed framework is different and novel in that we also incorporated causal feature selection in the clustering process and included high-dimensional neuroimaging features for subtyping. Thirdly, our proposed framework allowed transcriptomes from different tissues (here brain regions) to be modeled, which greatly improved the method’s flexibility. In contrast, raw expression data of brain regions are usually hardly accessible, costly to obtain, and only available for post-mortem samples. Since the gene expression levels were predicted from genotypes, they were unlikely to be confounded by other factors, such as medication usage, comorbidities, sample processing differences, and so forth. Our framework is also generalizable and could be used to subtype other neuropsychiatric disorders.

Comparison to other subtyping studies

Subtyping/cluster analysis of MDD is an important research area, and there were numerous previous studies employing different types of features for subtyping. We refer to (Beijers, Wardenaar, van Loo, & Schoevers, Reference Beijers, Wardenaar, van Loo and Schoevers2019; Harald & Gordon, Reference Harald and Gordon2012) for a detailed review of relevant studies but will highlight several of the most relevant works here. In one recent review by Beijers et al. (Reference Beijers, Wardenaar, van Loo and Schoevers2019), 29 papers covering 24 separate analyses were covered. Only one study employed genetic data for clustering (Yu, Arcos-Burgos, Licinio, & Wong, Reference Yu, Arcos-Burgos, Licinio and Wong2017). Using genetic variants from the Illumina HumanExome BeadChip, the authors performed hierarchical clustering analysis and identified an MDD subtype in their Mexican-American cohort. The MDD latent subtype showed some differences with the rest of the MDD subjects in terms of several symptoms (e.g. insomnia, anxiety, paranoia), yet no differences withstand multiple testing correction. Compared to this only prior study that used genomic data for MDD subtyping, our work differed in a range of aspects. Firstly, we integrated brain structural features together with genomics data for subtyping. In addition, we employed a more advanced multi-view biclustering algorithm that achieves consensus clusters across multiple data types. Our genetic data is based on GWAS data instead of variants in the exome regions alone. We also performed analysis at the gene level (using imputed expression) instead of the SNP level, which may be functionally more relevant with easier interpretation of results.

We also noted several studies that have employed neuroimaging features for MDD subtyping (Cheng et al., Reference Cheng, Xu, Yu, Nie, Li, Luo and Shan2014; Drysdale et al., Reference Drysdale, Grosenick, Downar, Dunlop, Mansouri, Meng and Etkin2017; Feder et al., Reference Feder, Sundermann, Wersching, Teuber, Kugel, Teismann and Pfleiderer2017; Price, Gates, Kraynak, Thase, & Siegle, Reference Price, Gates, Kraynak, Thase and Siegle2017; Price et al., Reference Price, Lane, Gates, Kraynak, Horner, Thase and Siegle2017). We refer the readers to (Beijers et al., Reference Beijers, Wardenaar, van Loo and Schoevers2019) for a review of these studies. A clear strength of our work is that we also integrated genomic data in subtyping, which we have shown to improve the prediction strength, an indicator of generalizability. Moreover, the largest sample size (N) for previous neuroimaging-based subtyping studies (based on review Beijers et al., Reference Beijers, Wardenaar, van Loo and Schoevers2019) was ~220 (Drysdale et al., Reference Drysdale, Grosenick, Downar, Dunlop, Mansouri, Meng and Etkin2017); our current work is therefore already one of the largest studies of a similar kind.

We also noted another related work that described genetic heterogeneity across several depression subtypes (Nguyen et al., Reference Nguyen, Harder, Xiong, Kowalec, Hägg, Cai and Lu2022). We emphasized that the objectives of this study are grossly different from Nguyen et al. (Reference Nguyen, Harder, Xiong, Kowalec, Hägg, Cai and Lu2022), who first defined MDD subtypes based on some predefined clinical characteristics and then compared the genetic basis across the subgroups. In contrast, our goal is to perform data-driven MDD subtyping based on a combination of (high-dimensional) neuroimaging and genomic features, which may be considered a more challenging task.

Limitations

Several limitations of this study should be borne in mind. First, the sample size of MDD patients with full genotype, brain structural, and other phenotype data was modest. Although this is already one of the largest studies that included neuroimaging features for subtyping, further increasing the sample size is an important next step. The limited sample size may also have restricted our ability to identify finer subgroups of MDD. Here we found a two-cluster solution to be the most optimal (balancing cluster separation and generalizability); however, we expect with larger sample sizes, finer subtypes and a larger number of clusters may be revealed. Second, we did not seek to replicate our proposed clustering model in independent datasets because such additional datasets with brain structural features and genotype information were not available at this point. Instead, we demonstrated the validity of our proposed approach by prediction strength and external validation using prognostic features. Third, we had relatively limited access to different outcome-related variables for validating our subgroups. However, with the continuous updates from UKBB, we expect this problem may be resolved in future studies.

Conclusions

To conclude, we have proposed a new disease subtyping framework, which was capable of identifying depression subtypes by utilizing genotype-predicted expression levels of relevant brain tissues and brain structural information as well as PRS of other diseases. Genes were selected based on a causal inference framework such that the most functionally relevant genes could be included for disease subtyping. Our proposed approach may open a new avenue for integrating genomic and neuroimaging data for the subtyping of neuropsychiatric disorders. We also believe this is a valuable endeavor to exploit the usage of causal inference for translational medicine and disease subtyping.

Supplementary material

The supplementary material for this article can be found at http://doi.org/10.1017/S0033291725001096.

Acknowledgements

This work was supported partially by an Innovation and Technology Fund (ITS/113/19), a National Natural Science Foundation China (NSFC) grant (81971706), the Lo Kwee Seong Biomedical Research Fund from The Chinese University of Hong Kong, the KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, and the Hong Kong Branch of the Chinese Academy of Sciences Center for Excellence in Animal Evolution and Genetics, The Chinese University of Hong Kong.

Competing interests

A patent based on this work was issued (HK22022054559.8). The authors declare that there is no conflict of interest.

References

Ancelin, M., Carrière, I., Artero, S., Maller, J., Meslin, C., Ritchie, K., … Chaudieu, I. (2019). Lifetime major depression and grey-matter volume. Journal of Psychiatry and Neuroscience, 44(1), 45–53.10.1503/jpn.180026CrossRef Google Scholar PubMed

Arias, B., Serretti, A., Mandelli, L., Gastó, C., Catalan, R., De Ronchi, D., & Fananas, L. (2009). Dysbindin gene (DTNBP1) in major depression: Association with clinical response to selective serotonin reuptake inhibitors. Pharmacogenetics and Genomics, 19(2), 121–128.10.1097/FPC.0b013e32831ebb4bCrossRef Google Scholar PubMed

Bakshi, A., Zhu, Z., Vinkhuyzen, A. A., Hill, W. D., McRae, A. F., Visscher, P. M., & Yang, J. (2016). Fast set-based association analysis using summary data from GWAS identifies novel gene loci for human complex traits. Scientific Reports, 6(1), 1–9.10.1038/srep32894CrossRef Google Scholar PubMed

Beijers, L., Wardenaar, K. J., van Loo, H. M., & Schoevers, R. A. (2019). Data-driven biological subtypes of depression: Systematic review of biological approaches to depression subtyping. Molecular Psychiatry, 24(6), 888–900.10.1038/s41380-019-0385-5CrossRef Google Scholar PubMed

Bekhbat, M., Treadway, M. T., Goldsmith, D. R., Woolwine, B. J., Haroon, E., Miller, A. H., & Felger, J. C. (2020). Gene signatures in peripheral blood immune cells related to insulin resistance and low tyrosine metabolism define a sub-type of depression with high CRP and anhedonia. Brain, Behavior, and Immunity, 88, 161–165.10.1016/j.bbi.2020.03.015CrossRef Google Scholar PubMed

Bühlmann, P., Kalisch, M., & Maathuis, M. H. (2010). Variable selection in high-dimensional linear models: Partially faithful distributions and the PC-simple algorithm. Biometrika, 97(2), 261–278.10.1093/biomet/asq008CrossRef Google Scholar

Cheng, Y., Xu, J., Yu, H., Nie, B., Li, N., Luo, C., … Shan, B. (2014). Delineation of early and later adult onset depression by diffusion tensor imaging. PloS One, 9(11), e112307.10.1371/journal.pone.0112307CrossRef Google Scholar PubMed

Choi, S. W., & O’Reilly, P. F. (2019). PRSice-2: Polygenic risk score software for biobank-scale data. Gigascience, 8(7), giz082.10.1093/gigascience/giz082CrossRef Google Scholar PubMed

Chu, C., Wei, H., Zhu, W., Shen, Y., & Xu, Q. (2017). Decreased prostaglandin D2 levels in major depressive disorder are associated with depression-like behaviors. International Journal of Neuropsychopharmacology, 20(9), 731–739.10.1093/ijnp/pyx044CrossRef Google Scholar PubMed

Demontis, D., Walters, R. K., Martin, J., Mattheisen, M., Als, T. D., Agerbo, E., … Bækvad-Hansen, M. (2019). Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nature Genetics, 51(1), 63–75.10.1038/s41588-018-0269-7CrossRef Google Scholar PubMed

Dong, Y., & Peng, C. J. (2013). Principled missing data methods for researchers. SpringerPlus, 2(1), 1–17.10.1186/2193-1801-2-222CrossRef Google Scholar PubMed

Drysdale, A. T., Grosenick, L., Downar, J., Dunlop, K., Mansouri, F., Meng, Y., … Etkin, A. (2017). Resting-state connectivity biomarkers define neurophysiological subtypes of depression. Nature Medicine, 23(1), 28–38.10.1038/nm.4246CrossRef Google Scholar PubMed

Fabbri, C., Hagenaars, S. P., John, C., Williams, A. T., Shrine, N., Moles, L., … Free, R. C. (2021). Genetic and clinical characteristics of treatment-resistant depression using primary care records in two UK cohorts. Molecular Psychiatry, 26(7), 3363–3373.10.1038/s41380-021-01062-9CrossRef Google Scholar PubMed

Fabbri, C., & Serretti, A. (2016). Genetics of long-term treatment outcome in bipolar disorder. Progress in Neuro-Psychopharmacology and Biological Psychiatry, 65, 17–24.10.1016/j.pnpbp.2015.08.008CrossRef Google Scholar PubMed

Feder, S., Sundermann, B., Wersching, H., Teuber, A., Kugel, H., Teismann, H., … Pfleiderer, B. (2017). Sample heterogeneity in unipolar depression as assessed by functional connectivity analyses is dominated by general disease effects. Journal of Affective Disorders, 222, 79–87.10.1016/j.jad.2017.06.055CrossRef Google Scholar PubMed

Fusar-Poli, L., Rutten, B. P., van Os, J., Aguglia, E., & Guloksuz, S. (2022). Polygenic risk scores for predicting outcomes and treatment response in psychiatry: Hope or hype? International Review of Psychiatry, 34(7–8), 663–675.10.1080/09540261.2022.2101352CrossRef Google Scholar PubMed

Gamazon, E. R., Wheeler, H. E., Shah, K. P., Mozaffari, S. V., Aquino-Michaels, K., Carroll, R. J., … Cox, N. J. (2015). A gene-based association method for mapping traits using reference transcriptome data. Nature Genetics, 47(9), 1091–1098.10.1038/ng.3367CrossRef Google Scholar PubMed

Glanville, K. P., Coleman, J. R., Howard, D. M., Pain, O., Hanscombe, K. B., Jermy, B., … O’reilly, P. F. (2021). Multiple measures of depression to enhance validity of major depressive disorder in the UK biobank. BJPsych Open, 7(2), e44.10.1192/bjo.2020.145CrossRef Google Scholar PubMed

Grove, J., Ripke, S., Als, T. D., Mattheisen, M., Walters, R. K., Won, H., … Anney, R. (2019). Identification of common genetic risk variants for autism spectrum disorder. Nature Genetics, 51(3), 431–444.10.1038/s41588-019-0344-8CrossRef Google Scholar PubMed

Hao, X., Talati, A., Shankman, S. A., Liu, J., Kayser, J., Tenke, C. E., … Weissman, M. M. (2017). Stability of cortical thinning in persons at increased familial risk for major depressive disorder across 8 years. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 2(7), 619–625.Google Scholar PubMed

Harald, B., & Gordon, P. (2012). Meta-review of depressive subtyping models. Journal of Affective Disorders, 139(2), 126–140.10.1016/j.jad.2011.07.015CrossRef Google Scholar PubMed

Hare, B. D., & Duman, R. S. (2020). Prefrontal cortex circuits in depression and anxiety: Contribution of discrete neuronal populations and target regions. Molecular Psychiatry, 25(11), 2742–2758.10.1038/s41380-020-0685-9CrossRef Google Scholar PubMed

Hellewell, S. C., Welton, T., Maller, J. J., Lyon, M., Korgaonkar, M. S., Koslow, S. H., … Grieve, S. M. (2019). Profound and reproducible patterns of reduced regional gray matter characterize major depressive disorder. Translational Psychiatry, 9(1), 176.10.1038/s41398-019-0512-8CrossRef Google Scholar PubMed

Howard, D. M., Adams, M. J., Clarke, T., Hafferty, J. D., Gibson, J., Shirali, M., … Wigmore, E. M. (2019). Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nature Neuroscience, 22(3), 343–352.10.1038/s41593-018-0326-7CrossRef Google Scholar PubMed

Jiang, H., Kulkarni, P. M., Mallinckrodt, C. H., Shurzinske, L., Molenberghs, G., & Lipkovich, I. (2017). Covariate adjustment for logistic regression analysis of binary clinical trial data. Statistics in Biopharmaceutical Research, 9(1), 126–134.10.1080/19466315.2016.1234973CrossRef Google Scholar

Kalisch, M., & Bühlman, P. (2007). Estimating high-dimensional directed acyclic graphs with the PC-algorithm. Journal of Machine Learning Research, 8(3), 613–636.Google Scholar

Kalisch, M., Mächler, M., Colombo, D., Maathuis, M. H., & Bühlmann, P. (2012). Causal inference using graphical models with the R package pcalg. Journal of Statistical Software, 47(11), 1–26.10.18637/jss.v047.i11CrossRef Google Scholar

Kamburov, A., Pentchev, K., Galicka, H., Wierling, C., Lehrach, H., & Herwig, R. (2011). ConsensusPathDB: Toward a more complete picture of cell biology. Nucleic Acids Research, 39(suppl_1), D712–D717.10.1093/nar/gkq1156CrossRef Google Scholar

Kamburov, A., Stelzl, U., Lehrach, H., & Herwig, R. (2013). The ConsensusPathDB interaction database: 2013 update. Nucleic Acids Research, 41(D1), D793–D800.10.1093/nar/gks1055CrossRef Google Scholar PubMed

Kessler, R. C., & Bromet, E. J. (2013). The epidemiology of depression across cultures. Annual Review of Public Health, 34, 119–138.10.1146/annurev-publhealth-031912-114409CrossRef Google Scholar PubMed

Kocabas, N. A., Antonijevic, I., Faghel, C., Forray, C., Kasper, S., Lecrubier, Y., … Noro, M. (2010). Dysbindin gene (DTNBP1) in major depressive disorder (MDD) patients: Lack of association with clinical phenotypes. The World Journal of Biological Psychiatry, 11(8), 985–990.10.3109/15622975.2010.512089CrossRef Google Scholar PubMed

Korhonen, K., Moustgaard, H., Tarkiainen, L., Östergren, O., Costa, G., Urhoj, S. K., & Martikainen, P. (2021). Contributions of specific causes of death by age to the shorter life expectancy in depression: A register-based observational study from denmark, finland, sweden and italy. Journal of Affective Disorders, 295, 831–838.10.1016/j.jad.2021.08.076CrossRef Google Scholar

Kuleshov, M. V., Jones, M. R., Rouillard, A. D., Fernandez, N. F., Duan, Q., Wang, Z., … Lachmann, A. (2016). Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Research, 44(W1), W90–W97.10.1093/nar/gkw377CrossRef Google Scholar PubMed

Lacerda, A. L., Nicoletti, M. A., Brambilla, P., Sassi, R. B., Mallinger, A. G., Frank, E., … Soares, J. C. (2003). Anatomical MRI study of basal ganglia in major depressive disorder. Psychiatry Research: Neuroimaging, 124(3), 129–140.10.1016/S0925-4927(03)00123-9CrossRef Google Scholar PubMed

Madley-Dowd, P., Hughes, R., Tilling, K., & Heron, J. (2019). The proportion of missing data should not be used to guide decisions on multiple imputation. Journal of Clinical Epidemiology, 110, 63–73.10.1016/j.jclinepi.2019.02.016CrossRef Google Scholar

Major Depressive Disorder Working Group of the Psychiatric GWAS Consortium. (2013). A mega-analysis of genome-wide association studies for major depressive disorder. Molecular Psychiatry, 18(4), 497–511.10.1038/mp.2012.21CrossRef Google Scholar

Matsuda, S., Ikeda, Y., Murakami, M., Nakagawa, Y., Tsuji, A., & Kitagishi, Y. (2019). Roles of PI3K/AKT/GSK3 pathway involved in psychiatric illnesses. Diseases, 7(1), 22.10.3390/diseases7010022CrossRef Google Scholar PubMed

Murray, G. K., Lin, T., Austin, J., McGrath, J. J., Hickie, I. B., & Wray, N. R. (2021). Could polygenic risk scores be useful in psychiatry?: A review. JAMA Psychiatry, 78(2), 210–219.10.1001/jamapsychiatry.2020.3042CrossRef Google Scholar PubMed

Nguyen, T., Harder, A., Xiong, Y., Kowalec, K., Hägg, S., Cai, N., … Lu, Y. (2022). Genetic heterogeneity and subtypes of major depression. Molecular Psychiatry, 27(3), 1667–1675.10.1038/s41380-021-01413-6CrossRef Google Scholar PubMed

Nievergelt, C. M., Maihofer, A. X., Klengel, T., Atkinson, E. G., Chen, C., Choi, K. W., … Gelernter, J. (2019). International meta-analysis of PTSD genome-wide association studies identifies sex-and ancestry-specific genetic risk loci. Nature Communications, 10(1), 1–16.10.1038/s41467-019-12576-wCrossRef Google Scholar PubMed

Pandya, M., Altinay, M., Malone, D. A., & Anand, A. (2012). Where in the brain is depression? Current Psychiatry Reports, 14(6), 634–642.10.1007/s11920-012-0322-7CrossRef Google Scholar PubMed

Pardiñas, A. F., Holmans, P., Pocklington, A. J., Escott-Price, V., Ripke, S., Carrera, N., … Hamshere, M. L. (2018). Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nature Genetics, 50(3), 381–389.10.1038/s41588-018-0059-2CrossRef Google Scholar PubMed

Pizzagalli, D. A., Holmes, A. J., Dillon, D. G., Goetz, E. L., Birk, J. L., Bogdan, R., … Fava, M. (2009). Reduced caudate and nucleus accumbens response to rewards in unmedicated individuals with major depressive disorder. American Journal of Psychiatry, 166(6), 702–710.10.1176/appi.ajp.2008.08081201CrossRef Google Scholar PubMed

Price, R. B., Gates, K., Kraynak, T. E., Thase, M. E., & Siegle, G. J. (2017). Data-driven subgroups in depression derived from directed functional connectivity paths at rest. Neuropsychopharmacology, 42(13), 2623–2632.10.1038/npp.2017.97CrossRef Google Scholar PubMed

Price, R. B., Lane, S., Gates, K., Kraynak, T. E., Horner, M. S., Thase, M. E., & Siegle, G. J. (2017). Parsing heterogeneity in the brain connectivity of depressed and healthy adults during positive mood. Biological Psychiatry, 81(4), 347–357.10.1016/j.biopsych.2016.06.023CrossRef Google Scholar PubMed

Ruderfer, D. M., Ripke, S., McQuillin, A., Boocock, J., Stahl, E. A., Pavlides, J. M. W., … Loohuis, L. M. O. (2018). Genomic dissection of bipolar disorder and schizophrenia, including 28 subphenotypes. Cell, 173(7), 1705–1715, e16.10.1016/j.cell.2018.05.046CrossRef Google Scholar

Silva dos Santos, J., Goncalves Cirino, J. P., de Oliveira Carvalho, P., & Ortega, M. M. (2021). The pharmacological action of kaempferol in central nervous system diseases: A review. Frontiers in Pharmacology, 11, 2143.10.3389/fphar.2020.565700CrossRef Google Scholar PubMed

Smith, L. A., Cornelius, V. R., Azorin, J. M., Perugi, G., Vieta, E., Young, A. H., & Bowden, C. L. (2010). Valproate for the treatment of acute bipolar depression: Systematic review and meta-analysis. Journal of Affective Disorders, 122(1–2), 1–9.10.1016/j.jad.2009.10.033CrossRef Google Scholar PubMed

Smyth, G. K. (2004). Linear models and empirical bayes methods for assessing differential expression in microarray experiments. Statistical Applications in Genetics and Molecular Biology, 3(1)10.2202/1544-6115.1027CrossRef Google Scholar PubMed

Stekhoven, D. J., & Bühlmann, P. (2012). MissForest—non-parametric missing value imputation for mixed-type data. Bioinformatics, 28(1), 112–118.10.1093/bioinformatics/btr597CrossRef Google Scholar PubMed

Stockmeier, C. A., & Rajkowska, G. (2022). Cellular abnormalities in depression: Evidence from postmortem brain tissue. Dialogues in Clinical Neuroscience, 6(2), 185–197.10.31887/DCNS.2004.6.2/cstockmeierCrossRef Google Scholar

Sun, J., Lu, J., Xu, T., & Bi, J. (2015). Multi-view sparse co-clustering via proximal alternating linearized minimization. International Conference on Machine Learning. PMLR, 757–766 .Google Scholar

Tibshirani, R., & Walther, G. (2005). Cluster validation by prediction strength. Journal of Computational and Graphical Statistics, 14(3), 511–528.10.1198/106186005X59243CrossRef Google Scholar

Torkamani, A., Wineinger, N. E., & Topol, E. J. (2018). The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics, 19(9), 581–590.10.1038/s41576-018-0018-xCrossRef Google Scholar PubMed

Van Loo, H. M., De Jonge, P., Romeijn, J., Kessler, R. C., & Schoevers, R. A. (2012). Data-driven subtypes of major depressive disorder: A systematic review. BMC Medicine, 10, 1–12.10.1186/1741-7015-10-156CrossRef Google Scholar PubMed

Vos, T., Lim, S. S., Abbafati, C., Abbas, K. M., Abbasi, M., Abbasifard, M., … Abdelalim, A. (2020). Global burden of 369 diseases and injuries in 204 countries and territories, 1990–2019: A systematic analysis for the global burden of disease study 2019. The Lancet, 396(10258), 1204–1222.10.1016/S0140-6736(20)30925-9CrossRef Google Scholar

Wang, X., Zhang, L., Xia, Z., Chen, J., Fang, Y., & Ding, Y. (2021). PTEN in prefrontal cortex is essential in regulating depression-like behaviors in mice. Translational Psychiatry, 11(1), 1–12.10.1038/s41398-021-01312-yCrossRef Google Scholar PubMed

Weng, L., Guo, X., Li, Y., Yang, X., & Han, Y. (2016). Apigenin reverses depression-like behavior induced by chronic corticosterone treatment in mice. European Journal of Pharmacology, 774, 50–54.10.1016/j.ejphar.2016.01.015CrossRef Google Scholar PubMed

Wise, T., Radua, J., Via, E., Cardoner, N., Abe, O., Adams, T. M., … de Azevedo Marques Périco, C. (2017). Common and distinct patterns of grey-matter volume alteration in major depression and bipolar disorder: Evidence from voxel-based meta-analysis. Molecular Psychiatry, 22(10), 1455–1463.10.1038/mp.2016.72CrossRef Google Scholar PubMed

Wong, M. L., Dong, C., Maestre-Mesa, J., & Licinio, J. (2008). Polymorphisms in inflammation-related genes are associated with susceptibility to major depression and antidepressant response. Molecular Psychiatry, 13(8), 800–812.10.1038/mp.2008.59CrossRef Google Scholar PubMed

Yin, L., Chau, C. K., Sham, P., & So, H. (2019). Integrating clinical data and imputed transcriptome from GWAS to uncover complex disease subtypes: Applications in psychiatry and cardiology. The American Journal of Human Genetics, 105(6), 1193–1212.10.1016/j.ajhg.2019.10.012CrossRef Google Scholar PubMed

Yin, L., Cheung, E. F., Chen, R. Y., Wong, E. H., Sham, P., & So, H. (2018). Leveraging genome-wide association and clinical data in revealing schizophrenia subgroups. Journal of Psychiatric Research, 106, 106–117.10.1016/j.jpsychires.2018.09.010CrossRef Google Scholar

Yu, C., Arcos-Burgos, M., Licinio, J., & Wong, M. L. (2017). A latent genetic subtype of major depression identified by whole-exome genotyping data in a Mexican-American cohort. Translational Psychiatry, 7(5), e1134.10.1038/tp.2017.102CrossRef Google Scholar

Yu, H., Chapman, B., Di Florio, A., Eischen, E., Gotz, D., Jacob, M., & Blair, R. H. (2019). Bootstrapping estimates of stability for clusters, observations and model selection. Computational Statistics, 34, 349–372.10.1007/s00180-018-0830-yCrossRef Google Scholar

Zhang, S., Wong, H., & Shen, Y. (2012). Generalized adjusted rand indices for cluster ensembles. Pattern Recognition, 45(6), 2214–2226.10.1016/j.patcog.2011.11.017CrossRef Google Scholar

Zhang, F., Peng, W., Sweeney, J. A., Jia, Z., & Gong, Q. (2018). Brain structure alterations in depression: Psychoradiological evidence. CNS Neuroscience & Therapeutics, 24(11), 994–1003.10.1111/cns.12835CrossRef Google Scholar PubMed

Figure 1. The workflow for the proposed invention in identifying disease subtypes.

Table 1. The number of selected features in each data view

Table 2. The 20 brain imaging features with the most significant differences (ranked by p-value) between the 2 discovered MDD subtypes

Figure 2. Comparison of selected brain imaging features for depression patients between 2 subtypes.

Table 3. Differentially expressed genes(DEGs) between the 2 depression subtypes, analyzed using limma (with gene expression in subgroup one as baseline)

Figure 3. Comparison of TRD status by subgroups for depression patients.

Figure 4. Comparison of admission frequencies by subgroups for depression patient.

Table 4. Comparison of extended PS derived from different solutions

Table 5. Enrichment analysis results for GWAS hits of depression

Table 6. Comparison of clustering solutions with varied minimum subgroup sizes

Yin et al. supplementary material

File 127.1 KB

Article contents

Integrating brain imaging features and genomic profiles for the subtyping of major depression

Abstract

Keywords

Information

Introduction

Method

A novel disease subtyping/patient stratification model

Data imputation

Feature selection by a causal approach

Disease subtyping

Validation

Permutation testing of prediction strength

Evaluation of minimum subgroup size on clustering analysis

Further analyses to evaluate the relevance of the subtype-defining genes

Application to depression patients

Results

Overview and identification of causal genes

Feature selection and data view composition

Identification of two distinct depression subgroups

Clinical validation: treatment resistance and psychiatric admissions

Prediction strength

Summary of evidence supporting distinct subgroups

GWAS enrichment analysis

Functional annotation and pathway enrichment

Drug enrichment analysis

Evaluation of smaller minimum subgroup sizes

Prediction strength

Bootstrap-based stability analysis

Differences in clinical outcomes

Sex-adjusted clustering analysis

Clustering with principal components as input

Discussion

Key findings and biological relevance

Limitations related to polygenic risk scores and clinical features

Strengths of the study

Comparison to other subtyping studies

Limitations

Conclusions

Supplementary material

Acknowledgements

Competing interests

References

Yin et al. supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests