Abstract
Precise stratification of clinical patients into more homogeneous disease subgroups could address the heterogeneity of disease phenotypes and enhance our understanding on possible biological mechanisms and pathophysiology of more specified subtypes. This approach could promote individualized and effective prevention/intervention strategies. In the extant literature, subtyping of patients with depressive disorders (Dep) mainly utilized clinical features only. Genomics data could be useful subtyping features but advanced methods are needed for subtyping psychiatric entities such as depression. To solve this issue, we proposed a novel disease subtyping framework for complex diseases such as Dep. It combines brain structural features with genotype-predicted gene expression levels of relevant brain tissues as well as polygenic risk scores (PRS) of related disorders. It is able to classify patients into both clinically and biologically homogeneous subgroups, based on a multiview biclustering method. Moreover, causal inference was employed to identify causally relevant genes in different brain tissues to inform feature selection under the proposed framework. We verified the reliability of the subtyping model by internal and external validation. The calculated prediction strengths(PS) (average PS:0.896, min PS: 0.854) supported the robustness and generalizability of our proposed approach. External validation results demonstrated that our proposed approach could stratify Dep patients into subgroups with varied treatment responses and hospitalization risks. Besides, some subtype-defining genes in our study overlapped with several well-known susceptibility genes for depression and were involved in the pathophysiology for the disease. Encouragingly, many enriched drugs based on identified subtype-defining genes have been reported in previous studies to be effective in reducing depression-related symptoms.
Competing Interest Statement
The authors have declared no competing interest.
Funding Statement
This work was supported partially by an Innovation and Technology Fund (ITS/113/19), a National Natural Science Foundation China grant (81971706), the Lo Kwee Seong Biomedical Research Fund from The Chinese University of Hong Kong and the KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research of Common Diseases, Kunming Institute of Zoology and The Chinese University of Hong Kong, China.
Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.
Yes
The details of the IRB/oversight body that provided approval or exemption for the research described are given below:
The data were obtained from the UK Biobank (UKBB) with application no. 37268.
I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals.
Yes
I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).
Yes
I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable.
Yes
Data Availability
All data produced in the present study are available upon reasonable request to the authors.