TY - JOUR
T1 - Heterogeneity analysis provides evidence for a genetically homogeneous subtype of bipolar-disorder
AU - Bipolar Disorder Working Group of the Psychiatric Genomics Consortium
AU - McGrouther, Caroline C.
AU - Rangan, Aaditya V.
AU - Di Florio, Arianna
AU - Elman, Jeremy A.
AU - Schork, Nicholas J.
AU - Kelsoe, John
N1 - Publisher Copyright:
Copyright: © 2025 McGrouther et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2025
Y1 - 2025
N2 - BACKGROUND: Bipolar Disorder (BD) is a complex disease. It is heterogeneous, both at the phenotypic and genetic level, although the extent and impact of this heterogeneity is not fully understood. One way to assess this heterogeneity is to look for patterns in the subphenotype data. Because of the variability in how phenotypic data was collected by the various BD studies over the years, homogenizing this subphenotypic data is a challenging task, and so is replication. An alternative methodology, taken here, is to set aside the intricacies of subphenotype and allow the genetic data itself to determine which subjects define a homogeneous genetic subgroup (termed 'bicluster' below). RESULTS: In this paper, we leverage recent advances in heterogeneity analysis to look for genetically-driven subgroups (i.e., biclusters) within the broad phenotype of Bipolar Disorder. We first apply this covariate-corrected biclustering algorithm to a cohort of 2524 BD cases and 4106 controls from the Bipolar Disease Research Network (BDRN) within the Psychiatric Genomics Consortium (PGC). We find evidence of genetic heterogeneity delineating a statistically significant bicluster comprising a subset of BD cases which exhibits a disease-specific pattern of differential-expression across a subset of SNPs. This disease-specific genetic pattern (i.e., 'genetic subgroup') replicates across the remaining data-sets collected by the PGC containing 5781/8289, 3581/7591, and 6825/9752 cases/controls, respectively. This genetic subgroup (discovered without using any BD subtype information) was more prevalent in Bipolar type-I than in Bipolar type-II. CONCLUSIONS: Our methodology has successfully identified a replicable homogeneous genetic subgroup of bipolar disorder. This subgroup may represent a collection of correlated genetic risk-factors for BDI. By investigating the subgroup's bicluster-informed polygenic-risk-scoring (PRS), we find that the disease-specific pattern highlighted by the bicluster can be leveraged to eliminate noise from our GWAS analyses and improve risk prediction. This improvement is particularly notable when using only a relatively small subset of the available SNPs, implying improved SNP replication. Though our primary focus is only the analysis of disease-related signal, we also identify replicable control-related heterogeneity.
AB - BACKGROUND: Bipolar Disorder (BD) is a complex disease. It is heterogeneous, both at the phenotypic and genetic level, although the extent and impact of this heterogeneity is not fully understood. One way to assess this heterogeneity is to look for patterns in the subphenotype data. Because of the variability in how phenotypic data was collected by the various BD studies over the years, homogenizing this subphenotypic data is a challenging task, and so is replication. An alternative methodology, taken here, is to set aside the intricacies of subphenotype and allow the genetic data itself to determine which subjects define a homogeneous genetic subgroup (termed 'bicluster' below). RESULTS: In this paper, we leverage recent advances in heterogeneity analysis to look for genetically-driven subgroups (i.e., biclusters) within the broad phenotype of Bipolar Disorder. We first apply this covariate-corrected biclustering algorithm to a cohort of 2524 BD cases and 4106 controls from the Bipolar Disease Research Network (BDRN) within the Psychiatric Genomics Consortium (PGC). We find evidence of genetic heterogeneity delineating a statistically significant bicluster comprising a subset of BD cases which exhibits a disease-specific pattern of differential-expression across a subset of SNPs. This disease-specific genetic pattern (i.e., 'genetic subgroup') replicates across the remaining data-sets collected by the PGC containing 5781/8289, 3581/7591, and 6825/9752 cases/controls, respectively. This genetic subgroup (discovered without using any BD subtype information) was more prevalent in Bipolar type-I than in Bipolar type-II. CONCLUSIONS: Our methodology has successfully identified a replicable homogeneous genetic subgroup of bipolar disorder. This subgroup may represent a collection of correlated genetic risk-factors for BDI. By investigating the subgroup's bicluster-informed polygenic-risk-scoring (PRS), we find that the disease-specific pattern highlighted by the bicluster can be leveraged to eliminate noise from our GWAS analyses and improve risk prediction. This improvement is particularly notable when using only a relatively small subset of the available SNPs, implying improved SNP replication. Though our primary focus is only the analysis of disease-related signal, we also identify replicable control-related heterogeneity.
UR - http://www.scopus.com/inward/record.url?scp=85217357289&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85217357289&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0314288
DO - 10.1371/journal.pone.0314288
M3 - Article
C2 - 39879180
AN - SCOPUS:85217357289
SN - 1932-6203
VL - 20
SP - e0314288
JO - PloS one
JF - PloS one
IS - 1
ER -