Source code for "Overdispersed gene expression in schizophrenia" (2020), npj Schizophrenia. https://doi.org/10.1038/s41537-020-0097-5
Schizophrenia (SCZ) is a severe, highly heterogeneous psychiatric disorder with varied clinical presentations. The polygenic architecture of SCZ makes identifying causal variants a daunting task. Gene expression analyses hold the promise of revealing connections between dysregulated transcription and underlying variants in SCZ. However, the most commonly used differential expression analyses typically assume that grouped samples come from homogeneous populations, and thus cannot detect differences in expression variance between groups. Here, we applied the test for equality of variances to normalized expression data, generated by the CommonMind Consortium (CMC), from brains of 212 SCZ and 214 unaffected control (CTL) samples. We identified 87 genes, including VEGFA (vascular endothelial growth factor) and BDNF (brain-derived neurotrophic factor), that showed significantly higher expression variance among SCZ samples than among CTL samples. In contrast, only one gene showed the opposite pattern. To extend our analysis to gene sets, we proposed a Mahalanobis distance-based test for multivariate homogeneity of group dispersions, with which we identified 110 gene sets with significantly higher expression variability in SCZ, including sets of genes encoding the phosphatidylinositol 3-kinase (PI3K) complex and several others involved in cerebellar cortex morphogenesis, neuromuscular junction development, and cerebellar Purkinje cell layer development. Taken together, our results suggest that SCZ brains are characterized by overdispersed gene expression: overall gene expression variability among SCZ samples is significantly higher than that among CTL samples. Our study showcases the application of variability-centric analyses in SCZ research.
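To make the per-gene idea concrete, the following is a minimal Python sketch of one way to test for equality of variances between two groups of expression values. It is not the code in this repository and not necessarily the exact test used in the paper: it assumes a Brown-Forsythe (median-centred Levene) statistic and a permutation p-value, and the function names (`brown_forsythe_stat`, `permutation_pvalue`), the parameters `n_perm` and `seed`, and the toy data are all hypothetical.

```python
import random
import statistics


def brown_forsythe_stat(group_a, group_b):
    """Brown-Forsythe (median-centred Levene) statistic for two groups."""
    groups = [list(group_a), list(group_b)]
    # Absolute deviation of each value from its own group's median.
    devs = [[abs(x - statistics.median(g)) for x in g] for g in groups]
    n_total = sum(len(d) for d in devs)
    k = len(devs)
    grand_mean = sum(sum(d) for d in devs) / n_total
    group_means = [sum(d) / len(d) for d in devs]
    # One-way ANOVA F ratio computed on the absolute deviations.
    between = sum(len(d) * (m - grand_mean) ** 2
                  for d, m in zip(devs, group_means))
    within = sum((z - m) ** 2
                 for d, m in zip(devs, group_means) for z in d)
    if within == 0:
        return float("inf") if between > 0 else 0.0
    return (n_total - k) / (k - 1) * between / within


def permutation_pvalue(group_a, group_b, n_perm=500, seed=0):
    """Permutation p-value for the statistic above (an assumed choice)."""
    rng = random.Random(seed)
    observed = brown_forsythe_stat(group_a, group_b)
    # Centre each group on its median so that a difference in location
    # between the groups does not masquerade as a difference in spread.
    med_a = statistics.median(group_a)
    med_b = statistics.median(group_b)
    pooled = [x - med_a for x in group_a] + [x - med_b for x in group_b]
    n_a = len(group_a)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(pooled)
        if brown_forsythe_stat(pooled[:n_a], pooled[n_a:]) >= observed:
            hits += 1
    # Add-one correction keeps the estimated p-value strictly positive.
    return (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Synthetic values for one hypothetical gene (not CMC data):
    # same mean in both groups, larger spread in the "SCZ" group.
    rng = random.Random(1)
    ctl = [rng.gauss(0.0, 1.0) for _ in range(50)]
    scz = [rng.gauss(0.0, 3.0) for _ in range(50)]
    print("statistic:", brown_forsythe_stat(ctl, scz))
    print("p-value:", permutation_pvalue(ctl, scz))
```

In a real analysis this comparison would be repeated for every gene, followed by a multiple-testing correction; the sketch handles a single gene only and says nothing about which group has the larger variance, which would need to be checked separately.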