Abstract
Multiple System Atrophy is a rare neurodegenerative disease with alpha-synuclein aggregation in glial cytoplasmic inclusions and either predominant olivopontocerebellar atrophy or striatonigral degeneration, leading to dysautonomia, parkinsonism, and cerebellar ataxia. One prior genome-wide association study in mainly clinically diagnosed patients with Multiple System Atrophy failed to identify genetic variants predisposing for the disease. Since the clinical diagnosis of Multiple System Atrophy yields a high rate of misdiagnosis when compared to the neuropathological gold standard, we studied common genetic variation in only autopsy-confirmed cases (N = 731) and controls (N = 2,898).
The most strongly disease-associated markers were rs16859966 on chromosome 3 (P = 8.6 × 10−7, odds ratio (OR) = 1.58, [95% confidence interval (CI) = 1.32-1.89]), rs7013955 on chromosome 8 (P = 3.7 × 10−6, OR = 1.8 [1.40-2.31]), and rs116607983 on chromosome 4 (P = 4.0 × 10−6, OR = 2.93 [1.86-4.63]), all of which were supported by at least one additional genotyped and several imputed single nucleotide polymorphisms with P-values below 5 × 10−5. The genes closest to the chromosome 3 locus are ZIC1 and ZIC4 encoding the zinc finger proteins of cerebellum 1 and 4 (ZIC1 and ZIC4).
Since mutations of ZIC1 and ZIC4 and paraneoplastic autoantibodies directed against ZIC4 are associated with severe cerebellar dysfunction, we conducted immunohistochemical analyses in brain tissue of the frontal cortex and the cerebellum from 24 Multiple System Atrophy patients. Strong immunohistochemical expression of ZIC4 was detected in a subset of neurons of the dentate nucleus in all healthy controls and in patients with striatonigral degeneration, whereas ZIC4 positive neurons were significantly reduced in patients with olivopontocerebellar atrophy.
These findings point to a potential ZIC4-mediated vulnerability of neurons in Multiple System Atrophy.
Introduction
Multiple System Atrophy (MSA) is a rapidly progressive rare neurodegenerative disease presenting with variable combinations of dysautonomia, parkinsonism and cerebellar ataxia[10]. Two forms of MSA can be clinically distinguished, characterized by either predominant parkinsonism (MSA-P) or predominant cerebellar symptoms (MSA-C)[12]. Its estimated prevalence is 3.4 to 4.9 cases per 100,000 individuals in the general population, and 7.8 cases per 100,000 in persons older than 40 years[32]. The mean survival time from disease onset is 6 to 10 years [16, 41]. Currently, only limited symptomatic treatments and no disease-modifying therapies are available[28].
The typical symptoms of MSA are caused by the progressive degeneration of neurons in different brain regions, particularly in the substantia nigra, striatum, inferior olivary nucleus, pons and cerebellum, but also other parts of the central nervous systems, emphasizing the multisystem character of MSA [12, 40]. The histological hallmarks in brains of MSA patients are glial cytoplasmic inclusions (GCIs, Papp-Lantos bodies) in oligodendrocytes containing aggregated and misfolded alpha-synuclein[14]. Neuropathologically, two subtypes can be distinguished, one with predominant olivopontocerebellar atrophy (OPCA), the other with mainly striatonigral degeneration (SND)[15, 26]. In addition, a mixed phenotype displaying features of both OPCA and SND is found in the brains of some patients[15, 26].
The pathogenesis of MSA is unclear. MSA is considered as a sporadic disease[11]. Epidemiological studies have investigated the influence of environmental factors in MSA, including exposure to farming-related factors (pesticides, solvents, mycotoxins, dust, fuels, oils, fertilizers, animals) and certain lifestyles (consumption of well water, rural living, diet and physical activity)[8, 23, 39]. Apart from a marginal effect of pesticides, no other environmental factors have been convincingly associated with an increased risk of developing MSA[8, 23, 39].
Hypothesis-driven candidate gene studies have been inconsistent with respect to variants that might be associated with MSA. Associations of MSA with the genes COQ2, SNCA, MAPT, and PRNP have been discussed[7, 20, 21, 29, 31, 33]. One prior genome-wide association study (GWAS) did not identify hits of statistical significance at a genome-wide level, despite the analysis of 918 cases and controls[30]. This GWAS had mainly included clinically diagnosed MSA cases. It needs to be stressed that clinical diagnosis is frequently not accurate in MSA. For example, a recent clinico-pathological study demonstrated a false positive diagnosis at autopsy in 38% of clinically diagnosed MSA patients[17].
In order to avoid inclusion of misdiagnosed patients in the GWAS described here, we only included autopsy-confirmed cases and appropriate ethnicity-matched controls.
Methods
Patient recruitment
Ethical approval had been obtained from all responsible ethics committees. All participants had given written consent.
Neuropathologists at each recruitment site (Supplementary material, Supplementary Table S1) based the definite neuropathological diagnosis of MSA on histopathological criteria taking into account GCIs immunoreactive for alpha-synuclein in characteristic anatomical distribution as defining feature of MSA[37]. Age, gender, disease history (including disease onset and duration) and neuropathological findings were recorded in standardized manner for all cases.
Controls were ethnically matched to cases and either derived from biobanks KORA-gen[42] or popGen[24] (Europe sites) or from North American site (Alzheimer’s Disease Genetics Consortium)[19]. The Alzheimer’s Disease Genetics Consortium assembled and genotyped DNA from subjects enrolled in the 29 NIA-Alzheimer’s Disease Centers located across the USA. For this study, the ADGC provided a subset of mostly clinical, cognitively normal controls. Patients and controls were of North-Western European and African American ancestry.
DNA extraction
We isolated DNA from 30 mg frozen cerebellar cortex using QIAamp DNA Mini Kit (Qiagen, Venlo, Netherlands). DNA extraction was performed at German Center for Neurodegenerative Diseases (DZNE), Munich Germany. DNA was stored at -80 °C until use. DNA concentration was measured using a NanoDrop Spectrophotometer. DNA quality was determined by gel electrophoresis.
Genotyping
All samples were genotyped on Infinium Global Screening Arrays (GSA, Illumina, San Diego, USA). The cases were genotyped at the Institute of Clinical Molecular Biology, Kiel University, Germany. The samples were genotyped in one batch on array version 2.0 for cases and version 1.0 for controls. Genotypes were called using Illumina Genome studio according to the manufacturers instructions using in-house cluster-files.
Quality control and imputation
We used PLINK (v. 1.9) [1] and R (v. 3.6.3)[35] for all analyses. Only variants successfully genotyped in both the patient and the control populations were included in the subsequent analyses. Variants with multi-character allele codes, insertions, deletions, duplicated markers and all A/T and G/C variants were excluded. We excluded all samples discordant between reported and genotypic sex. Missing sex was imputed and samples with ambiguously imputed sex were discarded. After a first step of filtering out samples and variants with call rate of less than 85%, we excluded variants with an individual call rate of less than 98% in a second filtering step. Next, we removed variants with a minor allele frequency (MAF) below 0.01, significant deviation from Hardy-Weinberg equilibrium (HWE; P < 1 × 10−6) in controls or informative missingness (P < 1 × 10−5). Subsequently, we excluded individuals with a variant call rate of less than 98% or an outlying heterozygosity rate (mean ± 3 standard deviations). We used a pruned dataset containing only markers in low linkage-disequilibrium (LD) regions (pairwise r2 < 0.2) to test for duplicated individuals and cryptic relatedness (Pihat > 0.125) using pairwise genome-wide estimates of the proportion of identity by descent. Of each detected sample pair we excluded the individual with a lower call rate. Ethnical outliers were identified by a principal component analysis (PCA) together with the publically available 1000 Genomes data with known ethnicities.[2] Since the study population has genetically a mainly European ancestry, as ascertained by the PCA, we determined a European center and excluded samples more than 1.5 times the maximal European Euclidean distance away from this center. After a first association analysis of genotyped single nucleotide polymorphisms (SNPs) only, we inspected visually the cluster plots of all variants with a P-value below 1 × 10−5 and discarded variants without adequate cluster separation.
Imputation was carried out on the quality assured dataset using the TOPMed Imputation Server, resulting in 271,284,138 available SNPs.[34] Variants were again filtered for MAF and deviation from HWE in controls with the same thresholds as before. Additionally, SNPs with an imputation quality score R2 less than 0.7 were excluded, leaving 8,131,900 variants for analyses. As final step of the quality control procedure, we employed the R package PCAmatchR to ethnically match cases to controls with a 1:4 ratio to overcome possible difficulties with population stratification, leading to 3240 individuals for the analyses.[6]
Association analysis
We used logistic regression to test the additive genetic model of each marker for association with disease status. Following scree plot analysis, we incorporated the first two dimensions of the PCA and sex as covariates. We used a genome-wide significance threshold of P < 5 × 10−8 and the threshold of P < 5 × 10−6 for suggestive association. Conditional analyses including in turn each SNP with a suggestive association as additional covariate were conducted to identify adjacent independent signals. Furthermore, we tested for clumps of correlated SNPs, i.e. to assess how many independent loci had been associated and determined the number of variants supporting the lead SNPs at each locus, i.e. variants that have a P-value < 5 × 10−5, are in LD (r2 ≥ 0.4) and not further than 250 kilobases (kb) away from the respective SNP. Visualization of the results was carried out with R and LocusZoom [27]for regional plots. Variant positions in this manuscript are reported on human genome version 38 (GRCh38/hg38).
Immunohistochemistry on MSA patients’ brain
Formalin-fixed and paraffin embedded (FFPE) tissue from MSA patients and controls without neurological or psychiatric diseases was obtained from the Neurobiobank Munich (NBM; Germany). All autopsy cases of the NBM were collected on the basis of an informed consent according to the guidelines of the ethics commission of the Ludwig-Maximilians-University Munich, Germany (# 345-13). MSA cases had been diagnosed according to established histopatological diagnostic criteria.[26, 37]
For ZIC4 immunohistochemistry, 5 µm thick sections of FFPE tissue of the frontal cortex and the cerebellar hemisphere including the dentate nucleus were prepared. After deparaffinization, heat-induced epitope retrieval was performed in Tris/EDTA, pH 9 at 95 °C for 30 minutes. For blocking of endogenous peroxidase and unspecific protein binding, the sections were incubated with 5% H2O2 in methanol for 20 minutes and I-Block reagent (Applied Biosystems, Waltham, MA, USA) for 15 minutes. Subsequently, ZIC4 primary antibody (rabbit, polyclonal, Merck/Sigma-Aldrich, Darmstadt, Germany) was applied over night at 4 °C at a dilution of 1:100. Signal detection was performed using the DCS ChromoLine DAB kit (DCS, Hamburg, Germany) according to the manufacturer’s instructions. Sections were counterstained for 1 minute with Mayer’s hemalum solution (Waldeck, Münster, Germany).
To determine the fractions of ZIC4 positive neurons of all neurons in the dentate nucleus, stained slides were scanned using a slide scanner (Axio Scan. Z1, Zeiss, Oberkochen, Germany) and visualized using the free ZEN lite software (v 3.3; Zeiss, Oberkochen, Germany). For statistical evaluation of the data, Student’s t-test was used, statistical significance was defined as P < 0.05.
Results
Patient sample
From the initial sample of 731 cases, 13 cases had to be excluded due to insufficient tissue quality. After thorough quality control and filtering, 648 cases and 2,592 controls covering 8,131,900 variants were included in the association analysis (Supplementary Fig. S1). The number of excluded samples and variants in each step of the quality control procedure is shown in Supplementary Tables S2 and S3.
Genotyping
We performed logistic regression incorporating sex and determined the first two dimensions of PCA as covariates using the scree plot method. The genomic inflation factor of λ = 1.01 (unimputed λ = 1.01, Supplementary Fig. S2) indicates that no significant population stratification was present (Fig. 1a). We did not identify any disease-associated variants with a P-value below the genome-wide significance threshold of P < 5 × 10−8, but suggestive associations with P < 5 × 10−6 at ten different loci (Fig. 1b) with the leading SNP at each locus shown in Table 1. Conditional analyses including in turn any SNP with P < 5 × 10−6 excluded the presence of multiple independent signals at each locus. All variants with suggestive associations are listed in Supplementary Table S4. The most noteworthy hits were rs16859966 on chromosome 3 (P = 8.6 × 10−7, odds ratio (OR) = 1.58, [95% confidence interval (CI) = 1.32-1.89]), rs7013955 on chromosome 8 (P = 3.7 × 10−6, OR = 1.8 [1.40-2.31]) and rs116607983 on chromosome 4 (P = 4.0 × 10−6, OR = 2.93 [1.86-4.63]), which were supported by at least one additional genotyped as well as serveal imputed SNPs with P-values below 5 × 10−5 as discovered in the clumping analysis (Table 1). The genes closest to the chromosome 3 locus are the Long Intergenic Non-Protein Coding RNA 2032 (LINC02032) approximately 100 kb downstream and the zinc finger proteins of cerebellum 1 and 4 genes (ZIC1, ZIC4), located roughly 600 kb upstream (Fig. 1c). The top SNP rs7013955 on chromosome 8 maps to the lysyl oxidase-like 2 gene (LOXL2; Fig. 1d). The association signal around SNP rs116607983 on chromosome 4 is located in a region devoid of protein-coding genes approximately 2000 kb to either side (Figure 1e). A fourth locus on chromosome 5 (rs2279135) was also supported by multiple clumped SNPs, but all SNPs including the lead SNP were imputed (Table 1). Several variants clumped at the chromosome 5 locus were located in the ARHGEF37 gene, coding for Rho Guanine Nucleotide Exchange Factor 37 (Fig. 1f). None of the identified SNPs is an expression quantitative trait locus (eQTL) in brain tissues according the Genotype Tissue Expression (GTEx) project.[9] At four of the six remaining loci with variants exhibiting suggestive associations at most two supporting SNPs were present, which were all imputed; in the other two loci no supporting SNPs could be found in the clumping analysis (Table 1, Supplementary Fig. S3). We did not investigate these loci further, because it is unlikely that they represent valid associations. No suggestive associations were detected with previously reported Parkinson’s disease associations from a meta-analysis of 17 datasets from Parkinson’s disease GWAS (Supplementary Table S5).[22]
ZIC4 immunohistochemistry on MSA patients’ brain
ZIC4 and ZIC1 are known to play a critical role in the embryonal development of the cerebellum. Heterozygous deletions comprising the ZIC1 and ZIC4 locus have been associated with the Dandy-Walker malformation, a rare congenital condition characterized by a hypoplastic cerebellar vermis and an enlarged fourth ventricle.[5, 13] In mice, deletions of ZIC1 and ZIC4 lead to a striking phenotype similar to the Dandy-Walker malformation with cerebellar hypoplasia and foliation defects.[5, 13] In addition, paraneoplastic autoantibodies against ZIC4 protein are linked to severe cerebellar dysfunction and degeneration.[3, 4]
Since cerebellar degeneration and corresponding symptoms are also a central hallmark of MSA, we decided to follow up on a potential role of ZIC4 in MSA patient brains by performing immunohistochemical stainings. For ZIC1 no primary antibody was appropriately sensitive and specific on human tissue in our hands. Thus, FFPE tissue of the frontal cortex and the cerebellum of MSA patients (N=10 SND, N=14 OPCA/mixed phenotype) and healthy controls (N=5) were stained with antibodies raised against ZIC4.
Nuclear and cytoplasmic staining of frontal cortex neurons was observed in all brains examined without differences between healthy controls and MSA patients (Fig. 2 a-c). In the cerebellar dentate nucleus, we found strong expression of ZIC4 in a subset of neurons in healthy controls as well as MSA patients with predominant SND (Fig. 2 d,e,g,h). In contrast, MSA patients with mixed subtype or OPCA showed reduced numbers of ZIC4-positive neurons, which were furthermore only weakly stained (Fig. 2 f, i). Quantification of the proportions of ZIC4-positive neurons among all dentate nucleus neurons revealed relatively constant proportions in healthy controls and patients with MSA-SND (33.2% ± 0.0 vs. 32.6% ±0.0), whereas in patients with MSA-OPCA or MSA-mixed phenotype we found significantly lower percentages of ZIC4-positive neurons (15.5% ± 0.1) (Fig. 2 j).
Discussion
As part of the study, brain banks were contacted worldwide and all available caucasian MSA brains were included. As in the prior GWAS with 918 predominantly clinically diagnosed MSA patients, our current GWAS of 648 autopsy-confirmed MSA patients, did not identify disease-associated common variants below the genome-wide significance threshold.
Since our prior GWAS of 219 patients with autopsy-confirmed corticobasal degeneration did identify significant disease-associated common variants, our current findings strongly suggest that the genetic contribution to disease risk is smaller in MSA.[18]
Nevertheless, our study reveals several suggestive associations at different loci, which may provide relevant hypotheses for follow-up investigations into the pathogenesis of MSA.
Specifically, we identified a variant on chromosome 3 (rs16859966, P = 8.6 × 10−7,, OR = 1.58 [1.32-1.89]) located upstream ZIC1 and ZIC4. ZIC1 and ZIC4 are located in close genomic proximity to each other and encode transcription factors highly expressed in different brain areas[1, 38].
Proper function of these proteins is critical for the development of the CNS, particularly the cerebellum.[5] Although no effect of rs16859966 on ZIC1 or ZIC4 expression is recorded in the GTEx database, rare genetic variants or deletions in ZIC1 or ZIC4 result in congenital cerebellar defects.[5, 13, 25] A heterozygous deletion of ZIC1 and ZIC4 causes the Dandy-Walker malformation, a developmental disorder of the cerebellum.[13, 36] Moreover, paraneoplastic autoantibodies against ZIC4 induce cerebellar degeneration.[4] Due to the pronounced cerebellar degeneration in MSA, we followed-up on a possible role of ZIC4 in MSA.
While we could detect a relatively constant proportion of approximately one third ZIC4-positive neurons among all neurons in the cerebellar dentate nucleus in healthy controls and patients with MSA-SND, cases with MSA-OPCA or the mixed MSA phenotype showed significantly lower fractions of ZIC4-positive neurons. This finding suggests that ZIC4 may be involved in the neurodegeneration in MSA. The involvement of ZIC4 mutations in the Dandy-Walker cerebellar malformation and the paraneoplastic ZIC4-autoantibody-associated cerebellar degeneration could suggest a pathomechanism in MSA, by which altered ZIC4 expression could increase neuronal vulnerability. Further analyses of a potential functional interaction of alpha-synuclein and ZIC4 are currently ongoing.
We would like to strongly encourage independent replication studies to confirm or refute the hypotheses provided by our study.
Data Availability
All data produced in the present work are contained in the manuscript
Supplementary Tables
Supplementary Figures
Acknowledgements
We would like to thank all those who contributed towards our research, particularly the patients and families who donated brain tissue - without their donation this study would not have been possible. We thank Vanessa Boll and Lena Jaschkowitz for excellent technical assistance.
The study received support from the Deutsche Parkinson Gesellschaft, the Else-Kröner-Fresenius-Stiftung, the CurePSP foundation, the Alzheimer’s Disease Genetics Consortium (ADGC) NIA grant U01AG032984, the National Alzheimer’s Coordinating Center (NACC) NIA grant U01 AG016976 and the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy within the framework of the Munich Cluster for Systems Neurology (EXC 2145 SyNergy – ID 390857198), the Hannover Cluster RESIST (EXC 2155 – ID 390874280) and the Kiel Cluster (EXC2167, Precision Medicine in Chronic Inflammation (PMI)), the DFG grant (HO2402/18-1), the VolkswagenStiftung (Niedersächsisches Vorab), the Petermax-Müller Foundation (Etiology and Therapy of Synucleinopathies and Tauopathies), and ERARE18-124 (MSA-omics) under the frame of E-Rare-3, the ERA-Net for Research on Rare Diseases.
Data for this study were prepared, archived, and distributed by the National Institute on Aging Alzheimer’s Disease Data Storage Site (NIAGADS) at the University of Pennsylvania (NIA grant U24-AG041689).
Samples from the National Cell Repository for Alzheimer’s Disease (NCRAD), which receives government support under a cooperative agreement grant (U24 AG21886) awarded by the National Institute on Aging (NIA), were used in this study. We thank contributors who collected samples used in this study, as well as patients and their families, whose help and participation made this work possible;
A part of the samples was collected at the Mayo Clinic. The Mayo Clinic is an American Parkinson Disease Association (APDA) Mayo Clinic Information and Referral Center, an APDA Center for Advanced Research, and is supported by a Lewy Body Dementia Center Without Walls U54NS110435 (to D.W.D. & O.A.R.). The brain bank was supported, in part, by the Mangurian Foundation Lewy Body Dementia Program at Mayo Clinic.
The London Neurodegenerative Diseases Brain Bank, King’s College London is supported by the MRC (Medical Research Council, UK) and the Brains for Dementia Research project (jointly funded by the Alzheimer’s Society and Alzheimer’s Research UK).
Data were contributed to this study by the Center on Alpha-synuclein Strains in Alzheimer Disease & Related Dementias at the University of Pennsylvania Perelman School of Medicine (U19 AG062418, Trojanowski JQ-PI) and the former Morris K. Udall Center at the University of Pennsylvania Perelman School of Medicine (P50 NS053488, Trojanowski JQ-PI), NIA P01-AG066597 and AG072979 (formerly AG010124).