PT - JOURNAL ARTICLE
AU - Moya, Raquel
AU - Wang, Xiaohan
AU - Tsien, Richard W.
AU - Maurano, Matthew T.
TI - Structural characterization of a polymorphic repeat at the CACNA1C schizophrenia locus
AID - 10.1101/2024.03.05.24303780
DP - 2024 Jan 01
TA - medRxiv
PG - 2024.03.05.24303780
4099 - http://medrxiv.org/content/early/2024/05/16/2024.03.05.24303780.short
4100 - http://medrxiv.org/content/early/2024/05/16/2024.03.05.24303780.full
AB - Genetic variation within intron 3 of the CACNA1C calcium channel gene is associated with schizophrenia and bipolar disorder, but analysis of the causal variants and their effect is complicated by a nearby variable-number tandem repeat (VNTR). Here, we used 155 long-read genome assemblies from 78 diverse individuals to delineate the structure and population variability of the CACNA1C intron 3 VNTR. We categorized VNTR sequences into 7 Types of structural alleles using sequence differences among repeat units. Only 12 repeat units at the 5′ end of the VNTR were shared across most Types, but several Types were related through a series of large and small duplications. The most diverged Types were rare and present only in individuals with African ancestry, but the multiallelic structural polymorphism Variable Region 2 was present across populations at different frequencies, consistent with expansion of the VNTR preceding the emergence of early hominins. VR2 was in complete linkage disequilibrium with fine-mapped schizophrenia variants (SNPs) from genome-wide association studies (GWAS). This risk haplotype was associated with decreased CACNA1C gene expression in brain tissues profiled by the GTEx project.
Our work suggests that sequence variation within a human-specific VNTR affects gene expression, and provides a detailed characterization of new alleles at a flagship neuropsychiatric locus.

Competing Interest Statement
The authors have declared no competing interest.

Funding Statement
This study was funded in part by grants from the National Institutes of Health.

Author Declarations
I confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained. Yes

The details of the IRB/oversight body that provided approval or exemption for the research described are given below: All data were publicly available at the start of the study: Long-read haplotype assemblies were downloaded from the HGSVC2 FTP site, the HPRC S3 bucket, and the T2T GitHub site. Summary statistics from the schizophrenia GWAS are available at the PGC site. Variant calls for HGSVC2 (Ebert et al. 2021) and 1000 Genomes individuals (Byrska-Bishop et al. 2022) are available through the 1000 Genomes FTP site. Ancestry information is available on IGSR. WGS alignment files for archaic human individuals can be found at the public FTP sites.

I confirm that all necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived, and that any patient/participant/sample identifiers included were not known to anyone (e.g., hospital staff, patients or participants themselves) outside the research group so cannot be used to identify individuals. Yes

I understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance). Yes

I have followed all appropriate research reporting guidelines, such as any relevant EQUATOR Network research reporting checklist(s) and other pertinent material, if applicable. Yes

All data produced in the present study are available upon reasonable request to the authors.

ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/HGSVC2/release/v1.0/assemblies
https://s3-us-west-2.amazonaws.com/human-pangenomics/index.html?prefix=working/
http://walters.psycm.cf.ac.uk/clozuk_pgc2.meta.sumstats.txt.gz
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/HGSVC2/release/v2.0/integrated_callset/variants_freeze4_snv_snv_alt.vcf.gz
http://ftp.1000genomes.ebi.ac.uk/vol1/ftp/data_collections/1000G_2504_high_coverage/working/20201028_3202_phased/CCDG_14151_B01_GRM_WGS_2020-08-05_chr12.filtered.shapeit2-duohmm-phased.vcf.gz
https://www.internationalgenome.org/data-portal/sample
http://cdna.eva.mpg.de/neandertal/altai/AltaiNeandertal/bam/
http://ftp.eva.mpg.de/neandertal/Chagyrskaya/BAM/
http://cdna.eva.mpg.de/denisova/alignments/
http://ftp.eva.mpg.de/neandertal/Vindija/bam/Pruefer_etal_2017/